Risk and Uncertainty
Risk and Uncertainty
.
ISBN: 978-1708284817
http://www.econ.ucdavis.edu/faculty/bonanno/
I intend to add, some time in the future, a further collection of exercises with detailed
solutions. Details will appear on my web page.
I am very grateful to Elise Tidrick for teaching me how to use spacing and formatting
in a (perhaps unconventional) way that makes it easier for the reader to learn the material.
I would like to thank Mathias Legrand for making the latex template used for this book
available for free (the template was downloaded from http://www.latextemplates.com/
template/the-legrand-orange-book ).
v.2 6-21
Contents
1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
I Insurance
2 Insurance: basic notions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
2.1 Uncertainty and lotteries 15
2.2 Money lotteries and attitudes to risk 16
2.3 Certainty equivalent and the risk premium 19
2.4 Insurance: basic concepts 21
2.5 Isoprofit lines 25
2.6 Profitable insurance requires risk aversion 30
2.6.1 Insuring a risk-neutral individual . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
2.6.2 Insuring a risk-averse individual . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
2.6.3 The profit-maximizing contract for a monopolist . . . . . . . . . . . . . . . . . . . . 32
2.6.4 Perfectly competitive industry with free entry . . . . . . . . . . . . . . . . . . . . . . 34
2.7 Exercises 36
2.7.1 Exercises for Section 2.2: Money lotteries and attitudes to risk . . . . . . . . . 36
2.7.2 Exercises for Section 2.3: Certainty equivalent and risk premium . . . . . . 38
2.7.3 Exercises for Section 2.4: Insurance: basic concepts . . . . . . . . . . . . . . . . 39
2.7.4 Exercises for Section 2.5: Isoprofit lines . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
2.7.5 Exercises for Section 2.6: Profitable insurance requires risk aversion . . . . 42
2.8 Solutions to Exercises 42
3 Expected Utility Theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
3.1 Expected utility: theorems 55
3.2 Expected utility: the axioms 63
3.3 Exercises 71
3.3.1 Exercises for Section 3.1: Expected utility: theorems . . . . . . . . . . . . . . . . . 71
3.3.2 Exercises for Section 3.2: Expected utility: the axioms . . . . . . . . . . . . . . . . 73
3.4 Solutions to Exercises 74
II Risk Sharing
6 Risk Sharing and Pareto Efficiency . . . . . . . . . . . . . . . . . . . . . . 165
6.1 Sharing an uncertain surplus 165
6.2 The Edgeworth box 167
6.3 Points of tangency 175
6.3.1 Risk averse Principal and risk neutral Agent . . . . . . . . . . . . . . . . . . . . . . . 175
6.3.2 Risk neutral Principal and risk averse Agent . . . . . . . . . . . . . . . . . . . . . . . 176
6.3.3 A general principle . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 178
6.3.4 Both parties risk averse . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 178
6.3.5 Both parties risk neutral . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181
6.3.6 Pareto efficiency for contracts in the interior of the Edgeworth box . . . 182
6.4 Pareto efficient contracts on the sides of the Edgeworth box 183
6.4.1 Risk averse Principal and risk neutral Agent . . . . . . . . . . . . . . . . . . . . . . . 183
6.4.2 Risk neutral Principal and risk averse Agent . . . . . . . . . . . . . . . . . . . . . . . 184
6.4.3 Both parties risk averse . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 186
6.5 The Edgeworth box when the parties have positive initial wealth 187
6.6 More than two outcomes 193
6.6.1 Risk-neutral Principal and risk-averse Agent . . . . . . . . . . . . . . . . . . . . . . . 194
6.6.2 Risk-averse Principal and risk-neutral Agent . . . . . . . . . . . . . . . . . . . . . . . 197
6.6.3 Both parties risk neutral . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 197
6.6.4 Both parties risk averse . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 198
6.7 Exercises 199
6.7.1 Exercises for Section 6.1: Sharing an uncertain surplus . . . . . . . . . . . . . . 199
6.7.2 Exercises for Section 6.2: The Edgeworth box . . . . . . . . . . . . . . . . . . . . . . 199
6.7.3 Exercises for Section 6.3: Points of tangency . . . . . . . . . . . . . . . . . . . . . . 201
6.7.4 Exercises for Section 6.4: Pareto efficient contracts on the sides of the
Edgeworth box . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 204
6.7.5 Exercises for Section 6.5: The Edgeworth box when the parties have positive
initial wealth . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 206
6.7.6 Exercises for Section 6.6: More than two outcomes . . . . . . . . . . . . . . . . 207
6.8 Solutions to Exercises 209
V Moral Hazard
10 Moral Hazard in Insurance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 343
10.1 Moral hazard or hidden action 343
10.2 Moral hazard and insurance 344
10.2.1 Two levels of unobserved effort . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 345
10.2.2 The reservation utility locus . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 348
10.2.3 The profit-maximizing contract for a monopolist . . . . . . . . . . . . . . . . . . . 354
10.3 Exercises 359
10.3.1 Exercises for Section 10.2.1: Two levels of unobserved effort . . . . . . . . . 359
10.3.2 Exercises for Section 10.2.2: The reservation utility locus . . . . . . . . . . . . . 361
10.3.3 Exercises for Section 10.2.3: The profit-maximizing contract . . . . . . . . . . 362
10.4 Solutions to Exercises 363
12 Glossary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 409
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 413
1. Introduction
This book offers an introduction to the economic analysis of uncertainty and information.
Life is made up of a never-ending sequence of decisions. Many decisions – such as what to watch on
television or what to eat for breakfast – do not have major consequences. Other decisions – such as whether
or not to invest all of one’s savings in the purchase of a house, or whether to purchase earthquake insurance –
can have a significant impact on one’s life. We will concern ourselves with decisions that potentially have a
considerable impact on the wealth of the individual in question.
Most of the time the outcome of a decision is influenced by external factors that are outside the decision
maker’s control, such as the side effects of a new drug, or the future price of real estate, or the occurrence
of a natural phenomenon (such as a flood, or a fire, or an earthquake). While one is typically aware of the
existence of such external factors, as the saying goes “It is difficult to make predictions, especially about
the future”.1 Most decisions are shrouded in uncertainty and this book is about how uncertainty affects the
actions and decisions of economic agents.
We begin by examining, in Chapter 2, what explains the existence and profitability of insurance markets.
For this we simply appeal to the definition of risk aversion, without the need for the full power of expected
utility theory.
Chapter 3 develops the Theory of Expected Utility, which is central to the rest of the book.
In Chapter 4 we use the theory of expected utility to re-examine the notion of attitude to risk (risk
aversion, risk neutrality and risk love), discuss how to measure the degree of risk aversion of an individual
and develop a test for determining when, of two alternative risky prospects, one can unambiguously be
labeled as being more risky than the other.
With the help of expected utility theory, in Chapter 5 we study the demand side of insurance markets.
We then put together the analysis of the supply side of insurance, developed in Chapter 2, with the analysis
of the demand side, to determine the equilibrium of an insurance industry under two opposite scenarios: the
case where the industry is a monopoly and the case where there is perfect competition with free entry.
In Chapter 6 we address the issue of efficient risk sharing. We consider the case of an individual,
referred to as “the Principal”, who is contemplating hiring another individual, referred to as “the Agent”, to
perform a task, whose outcome is uncertain (because it is affected by external factors). We consider all the
possible forms of payment to the Agent (e.g. a fixed wage or a payment contingent on the outcome) and ask
1 This saying is often attributed to the physicist Niels Bohr, but apparently it is an old Danish proverb.
12 Chapter 1. Introduction
what contracts are Pareto efficient, in the sense that there is no other contract that they both prefer. We study
how Pareto efficiency relates to the optimal way of allocating risk between the two parties to the contract.
In Chapters 7-9 we turn to the issue of asymmetric information. It is often the case that one of the
two parties to a contract has more information than the other party about aspects of the transaction that are
relevant to both parties. For example, the seller of a used car has knowledge about the quality of the car
that the potential buyer cannot easily acquire before the purchase, or a job applicant knows more about
herself than the potential employer can find out from an interview. Asymmetric information can manifest
itself in different forms. One type of asymmetric information gives rise to the phenomenon of “adverse
selection”, which is studied in Chapters 7 and 8. Chapter 7 deals with the general phenomenon of adverse
selection, with particular focus on the market for used durable goods, while Chapter 8 is devoted to the
analysis of adverse selection in insurance markets. This is the situation where there are different types of
individuals, with different propensities to incur losses, and – while each individual knows his or her own
type – the insurance company does not. We study how the asymmetry of information affects the decisions
of the suppliers of insurance and re-examine the conditions for an equilibrium in the two types of industry
structure examined in Chapter 5, namely monopoly and perfect competition.
In Chapter 9 we study another phenomenon that arises in the context of asymmetric information, namely
the phenomenon of “signaling”. Signaling refers to the attempt by the informed party to credibly convey
information to the uninformed party. When the latter is uncertain about the characteristics, or “type”, of
individual he/she is about to sign a contract with, he/she might offer contractual terms that are unappealing to
some individuals, thereby creating an incentive for the “better” types to engage in costly activities that allow
them to “separate themselves” from the worse types and to credibly convey information about themselves.
While the asymmetric information studied in Chapters 7-9 is also referred to as “hidden type”, the
informational asymmetry studied in Chapters 10 and 11 is called “hidden action” or “moral hazard”. It refers
to situations where what cannot be observed by one of the two parties to a contract is not the type of the other
party, but his/her behavior. When such behavior has an effect on the outcome, it becomes important for the
uninformed party to design the contract in such a way that it creates an incentive for the other party to act in a
“desirable” way. For example, in the case of insurance, the probability that the insured individual will face a
loss – and thus apply for a reimbursement from the insurance company – may be affected by the behavior of
the individual, in particular by the effort and care exerted in loss prevention. In such a situation the insurance
company might want to offer only insurance contracts that will create an incentive for the insured to exert
appropriate effort towards reducing the probability of loss. Chapter 10 deals with the phenomenon of moral
hazard in insurance, while Chapter 11 revisits the Principal-Agent relationships studied in Chapter 6 and
analyses the effect of moral hazard in that context.
Whenever possible, throughout the book we have tried to illustrate the relevant concepts graphically in
two-dimensional diagrams. The book is richly illustrated with approximately 150 figures and tables.
At the end of each section of each chapter the reader is invited to test his/her understanding of the
concepts introduced in that section by attempting several exercises. In order not to break the flow of the
exposition, the exercises are collected in a section at the end of the chapter. Complete and detailed answers
for each exercise are given in the last section of each chapter. In total, the book contains more than 150 fully
solved exercises. Attempting to solve the exercises is an integral part of learning the material covered in this
book.
The book was written in a way that should be accessible to anyone with minimum knowledge of calculus,
in particular the ability to calculate the (partial) derivative of a function of one or two variables.
This book does not necessarily follow conventional formatting standards. Rather, the intention was to
break each argument into clearly outlined steps, highlighted by appropriate spacing.
I
Insurance
Throughout this book we will represent the uncertainty facing a decision-maker in terms of lotteries.1
This assumes that the decision-maker is always able to assign probabilities to the possible outcomes. We
interpret these probabilities either as “objective” probabilities, obtained from relevant past data, or as
“subjective” estimates by the individual. For example, an individual who is considering whether or not to
insure her bicycle against theft, knows that there are two relevant basic outcomes: either the bicycle will be
stolen or it will not be stolen. Furthermore, she can look up data on past bicycle thefts in her area and use the
proportion of bicycles that were stolen as an objective estimate of the probability that her bicycle will be
stolen; alternatively, she can use a more subjective estimate: for example she might use a lower probability
of theft than suggested by the data, because she knows herself to be very conscientious and – unlike other
people – to always lock her bicycle when left unattended.
In this chapter we will focus on lotteries where the outcomes are sums of money. More general lotteries
will be considered in Chapter 3.
We assume that the individual in question is able to rank any two money lotteries. For
example,
if asked
$400
to choose between getting $400 for sure, which can be viewed as the degenerate lottery , and
1
$900 $0
the lottery2 1 1 , she will be able to tell us if she prefers one lottery to the other or is indifferent
2 2
between the two. In general, there is no “right answer” to this question, as there is no right answer to the
question “do you prefer coffee or tea?”: it is a matter of individual taste.
$x1 $x2 ... $xn
Definition 2.2.2 Given a money lottery L = , its expected value is the
p1 p2 ... pn
number E[L] = x1 p1 + x2 p2 + ... + xn pn .
Definition 2.2.3 Let L be a non-degenerate money lottery (that is, a money lottery where at least
two different outcomes are assigned positive probability)a and consider the choice between L and the
degenerate lottery
$E[L]
1
(that is, the choice between facing the lottery L or getting the expected value of L with certainty).
Then
1 For
some analysis of decision-making in situations where the individual is not able to assign probabil-
ities to the outcomes see my book Decision Making (http://faculty.econ.ucdavis.edu/faculty/
bonanno/DM_Book.html ).
2 We can think of this lottery as tossing a fair coin and then giving the individual $900 if it comes up
• An individual who prefers $E[L] for certain to L is said to be risk averse (relative to L).
• An individual who is indifferent between $E[L] for certain and L is said to be risk neutral (relative
to L).
• An individual who prefers L to $E[L] for certain is said to be risk loving or risk seeking (relative
to L).
a A money lottery $x1 $x2 ... $xn
is non-degenerate if, for all i = 1, 2, ..., n, pi < 1.
p1 p2 ... pn
3 That is, if she considers lottery A to be at least as good as lottery B and she considers lottery B to be at
least as good as lottery C then she considers A to be at least as good as C.
18 Chapter 2. Insurance: basic notions
L1 ∼ $45 $43 ∼ L2 .
Thus, by transitivity, L1 L2 (see Exercises 2.10-2.13).
On the other hand, knowing that an individual is risk averse relative to every money lottery, has transitive
preferences over money lotteries and prefers more money to less, is not sufficient to predict how she will
choose between two arbitrary money lotteries. For!example, as we will see in Chapter 3, it is possible that one
!
$28 $10 $50
risk-averse individual will prefer L3 = (whose expected value is 28) to L4 = 1 1
1 2 2
(whose expected value is 30), while another risk-averse individual will prefer L4 to L3 .
Similarly, knowing that an individual is risk loving relative to every money lottery, has transitive
preferences over money lotteries and prefers more money to less, is not sufficient to predict how she will
choose between two arbitrary money lotteries.
R Note that “rationality” does not, and should not, dictate whether an individual should
be risk neutral, risk averse or risk loving: an individual’s attitude to risk is merely
a reflection of that individual’s preferences. It is a generally accepted principle
that de gustibus non est disputandum (in matters of taste, there can be no disputes).
According to this principle, there is no such thing as an irrational preference and thus
there is no such thing as an irrational attitude to risk.
From an empirical point of view, however, most people reveal through their choices
(e.g. the decision to buy insurance) that they are risk averse, at least when the stakes
are sufficiently high. It is also possible (as we will see in Chapter 4) for an individual
to have different attitudes to risk, depending on how high the stakes are (e.g. an
individual might display risk aversion, by purchasing home insurance, as well as risk
love, by purchasing a lottery ticket).
4 Thus
A B means that the individual prefers A to B and A ∼ B means that the individual is indifferent
between A and B.
2.3 Certainty equivalent and the risk premium 19
notation: interpretation:
L1 L2 the individual prefers L1 to L2
L1 ∼ L2 the individual is indifferent between L1 and L2
L1 % L2 the individual considers L1 to be at least as good as L2 ,
that is, either L1 L2 or L1 ∼ L2 .
We shall assume that the individual is able to rank any two lotteries (her preferences are complete) and her
ranking is transitive:
We shall also assume throughout that the individual prefers more money to less, that is,
! !
$x $y
if and only if x > y. (2.1)
1 1
Suppose that, for every money lottery L there is a sum of money, denoted by CL , that the individual
considers to be just as good as the lottery L; then we call CL the certainty equivalent of lottery L for that
individual.
Definition 2.3.1 The certainty equivalent of a money lottery L is that sum of money CL such that
$x1 ... $xn $CL
L= ∼
p1 ... pn 1
Typically, the certainty equivalent of a given money lottery will be different for different individuals. However,
all risk-neutral individuals will share the same certainty equivalent; in fact, it follows from the definition of
risk neutrality (Definition 2.2.3) that
• for a risk-neutral individual, the certainty equivalent of a money lottery L coincides with the expected
value of L:
CL = E[L].
On the other hand, for a risk-averse individual (who, furthermore, prefers more money to less and whose
preferences are complete and transitive) the certainty equivalent of a money lottery will be less than the
expected value:
CL < E[L].
5 InExercises 2.10-2.13 the reader is asked to prove that transitivity of the “at least as good” relation
implies transitivity of strict preference and of indifference.
20 Chapter 2. Insurance: basic notions
$E[L]
In fact, by definition of risk aversion, L and, by definition of certainty equivalent, L ∼
1
$CL $E[L] $CL
. Thus, by transitivity, ; hence, by (2.1),
1 1 1
E[L] > CL . Similarly,
CL > E[L].
From the notion of certainty equivalent we derive another notion which can be used to compare the
degree of risk aversion across individuals.
Definition 2.3.2 The risk premium of a money lottery L, denoted by RL , is the amount by which the
expected value of L can be reduced to induce indifference between the lottery itself and the reduced
amount for certain:
$x1 ... $xn $(E[L] − RL )
L= ∼
p1 ... pn 1
It follows from Definitions 2.3.1 and 2.3.2 that CL = E[L] − RL or, equivalently,
RL = E[L] −CL .
Thus, for a risk-neutral individual the risk premium is zero, while for a risk-averse individual the risk
premium is positive (and for a risk-loving individual the risk premium is negative). Furthermore, we can
label a risk-averse individual as more risk-averse (relative to lottery L) than another risk-averse individual
if the risk premium for the former is larger than the risk premium for the latter. In fact, the risk premium
can be interpreted as the price (relative to the expected value) that the individual is willing to pay to avoid
facing lottery L. For example, consider three individuals: Ann, Bob and Carla. They all have the same initial
wealth $6, 000 and they are facing the lottery L where with probability !50% their wealth is wiped out and
$0 $12, 000
with probability 50% their wealth is doubled: L = 1 1
. Suppose that Ann’s risk premium
2 2
for this lottery is RAnn Bob
L = 900, Bob’s is RL = 500 and Carla’s is RL
Carla = 0. Then Ann and Bob are risk
averse and Ann is more risk averse than Bob, while Carla is risk neutral. Ann would be willing to pay up to
$900 (thus reducing her wealth from $6, 000 to $5, 100) in order to avoid the lottery L, while Bob would
only be willing to pay up to $500 (thus reducing his wealth from $6, 000 to $5, 500) in order to avoid the
lottery L; on the other hand, Carla would not be willing to pay any amount of money to avoid L, since she is
indifferent between keeping her initial wealth of $6, 000 and playing lottery L.
Insurance markets are a good example of situations where uncertainty can be represented by means of money
lotteries.
Consider an individual who has an initial wealth of $W0 > 0 and faces the possibility of a loss in the
amount of $` (0 < ` ≤ W0 ) with probability p (0 < p < 1). For example, it could be an individual who owns
a plot of land worth $80, 000 and a house built on it worth $220, 000 (so that W0 = 80, 000 + 220, 000 =
300, 000). She is worried about the possibility of a fire destroying the house (thus ` = 220, 000) and,
according to publicly available data, the probability of this happening in her area is 2% (thus p = 0.02). An
insurance company offers her a contract and she has to decide whether or not to purchase that contract. An
insurance contract is typically expressed in terms of two numbers: the premium, which we will denote by h,
and the deductible, which we will denote by d. The premium can be thought of as the price of the contract:
it is paid no matter whether the loss is incurred or not. The deductible is the portion of the loss that is not
covered. If d = 0 we say that the contract offers full insurance, while if d > 0 then we say that the contract
offers partial insurance:
In the above example, if the deductible is $40, 000 then, if the loss occurs, the insurance company makes a
payment to the insured in the amount of $(` − d) = $(220, 000 − 40, 000) = $180, 000 (and, of course, if the
loss in not incurred then the insurance company does not make any payments to the insured).
22 Chapter 2. Insurance: basic notions
The following table summarizes the notation used in this book in the context of insurance:
W0 initial wealth
` potential loss
p probability of loss
h premium
d deductible
`−d insured amount
(h, d) insurance contract.
It will be useful to represent the initial situation and possible insurance contracts graphically. We shall
do so by using wealth diagrams where, on the horizontal axis, we represent the individual’s wealth if a loss
occurs, denoted by W1 , and, on the vertical axis, the individual’s wealth if there is no loss, denoted by W2 ;
we shall also refer to the former as wealth in the bad state and to the latter as wealth in the good state. The
no-insurance situation can be represented in the wealth diagram as the point NI = (W0 − `,W0 ), as shown in
Figure 2.1.
0 W1
W0 − ` wealth in bad state
Figure 2.1: The no-insurance point (NI) and the set of possible insurance contracts (the
shaded triangle).
The purpose of an insurance contract is to protect the individual in case she experiences
a loss: thus an insurance contract can be thought of as a point in the diagram where the
horizontal coordinate is larger than W0 − ` (which is the individual’s wealth in the bad
state if she does not insure), while the vertical coordinate is smaller than W0 because of the
premium. The set of possible insurance contracts (encoded in terms of the corresponding
wealth levels for the individual, in the bad state and in the good state), is shown in Figures
2.1 and 2.2 as a shaded triangle. The “45o line”– which is the line out of the origin with an
2.4 Insurance: basic concepts 23
angle of 45o – is the set of points (W1 ,W2 ) such that W1 = W2 . As we will see below, the
points on the 45o line represent full-insurance contracts.6
How do we translate an insurance contract (h, d), expressed in terms of premium h and
deductible d, into a point in the (W1 ,W2 ) diagram? If the individual purchases contract
(h, d) then she pays the premium h in any case (that is, whether or not she incurs a loss) and
thus her wealth in the good state is equal to W2 = W0 − h; the premium reduces her wealth
also in the bad state, but in this state there is a further reduction due to the deductible, so
that W1 = W0 − h − d = W2 − d. Conversely, given a contract expressed as a point (W1 ,W2 )
we can recover the premium and deductible as follows: h = W0 −W2 and d = W2 −W1 . It
is clear from this that d = 0 if and only if W1 = W2 , that is, if and only if the point lies on
the 45o line.
NI 45o line
5,000
A
4,500
B
4,100
0 W1
3,000 3,900 4,100 wealth in bad state
In Figure 2.2 three points are shown: the no-insurance point NI = (3, 000, 5, 000) and
two possible insurance contracts: A = (W1A = 3, 900, W2A = 4, 500) and
B B
B = (W1 = 4, 100, W2 = 4, 100). From NI we deduce that
W0 = 5, 000 and ` = 5, 000 − 3, 000 = 2, 000.
Let hA denote the premium of contract A and dA the deductible; then
hA = W0 −W2A = 5, 000 − 4, 500 = 500 and dA = W2A −W1A = 4, 500 − 3, 900 = 600.
6 Expressed in terms of premium and deductible, the set of insurance contracts is {(h, d) : h > 0, d ≥
0, h + d < `} ∪ {(0, `)}. We have added the trivial contract with h = 0 and d = ` for convenience: it is
equivalent to no insurance. Expressed in terms of wealth levels, the set of insurance contracts is {(W1 ,W2 ) :
W0 − ` < W1 ≤ W2 < W0 } ∪ {(W0 − `,W0 )}.
24 Chapter 2. Insurance: basic notions
Similarly, let hB denote the premium of contract B and dB the deductible; then
Figure 2.3 shows how to view the premium and deductible corresponding to a contract
A = (W1A ,W2A ).
W2
NI 45o line
W0
premium hA
A
W0 − hA
deductible dA
W0 − hA − dA
0 W0 − ` W0 − hA − dA
W1
Figure 2.3: The graphical representation of the premium hA and the deductible dA corre-
sponding to a contract A = (W1A ,W2A ).
As shown in Figure 2.1, there are many potential insurance contracts (the points in the
shaded triangle). Will an insurance company be willing to offer any of them? Would an
individual be willing to accept any of them? The first question has to do with the incentives
of the supplier of contracts (the insurer), while the second question has to do with the
incentives of the potential customer (the insured).
We will address the first question in the next sections and postpone a full analysis of
the second question to Chapter 4.
Definition 2.5.1 A line in the (W1 ,W2 ) plane joining all the contracts that give rise to
the same expected profit is called an isoprofit line.
p
We want to show that an isoprofit line is a downward-sloping straight line with slope − 1−p .
Let A = W1A ,W2A and B = W1B ,W2B be two contracts that yield the same expected profit,
that is,
Deleting W0 − p` from both sides of the equation and rearranging the terms we get
Figure 2.4 shows an isoprofit line and two contracts, A and B, on this line.
(probability 1 − p)
W2
Thus
Consider two insurance contracts A = W1A ,W2A and B = W1B ,W2B . From the point
of view of the potential buyer, these two contracts correspond to the wealth lotteries
W0 − hA − dA W0 − hA W0 − hB − dB W0 − hB
A= and B =
p 1− p p 1− p
where hA = W0 −W2A is the premium of contract A and dA = W2A −W1A is the deductible of
contract A and, similarly, hB = W0 −W2B and dB = W2B −W1B . Let
be the expected profits from contracts A and B, respectively. We want to show that
that is, A and B lie on the same isoprofit line if and only if the two wealth lotteries A and B
have the expected value. In fact,
Thus E[A] = E[B] if and only if hA + pdA = hB + pdB if and only if (subtracting p` from
both sides)
hA + pdA − p` = hB + pdB − p`
| {z } | {z }
=π(A) =π(B)
For each point in the (W1 ,W2 ) plane there is an isoprofit line that goes through that
p
point. Hence the plane is filled with parallel isoprofit lines (each with slope − 1−p ).
Let A = W1A ,W2A and B = W1B ,W2B be two insurance contracts and suppose that B lies
below the isoprofit line that goes through A, as shown in Figure 2.5, so that π(B) 6= π(A).
Does B represent a contract that yields higher or lower profits than A? In other words, if
the isoprofit line through one contract is below the isoprofit line through another contract,
which of the two lines corresponds to a higher level of profit?
W2
isoprofit line
A 45o line
C
B D
isoprofit
line
0 W1
W0 − hD
W0 − hC
As shown in Figure 2.5, let C be the full-insurance contract that lies on the isoprofit
line through A and D the full-insurance contract that lies on the isoprofit line through
B. Then π(A) = π(C) and π(B) = π(D). Thus, if we show that π(D) > π(C) then it
follows that π(B) > π(A). The proof that π(D) > π(C) is straightforward: contract D
28 Chapter 2. Insurance: basic notions
W2
direction of
increasing profits
π3 > π2 > π1
π1
π2 isoprofit
π3 lines
W1
0
Among the isoprofit lines there is one which is of particular interest, namely the zero-
profit line, that is, the line that joins all the contracts that yield zero profits. Like all the
p
other isoprofit lines, this is a straight line with slope − 1−p . Furthermore, it goes through
the no-insurance point NI; in fact, NI can be thought of as a trivial contract with zero
premium and full deductible: such a contract obviously involves zero profits because the
insurance company receives no payment (hNI = 0) and makes no payment (dNI = ` so that
` − dNI = 0).
8 In
Exercise 2.21 the reader is asked to give an alternative proof of the fact that, if contract B lies below
the isoprofit line through contract A, then π(B) > π(A).
2.5 Isoprofit lines 29
The zero-profit line is shown in Figure 2.7. Points above the line represent contracts that
yield negative profits and points below the line represent contracts that yield positive profits.
W2
zero-profit line
NI C 45o line
W0 B
A
π(B) = 0
π(A) > 0
π(C) < 0
0 W1
W0 − `
For the potential customer, such a contract corresponds to the wealth lottery
W0 − h W0 − h − d
A=
1− p p
whose expected value is
By risk neutrality, the individual is indifferent between NI and E[NI] for sure and is also
indifferent between A and E[A] for sure. Assuming that the individual prefers more money
to less, by (2.6) she prefers E[NI] for sure to E[A] for sure: denoting, as before, indifference
by ∼ and strict preference by , we can write this as
NI ∼ E[NI] E[A] ∼ A.
NI A,
that is, the individual strictly prefers not insuring to purchasing contract A. Hence it is not
possible for an insurance company to make positive profits by selling insurance contracts
to risk-neutral individuals: the individuals will simply not buy the offered insurance
contracts.
Although it is intuitively clear that also a risk-loving individual would reject any
insurance contract that would yield non-negative profits to the insurer, the proof requires
more tools than we have developed so far.9
Thus we are left with the case of a risk-averse individual, to which we now turn.
9 In
Exercise 2.22 (at the end of this chapter) the reader is asked to show that a risk-loving individual
would reject a full-insurance contract that yields zero profits to the insurance company.
2.6 Profitable insurance requires risk aversion 31
W2
B if B A
C then
CA
W1
Now consider a risk-averse individual. By definition of risk aversion, she will strictly prefer
the full-insurance contract B with premium h = p` to not insuring, since such contract
leaves her with a wealth of W0 − p` for sure and W0 − p` is the expected value of the
no-insurance lottery NI:
B NI. (2.7)
Contract B yields zero profits for the insurer:11
π(B) = 0. (2.8)
By continuity of preferences and (2.7), any contract C sufficiently close to B will also be
such that
C NI. (2.9)
10 This is similar to the property of real numbers that, if b > a then any number c in a sufficiently small
interval around b will also be greater than a.
11 In fact, contract B lies at the intersection of the zero-profit line and the 45o line.
32 Chapter 2. Insurance: basic notions
If we choose such a contract C which is below the zero-profit line, then – as we saw in
Section 2.5 – π(C) > π(B) and thus, by (2.8), π(C) > 0 and, by (2.9), the individual will
purchase contract C, since it makes her better off relative to not insuring. Hence it is
possible to sell a profitable contract to a risk-averse individual.
The above argument is illustrated graphically in Figure 2.9.
W2
zero-profit line
45o line
NI
C B
Risk aversion ⇒ B NI
Continuity ⇒ C NI
C below 0-π line ⇒ π(C) > 0
W1
0
Figure 2.9: Contract C yields positive profits and is better than no insurance.
that is, A and B lie on the same isoprofit line and hence the monopolist is indifferent
between these two contracts. The customer, however, would strictly prefer contract A to
contract B:A B. In fact, purchasing contract B = (hB , dB ) can be viewed as playing
W0 − hB W0 − hB − dB
the lottery whose expected value is W0 − hB − pdB and the
1− p p
full-insurance contract A guarantees this amount with certainty; thus, by the assumed risk-
aversion of the customer, A B. By continuity of preferences, any contract C sufficiently
close to A would still be such that C B and thus, by transitivity (since, by hypothesis,
B NI) C NI. Choosing such a contract below the isoprofit line going through contracts
A and B ensures that π(C) > π(B). Hence if the monopolist were to switch from contract
B to contract C the customer would still purchase insurance and the monopolist’s profits
would increase. The argument is illustrated in Figure 2.10.
W2
45o line
isoprofit line
B
A
C
Hypothesis: BNI
Risk aversion ⇒ AB
Continuity ⇒ CB
Transitivity ⇒ CNI
C below isoprofit line ⇒ π(C)>π(B)
W1
Figure 2.10: Contract C yields higher profits than contract B and is still better than NI.
Thus a monopolist would offer a full-insurance contract to the potential customer. What
is the maximum premium that the monopolist would be able to charge for full-insurance
without turning the customer away? We can answer this question by appealing to the
notion of risk premium (Definition 2.3.2): the monopolist can set the premium up to the
amount
hmax = p` + RNI
W0 W0 − `
where RNI is the customer’s risk premium for the no-insurance lottery NI = 1− p p
;
that is, the maximum premium the customer would be willing to pay for full insur-
ance is equal to the expected loss, p`, augmented by the risk premium, RNI . In fact,
E[NI] = W0 − p` and thus, by definition of risk premium,
W0 − p` − RNI
NI ∼ .
1
34 Chapter 2. Insurance: basic notions
In other words, if the customer purchases insurance at premium hmax = p` + RNI then she
is guaranteed the certainty equivalent of the no-insurance lottery. If offered full insurance
at this premium, the potential customer would be indifferent between insuring and not
insuring; thus the monopolist might want to offer full insurance at a slightly lower premium
in order to provide the customer with an incentive to purchase insurance.
In the previous section we considered the extreme case of complete absence of competition,
that is, the case where the insurance industry is a monopoly. In this section we consider the
opposite extreme, namely an insurance industry where competition is so intense that profits
are driven down to zero. The story that is often told for such a mythical industry is that
there is free entry into the industry and thus, if firms in the industry are making positive
profits, then some new entrepreneur will enter seeking to share in these profits; entry of new
firms intensifies competition and drives profits down. We shall assume that all the potential
customers in the industry are identical, in the sense that they have the same preferences,
the same initial wealth and face the same potential loss with the same probability (the case
where potential customers are not identical will be analysed in Chapter 8). Furthermore,
we assume that if a new contract is introduced that the insured customers prefer to their
current contract, then they will switch to the new contract.
2. there is no unexploited profit opportunity in the industry, that is, there is no currently
not offered contract that would yield positive profits to a (existing or new) firm that
offered that contract.
By adopting a simple extension of the argument used in the previous section, we now show
that at a competitive free-entry equilibrium all the active firms, that is, all the firms that
are actually selling insurance,12 offer the same contract, namely the “fair” full-insurance
contract with premium h = p`.
12 There could be inactive firms whose contracts nobody purchases: these firms are also trivially making
zero profits.
2.6 Profitable insurance requires risk aversion 35
The first step in the argument is that – by the zero-profit condition – any actually
purchased contract must lie on the zero-profit line. Suppose that there is a contract, call
it A, that is currently being purchased by some customers (thus A % NI) and is different
from the “fair” full-insurance contract with premium h = p`; call the latter contract B: see
Figure 2.11. By definition of risk aversion, it must be that B A and, by continuity of
preferences, any contract sufficiently close to B must also be better than A (thus, since
A % NI, by transitivity of preferences such a contract B is better than no insurance). Pick
a contract C sufficiently close to B (so that C A) and below the zero-profit line (see
Figure 2.11). Then π(C) > 0 and thus a firm that offered this contract would attract all the
customers who are currently purchasing contract A and would make positive profit, so that
the initial situation cannot be a free-entry competitive equilibrium.
W2
zero-profit line
45o line
NI
A
C B Hypothesis: A%NI
Risk aversion ⇒ BA
Continuity ⇒ CA
C below 0-π line⇒ π(C)>0
W1
0
Figure 2.11: Contract C yields positive profits and is better than A (and NI).
2.7 Exercises
The solutions to the following exercises are given in Section 2.8 at the end of this chapter.
2.7.1 Exercises for Section 2.2: Money lotteries and attitudes to risk
o1 o2 o3
Exercise 2.2 Consider the lottery 3 5 2 where
10 10 10
• o1 is the outcome where you get $100 and an A in the class on the Economics of
Uncertainty and Information,
• o2 is the outcome where you get a free trip to Disneyland (which would normally
cost $500) and a C in the class and
• o3 is the outcome where you get a $150 gift certificate at Amazon.com and a B in
the class.
If you are risk neutral, what sum of money would you consider to be just as good as the
lottery?
Exercise 2.3 Given the choice between getting $18 for sure or playing the lottery
$10 $20 $30
3 5 2
10 10 10
James – who likes money (that is, prefers more money to less) – chooses to get $18 for
sure. Is he risk neutral?
Exercise 2.7 Sam has a debilitating illness and has been offered two mutually exclusive
courses of action:
(1) take some well-known drugs which have been tested for a long time, or
(2) take a new experimental drug.
If he chooses (1) then for certain his pain will be reduced to a bearable level. If he
chooses (2) then he has a 50% chance of being completely cured and a 50% chance of
no benefits from the drug and possibly some harmful side effects. He chose (1). What
is his attitude to risk?
Exercise 2.8 Shirley owns a house worth $200, 000. The value of the building is
$75, 000 and the value of the land is $125, 000. In the area where she lives there is
a 10% probability that a fire will completely destroy the building (on the other hand,
the land would not be affected by a fire). An insurance company offers a policy that
covers the full replacement cost of the building in the event of fire (that is, there is no
deductible). The premium for this policy is $7, 500 per year. What attitude to risk must
Shirley have in order to purchase the insurance policy? [Hint: think in terms of wealth
levels.]
38 Chapter 2. Insurance: basic notions
Exercise 2.9 Bill’s entire wealth consists of the money in his bank account: $12, 000.
Bill’s friend Bob claims to have discovered a great investment opportunity, which
would require an investment of $10, 000. Bob does not have any money and asks Bill
to provide the $10, 000. According to Bob, the investment could yield a return of
$150, 000, in which case Bob will return the initial $10, 000 to Bill and then give him
50% of the remaining $140, 000. According to Bob the probability that the investment
will be successful is 12% and the probability that the initial investment of $10, 000
will be completely lost is 88%. Bill decides to go ahead with the investment and gives
$10, 000 to Bob. What is Bill’s attitude to risk?
2.7.2 Exercises for Section 2.3: Certainty equivalent and risk premium
Exercise 2.10 In Section 2.3 we defined three relations over money lotteries: the strict
preference relation (denoted by ), the indifference relation (denoted by ∼) and the ‘at
least as good’ relation (denoted by %). As a matter of fact, one can simply postulate
just one relation, the ‘at least as good’ relation %, and derive the other two from it as
follows:
• L1 L2 if and only if L1 % L2 and it is not the case that L2 % L1 ,
• L1 ∼ L2 if and only if L1 % L2 and also L2 % L1 .
Recall that a relation % over a set L of money lotteries is complete if, for every two
lotteries L1 , L2 ∈ L , either L1 % L2 or L2 % L1 (or both) and is transitive if, for every
three lotteries L1 , L2 , L3 ∈ L , if L1 % L2 and L2 % L3 then L1 % L3 .
Prove that if the ‘at least as good’ relation % is complete and transitive then the
derived ‘strict preference’ relation is also transitive, that is, if L1 L2 and L2 L3
then L1 L3 .
Exercise 2.11 As in Exercise 2.10 take the ‘at least as good’ relation % as primitive and
derive from it the indifference relation ∼. Prove that if the ‘at least as good’ relation %
is transitive then the derived indifference relation ∼ is also transitive, that is, if L1 ∼ L2
and L2 ∼ L3 then L1 ∼ L3 .
Exercise 2.12 As in Exercise 2.10 take the ‘at least as good’ relation % as primitive
and derive from it the ‘strict preference’ relation and the indifference relation ∼.
Prove that if the ‘at least as good’ relation % is transitive then if L1 L2 and L2 ∼ L3
then L1 L3 .
Exercise 2.13 As in Exercise 2.10 take the ‘at least as good’ relation % as primitive
and derive from it the ‘strict preference’ relation and the indifference relation ∼.
Prove that if the ‘at least as good’ relation % is transitive then if L1 ∼ L2 and L2 L3
then L1 L3 .
2.7 Exercises 39
good state
W2
NI
600
A
500 B
450
W1
100 250 425 bad state
Exercise 2.17 Consider again the information given in Exercise 2.15 (shown in Figure
2.12). Assume that the probability of loss is 20%.
(a) Calculate the expected profit from contract A.
(b) Calculate the expected profit from contract B.
(c) Draw the zero-profit line.
(d) Draw the isoprofit line that goes through contract A.
(e) Draw the isoprofit line that goes through contract B.
Exercise 2.18 The equation of the zero-profit line is W2 = 8, 100 − 19 W1 . The individ-
ual’s initial wealth is $7, 600.
(a) What is the probability of loss?
(b) Calculate the potential loss `.
(c) Find a full-insurance contract, call it A, that yields a profit of $40.
(d) Find a contract, call it B, that lies on the isoprofit line through A and has a
deductible of $1, 500.
(e) Find a contract, call it C, with deductible d = 2, 000 that yields a profit of $25.
(f) Write the equation of the isoprofit line that goes through contract A (in the wealth
diagram).
(g) Write the equation of the isoprofit line that goes through contract C (in the wealth
diagram).
2.7 Exercises 41
Exercise 2.19 Consider the wealth diagram shown in Figure 2.13. Let p = 0.2.
(a) Interpret each point (including NI) as an insurance contract and express it in
terms of premium and deductible.
(b) Calculate the expected profit from each contract.
(c) Find the equation of the isoprofit line that goes through each contract (including
the one that goes through point NI).
(good state)
W2
NI 45o line
1,600
1,556 A
1,480 B
W1
0 1,024 1,390 (bad state)
2.7.5 Exercises for Section 2.6: Profitable insurance requires risk aversion
Exercise 2.23 Prove that a risk-loving individual strictly prefers no insurance to a “fair”
full-insurance contract (that is, a full-insurance contract that yields zero profits to the
insurance company).
Exercise 2.24 Ann has an initial wealth of $24, 000 and faces a potential loss of
$15, 000 with probability 20%. The risk premium of the no-insurance lottery for Ann is
$2, 000. If Ann is offered full insurance for a premium of $4, 920 will she take it?
Exercise 2.25 Bob has an initial wealth of $18, 000 and faces a potential loss of
$10, 000 . The risk premium of the no-insurance lottery for Bob is $900. The maximum
premium that he is willing to pay for full insurance is $4, 000. What is the probability
of loss?
3 1 3 2 3 263
10 + 15 + 0 (18) + 20 + 25 + 0 (30) + 36 = = $21.92.
12 12 12 12 12 12
(b) A risk-neutral person is indifferent between the lottery and $21.92 for sure. Assum-
ing that she prefers more money to less, she will prefer $23 to $21.92. Thus, if her
preferences are transitive, she will prefer $23 to the lottery.
3
Solution to Exercise 2.2. One might be tempted to compute the “expected value” 10 100 +
5 2
10 500 + 10 150 = 310 and answer: $310. However, this answer would be wrong, because
the given lottery is not a money lottery: the outcomes are not just sums of money (they do
involve sums of money but also what grade you get in the class). The definition of risk
neutrality can only be applied to money lotteries.
2.8 Solutions to Exercises 43
3 5 2
Solution to Exercise 2.3. The expected value of the lottery is 10 10 + 10 20 + 10 30 = 19. If
James were risk-neutral he would consider the lottery to be just as good as getting $19 for
sure and would therefore choose the lottery (since getting $19 is better than getting $18).
Hence, he is not risk neutral.
24 12 48 6
Solution to Exercise 2.4 The expected value of the lottery 1 2 1 2 is
6 6 6 6
1 2 1 2
6 24 + 6 12 + 6 48 + 6 6 = 18.
Solution to Exercise 2.5 This was a trick question! There is no expected value because
the outcomes are not numbers.
3 1 3 2 3 263
is E[L] = 12 10 + 12 15 + 0 (18) + 12 20 + 12 25 + 0 (30) + 12 36 = 12 = $21.92.
(b) Since Ann prefers more money to less, she prefers $21.92 to $20 ($21.92 $20).
She said that she prefers $20 to lottery L ($20 L). Thus, since her preferences are
transitive, she prefers $21.92 to lottery L ($21.92 L). Hence, she is risk averse.
(c) The answer is: we cannot tell. First of all, since Bob prefers more money to less,
he prefers $21.92 to $20 ($21.92 $20). Bob could be risk neutral, because a risk
neutral person would be indifferent between L and $21.92 (L ∼ $21.92); since Bob
prefers $21.92 to $20 and has transitive preferences, if risk neutral he would prefer
L to $20. However, Bob could also be risk loving: a risk-loving person prefers L to
$21.92 (L $21.92) and we know that he prefers $21.92 to $20; thus, by transitivity,
he would prefer L to $20. But Bob could also be risk averse: he could consistently
prefer $21.92 to L and L to $20 (for example, he could consider L to be just as good
as $20.50).
Solution to Exercise 2.7 Just like Exercise 2.5, this was a trick question! Here the basic
outcomes are not sums of money but states of health. Since the described choice is
not one between money lotteries, the definitions of risk aversion/neutrality/love are not
applicable.
Solution to Exercise 2.8 The decision not to buy insurance is the decision to face the fol-
lowing lottery: with probability 0.9 Shirley’s wealth will be $200, 000, with probability 0.1
it will be $125, 000. The expected value of this lottery is: 0.9(200, 000) + 0.1(125, 000) =
$192, 500. The insurance policy guarantees a wealth of $200, 000 − 7, 500 = 192, 500.
Hence Shirley will buy the insurance policy if she is risk-averse, will be indifferent be-
tween buying and not buying if she is risk-neutral and will prefer not to buy if she is
risk-loving.
44 Chapter 2. Insurance: basic notions
Solution to Exercise 2.9 If Bill refuses to invest, his wealth is $12,000 for sure. If Bill
$2, 000 $82, 000
gives $10, 000 to Bob to invest then he faces the following lottery: L = .
0.88 0.12
88 12
The expected value of L is E[L] = 100 2, 000 + 100 82, 000 = $11, 600. If Bill were risk
averse he would prefer $11,600 for sure to the investment (lottery L) and obviously he
will prefer $12,000 to $11,600; thus he would prefer $12,000 for sure to the investment;
since he decides to go ahead with the investment he is not risk averse. If Bill were risk
neutral he would be indifferent between $11,600 for sure and the investment (lottery L)
and obviously he will prefer $12,000 to $11,600; thus he would prefer $12,000 for sure
to the investment; since he decides to go ahead with the investment he is not risk neutral
either. Hence Bill is risk-loving.
wealth in
good state
W2
NI 45o line
38,000
B
36,000 C
35,000
0
13,000 27,000 35,000 W1
wealth in
bad state
(good state)
W2
45o line
NI
C
B
0 W1 (bad state)
(c) See Figure 2.16. All three profit lines have a slope of − 0.2 1
0.8 = − 4 . The equation of
the zero-profit line is W2 = 625 − 14 W1 .
(good state)
W2
NI
600
A
500 π =0
450 π = 50
B π = 55
W1
0 100 250 425 (bad state)
(b) The no-insurance point is on the zero-profit line. We know that the vertical coordinate
of the no-insurance point is W0 = 7, 600. Thus, to find the horizontal coordinate,
which is equal to W0 − `, we must solve the equation 7, 600 = 8, 100 − 19 W1 which
gives W1 = 4, 500. Thus ` = 7, 600 − 4, 500 = 3, 100.
1
(c) A full-insurance contract with premium h yields a profit of h − p` = h − 10 3, 100 =
h − 310. Thus we want h to solve the equation h − 310 = 40. Hence contract A has
a premium of 350 (and, of course, zero deductible).
(d) A contract with premium h and deductible 1, 500 yields a profit of h − p`+1, 500p =
h − 310 + 150 = h − 160. Hence, since B is on the same isoprofit line as A, it must
be that h − 160 = 40, that is, h = 200. Thus contract B has a premium of 200 and a
deductible of 1, 500.
(e) Similar reasoning as in Part (d): we need h − p` + 2, 000p to be equal to 25. Hence
we must solve h − 310 + 200 = 25, which gives h = 135. Thus contract C has a
premium of 135 and a deductible of 2, 000.
(f) The equation of an isoprofit line is of the form W2 = a − 19 W1 . To find the value of
a for the isoprofit line that goes through contract A we must solve 7, 600 − 350 =
72,500
a − 19 (7, 600 − 350) to get a = 9 = 8, 055.56. Thus the equation of the isoprofit
line that goes through contract A is W2 = 8, 055.56 − 91 W1 .
(b) The expected profit from contract (h, d) is π = h − p(` − d). Thus π(NI) = 0,
π(A) = −38, π(B) = 22.8 and π(C) = 94.8.
p
(c) All the isoprofit lines are straight lines and all have the same slope given by − 1−p =
1
− 5
4 = − 41 . Thus starting at a point (W1 ,W2 ), if you reduce W1 to 0 then the vertical
5
coordinate changes to W2 + 14 W1 , yielding the vertical intercept. Applying this to
point NI we get that by reducing the horizontal coordinate by 1,024, the vertical
1,024
coordinate increases by 4 = 256 to 1, 600 + 256 = 1, 856. Hence the equation
of the isoprofit line that goes through point NI (which is the zero-profit line) is
W2 = 1, 856 − 14 W1 . Applying the same procedure we get that
(good state,
probability 1 − p)
W2
zero-profit line
NI 45o line
W0
C
A
B
set of contracts
with π ≥ 0
0 W1
W0 − ` (bad state,
probability p)
(a) The set of contracts that yield non-negative profits is shown as a shaded triangle in
Figure 2.17.
(b) Contract C shown in Figure 2.17 is the contract that is most preferred by a risk-averse
individual among the contracts in the shaded triangle. C is a full-insurance contract
with premium h = p` guaranteeing a wealth of W0 − p`. Any other contract on the
zero-profit line is worse than contract C because it gives rise to a non-degenerate
lottery with expected value W0 − p`. Any other contract on the thick part of the 45o
line is worse than contract C because it has a higher premium (while still being a
full-insurance contract). Finally, any contract, say A, inside the shaded area lies on
an isoprofit line that goes through a contract which is on the thick part of the 45o
line below point C, call this point B (see Figure 2.17); then B is better than A (since
it guarantees the expected value of A) but is worse than C, so that C is better than A.
(c) The contracts on the thick line from NI to C: the individual is indifferent among all
these contracts, because they correspond to lotteries that have the same expected
value as the NI lottery. On the other hand, contracts below this line have an expected
value which is less than the expected value of the NI lottery and thus are worse than
NI.
2.8 Solutions to Exercises 51
Solution to Exercise 2.21 Let contract B be vertically below contract A as shown in Figure
2.18
(probability 1 − p)
W2
isoprofit line
45o line
A
W0 − hA
∆ B
W0 − (hA + ∆)
| {z }
hB isoprofit line π(B) − π(A)
= ∆ − p∆
= (1 − p)∆ > 0
0 W1 (probability p)
W0 − hA − dA
= W0 − (hA + ∆)
| {z }
hB −dB
hence dB − dA = −∆
Then W1B = W1A and W2B < W2A . From the latter we deduce that the premium of contract
B is higher than the premium of contract A. Let ∆ = W2A − W2B > 0 be the amount by
which B’s premium exceeds A’s premium. From the fact that W1B = W1A we calculate the
difference between the deductible of contract B and the deductible of contract A as follows:
=W2A −∆ 1 =W A
z}|{ z}|{
W B − W1B = W2A −W1A − ∆,
| 2 {z } | {z }
dB dA
so that
dB − dA = −∆.
Thus, when contract B is vertically below contract A, then hB is greater than hA and, letting
∆ be the amount by which hB exceeds hA (∆ = hB − hA ), dB − dA = −∆, that is, the
deductible of contract B is less than the deductible of contract A by an amount equal to
∆. From this we deduce that contract B yields higher profits that contract A, because it
yields an extra $∆ for sure (the premium is received with probability 1) and involves an
extra payment of $∆ to the insured only with probability p (the payment is made with
probability p < 1), that is,13
π(B) − π(A) = ∆ − p∆ = (1 − p)∆ > 0.
13 This conclusion can be verified directly, as follows: π(B) = W0 − W2B − p` + p(W2B − W1B ) and
52 Chapter 2. Insurance: basic notions
Solution
to Exercise 2.22
The no-insurance option corresponds to the wealth lottery
W0 W0 − `
NI = whose expected value is
1− p p
Suppose that the individual is risk neutral and is offered a full-insurance contract (h, 0)
which yields zero profits, that is,
W0 − h
For the potential customer, such a contract corresponds to the wealth lottery C =
1
whose expected value is
E[C] = W0 − h =
|{z} E[NI]. (2.12)
using (2.10) and (2.11)
By risk neutrality, the individual is indifferent between NI and E[NI] for sure (NI ∼ E[NI])
and is also indifferent between C and E[NI] for sure (since – by (2.12) – C and E[NI] for
sure are the same contract):
NI ∼ E[NI] ∼ C.
NI ∼ C.
π(A) = W0 −W2A − p` + p(W2A −W1A ) so that (recall that W1B = W1A and thus pW1B = pW1A )
Solution to Exercise 2.23 The proof is a simple modification of theproof for Exercise
W0 W0 − `
2.22: the no-insurance option corresponds to the wealth lottery NI =
1− p p
whose expected value is
Suppose that the individual is risk loving and is offered a full-insurance contract (h, 0)
that yields zero profits, that is,
h = p`. (2.14)
W0 − h
For the potential customer, such a contract corresponds to the wealth lottery C =
1
E[C]
which is the same as = E[C] = W0 − h |{z}= E[NI]. Since the individual is risk
1
by (2.14)
loving, he prefers NI to E[NI] for sure (NI E[NI]) and is indifferent between C and
E[NI] for sure, since it is the same lottery (C ∼ E[NI]); thus NI E[NI] ∼ C. Assuming
that the individual’s preferences are transitive, it follows that NI C.
Solution to Exercise 2.24 The maximum premium that Ann is willing to pay for full
insurance is hmax = p` + RNI = 15 15, 000 + 2, 000 = 5, 000. Since she is offered full
insurance at a lower premium, namely 4, 920, she will take it (she is better off with full
insurance at this premium than with no insurance).
Solution to Exercise 2.25 The maximum premium that Bob is willing to pay for full insur-
ance is hmax = p` + RNI = p10, 000 + 900. We are told that this is equal to 4, 000. Thus, to
find p we solve the equation p10, 000 + 900 = 4, 000 whose solution is
3,100
p= 10,000 = 0.31.
3. Expected Utility Theory
We will assume throughout this chapter that O is a finite set: O!= {o1 , o2 , ..., om } (m ≥ 1).
o1 o2 ... om
Thus, an element of L (O) is of the form with 0 ≤ pi ≤ 1, for all
p1 p2 ... pm
i = 1, 2, ..., m, and p1 + p2 + ... + pm = 1. We will use the symbol L (with or without
subscript) to denote an element of L (O), that is, a simple lottery. Lotteries are used to
represent situations of uncertainty.! For example, if m = 4 and the individual faces the
o1 o2 o3 o4
lottery L = 2 then she knows that, eventually, the outcome will be
5 0 51 25
one and only one of o1 , o2 , o3 , o4 , but does not know which one; furthermore, she is able
to quantify her uncertainty by assigning probabilities to these outcomes. We interpret
these probabilities either as objectively obtained from relevant (past) data or as subjective
estimates by the individual, as explained in Chapter 2 (Section 2.1).
The assignment of zero probability to a particular basic outcome is taken to be an
expression of belief, not impossibility: the individual is confident that the outcome will not
arise, but she cannot rule out that outcome on logical grounds or by appealing to the laws
of nature.
Among the elements of L (O) there are the degenerate lotteries that assign probability!1
o1 o2 o3 o4
to one basic outcome: for example, if m = 4 one degenerate lottery is .
0 0 1 0
To simplify the notation we will often denote
! degenerate lotteries as basic outcomes, that
o1 o2 o3 o4
is, instead of writing we will simply write o3 . Thus, in general,
0 0 1 0
!
o1 ... oi−1 oi oi+1 ... om
the degenerate lottery will be denoted by oi . As
0 0 0 1 0 0 0
another simplification, we will often omit those outcomes
!that are assigned zero probability.
o1 o2 o3 o4
For example, if m = 4, the lottery 1 will be written more simply as
3 0 23 0
!
o1 o3
1 2 .
3 3
Throughout this chapter we shall call the individual under consideration the Decision-
Maker, or DM for short. The theory of expected utility assumes that the DM has a complete
and transitive ranking % of the elements of L (O) (indeed, this is one of the axioms listed
in the next section). As in Chapter 2, the interpretation of L % L0 is that the DM considers
L to be at least as good as L0 . By completeness, given any two lotteries L and L0 , either
L L0 (the DM prefers L to L0 ) or L0 L (the DM prefers L0 to L) or L ∼ L0 (the DM is
indifferent between L and L0 ). Furthermore, by transitivity, for any three lotteries L1 , L2
and L3 , if L1 % L2 and L2 % L3 , then L1 % L3 . Besides completeness and transitivity, a
number of other “rationality” constraints are postulated on the ranking % of the elements
of L (O); these constraints are the so-called Expected Utility Axioms and are discussed in
3.1 Expected utility: theorems 57
Definition 3.1.1 A ranking % of the elements of L (O) that satisfies the Expected Utility
Axioms (listed in the next section) is called a von Neumann-Morgenstern ranking.
The two theorems in this section are the key results in the theory of expected utility.
Theorem 3.1.1 [von Neumann-Morgenstern, 1944].
Let O = {o1 , o2 , ..., om } be a set of basic outcomes and L (O) the set of simple lotteries
over O. If % is a von Neumann-Morgenstern ranking of the elements of L (O) then
there exists a function U : O → R, called a von Neumann-Morgenstern utility function,
that assigns a number (called utility) to every basic outcome and is such
that, for any
!
o1 o2 ... om o1 o2 ... om
two lotteries L = and L0 = ,
p1 p2 ... pm q1 q2 ... qm
where
! !
U(o1 ) U(o2 ) ... U(om ) U(o1 ) U(o2 ) ... U(om )
U(L) = , U(L0 ) = ,
p1 p2 ... pm q1 q2 ... qm
E[U(L)] is the expected value of the lottery U(L) and E[U(L0 )] is the expected value of
the lottery U(L0 ), that is,
E[U(L)] is called the expected utility of lottery L (and E[U(L0 )] the expected utility of
lottery L0 ).
We say that any function U : O → R that satisfies the property that, for any two lotteries
L and L0 , L % L0 if and only if E[U(L)] ≥ E[U(L0 )] represents the preferences (or
ranking) %.
Before we comment on Theorem 3.1.1 we give an example of how one can use it.
Theorem 3.1.1 sometimes allows us to predict an individual’s choice between two lotteries
C and D if we know how that individual ranks two different lotteries A and B.
58 Chapter 3. Expected Utility Theory
For example, suppose we observe that Susan is faced with the choice between lotteries A
and B below and she says that she prefers A to B:
! !
o1 o2 o3 o1 o2 o3
A= B=
0 0.25 0.75 0.2 0 0.8
With this information we can predict which of the following two lotteries C and D she will
choose, if she has von Neumann-Morgenstern preferences:
! !
o1 o2 o3 o1 o2 o3
C= D= = o2 .
0.8 0 0.2 0 1 0
Let U be a von Neumann-Morgenstern utility function whose existence is guaranteed
by Theorem 3.1.1. Let U(o1 ) = a, U(o2 ) = b and U(o3 ) = c (where a, b and c are
numbers). Then, since Susan prefers A to B, the expected utility of A must be greater than
the expected utility of B: 0.25b + 0.75c > 0.2a + 0.8c. This inequality is equivalent to
0.25b > 0.2a + 0.05c or, dividing both sides by 0.25, b > 0.8a + 0.2c. It follows from this
and Theorem 3.1.1 that Susan prefers D to C, because the expected utility of D is b and the
expected utility of C is 0.8a + 0.2c. Note that, in this example, we merely used the fact
that a von Neumann-Morgenstern utility function exists, even though we do not know what
the values of this function are.
Theorem 3.1.1 is an example of a “representation theorem” and is a generalization
of a similar result for the case of the ranking of a finite set of basic outcomes O. It is
not difficult to prove that if % is a complete and transitive ranking of O then there exists
a function U : O → R, called a utility function, such that, for any two basic outcomes
o, o0 ∈ O, U(o) ≥ U(o0 ) if and only if o % o0 . Now, it is quite possible that an individual
has a complete and transitive ranking of O, is fully aware of her ranking and yet she is not
able to answer the question “what is your utility function?”, perhaps because she has never
heard about utility functions. A utility function is a tool that we can use to represent her
ranking, nothing more than that. The same applies to von Neumann-Morgenstern rankings:
Theorem 3.1.1 tells us that if an individual has a von Neumann-Morgenstern ranking of
the set of lotteries L (O) then there exists a von Neumann-Morgenstern utility function
that we can use to represent her preferences, but it would not make sense for us to ask the
individual “what is your von Neumann-Morgenstern utility function?” (indeed this was a
question that could not even be conceived before von Neumann and Morgenstern stated
and proved Theorem 3.1.1 in 1944!).
Theorem 3.1.1 tells us that a von Neumann-Morgenstern utility function exists; the
next theorem can be used to actually construct such a function, by asking the individual to
answer a few questions, formulated in a way that is fully comprehensible to her (that is,
without using the word ‘utility’). The theorem says that, although there are many utility
functions that represent a given von Neumann-Morgenstern ranking, once you know one
3.1 Expected utility: theorems 59
function you “know them all”, in the sense that there is a simple operation that transforms
one function into the other.
Proof. The proof of Part A of Theorem 3.1.2 is very simple. Let a and b be two numbers,
with a > 0. The hypothesis is that U : O → R is a von Neumann-Morgenstern ! utility
o1 ... om
function that represents %, that is, that, for any two lotteries L = and
p1 ... pm
!
o 1 ... om
L0 = ,
q1 ... qm
Multiplying both sides of inequality (3.1) by a > 0 and adding (p1 + · · · + pm ) b to the
left-hand side and (q1 + · · · + qm ) b to the right-hand side1 we obtain
L % L0 if and only if p1V (o1 ) + ... + pmV (om ) ≥ q1V (o1 ) + ... + qmV (om ),
that is, the function V is a von Neumann-Morgenstern utility function that represents the
ranking %. The proof of Part B will be given later, after introducing more notation and
some observations.
Suppose that the DM has a von Neumann-Morgenstern ranking of the set of lotteries
L (O). Since among the lotteries there are the degenerate ones that assign probability 1 to
a single basic outcome, it follows that the DM has a complete and transitive ranking of
the basic outcomes. We shall write obest for a best basic outcome, that is, a basic outcome
which is at least as good as any other basic outcome (obest % o, for every o ∈ O) and oworst
for a worst basic outcome, that is, a basic outcome such that every other outcome is at least
as good as it (o % oworst , for every o ∈ O). Note that there may be several best outcomes
(then the DM would be indifferent among them) and several worst outcomes; then obest
will denote an arbitrary best outcome and oworst an arbitrary worst outcome. We shall
assume throughout that the DM is not indifferent among all the outcomes, that is, we shall
assume that obest oworst .
We now show that, in virtue of Theorem 3.1.2, among the von Neumann-Morgenstern
utility functions that represent a given von Neumann-Morgenstern ranking % of L (O),
there is one that assigns the value 1 to the best basic outcome(s) and the value 0 to the worst
basic outcome(s). To see this, consider an arbitrary von Neumann-Morgenstern utility
function F : O → R that represents % and define G : O → R as follows: for every o ∈ O,
G(o) = F(o) − F(oworst ). Then, by Theorem 3.1.2 (with a = 1 and b = −F(oworst )), G
is also a utility function that represents % and, by construction, G(oworst ) = F(oworst ) −
F(oworst ) = 0; note also that, since obest oworst , it follows that G(obest ) > 0. Finally,
G(o)
define U : O → R as follows: for every o ∈ O, U(o) = G(o ) . Then, by Theorem
best
3.1.2 (with a = G(o1 ) and b = 0), U is a utility function that represents % and, by
best
construction, U(oworst ) = 0 and U(obest ) = 1. For example, if there are six basic outcomes
and the ranking of the basic outcomes is o3 ∼ o6 o1 o4 o2 ∼ o5 , then one can
take as obest either o3 or o6 and as oworst either o2 or o5 ; furthermore, if F is given by
o1 o2 o3 o4 o5 o6 o1 o2 o3 o4 o5 o6
then G is the function and U is
2 −2 8 0 −2 8 4 0 10 2 0 10
o1 o2 o3 o4 o5 o6
the function .
0.4 0 1 0.2 0 1
The transformations described above show how to normalize any given utility function.
Armed with the notion of a normalized utility function we can now complete the proof of
Theorem 3.1.2.
3.1 Expected utility: theorems 61
V (o) = a2 G(o) + b2 . We can invert the latter transformation and obtain that, for every
o ∈ O, G(o) = Va(o)
2
− ba22 . Thus, we can transform F into U, which – as proved above – is
the same as V , and then transform V into G thus obtaining the following transformation of
F into G:
a1 b1 − b2
G(o) = aF(o) + b where a = > 0 and b = .
a2 a2
R Theorem 3.1.2 is often stated as follows: a utility function that represents a von
Neumann-Morgenstern ranking % of L (O) is unique up to a positive affine transfor-
mation. An affine transformation is a function f : R → R of the form f (x) = ax + b
with a, b ∈ R. The affine transformation is positive if a > 0.
Because of Theorem 3.1.2, a von Neumann-Morgenstern utility function is usually
referred to as a cardinal utility function.
Theorem 3.1.1 guarantees the existence of a utility function that represents a given
von Neumann-Morgenstern ranking % of L (O) and Theorem 3.1.2 characterizes the set
of such functions. Can one actually construct a utility function that represents a given
ranking? The answer is affirmative: if there are m basic outcomes one can construct an
individual’s von Neumann-Morgenstern utility function by asking her at most (m − 1)
questions. The first question is “what is your ranking of the basic outcomes?”. Then
we can construct the normalized utility function by first assigning the value 1 to the
best outcome(s) and the value 0 to the worst outcome(s). This leaves us with at most
(m − 2) values to determine. For this we appeal to one of the axioms discussed in the next
section, namely the Continuity Axiom, which says that, for every basic outcome oi there
62 Chapter 3. Expected Utility Theory
is a probability pi ∈ [0, 1] such that the DM is indifferent between oi for certain and the
lottery that gives a best outcome !with probability pi and a worst outcome with probability
obest oworst
(1 − pi ): oi ∼ . Thus, for each basic outcome oi for which a utility has
pi 1 − pi
not been determined yet,! we should ask the individual to tell us the value of pi such that
obest oworst
oi ∼ ; then we can set Ui (oi ) = pi , because the expected utility of the
pi 1 − pi
!
obest oworst
lottery is piUi (obest ) + (1 − pi )Ui (oworst ) = pi (1) + (1 − pi ) 0 = pi .
pi 1 − pi
Example 3.1 Suppose that there are five basic outcomes, that is, O = {o1 , o2 , o3 , o4 , o5 }
and the DM, who has von Neumann-Morgenstern preferences, tells us that her ranking
of the basic outcomes is as follows: o2 o1 ∼ o5 o3 ∼ o4 . Then we can begin by
assigning utility 1 to the best outcome!o2 and utility 0 to the worst outcomes o3 and
outcome: o1 o2 o3 o4 o5
o4 : . There is only one value left to be determined,
utility: ? 1 0 0 ?
namely the utility of o1 (which is also the utility of o5 , since o1 ∼ o5 ). To find this
value, we ask the DM
! to tell us what value of p makes her indifferent between the lottery
o2 o3
L= and outcome o1 with certainty. Suppose that her answer is: 0.4. Then
p 1− p
her normalized von Neumann-Morgenstern utility function is
!
outcome: o1 o2 o3 o4 o5
.
utility: 0.4 1 0 0 0.4
Knowing this, we can predict her choice among any set of lotteries over these five basic
outcomes.
We can now turn to the list of rationality axioms proposed by von Neumann and Morgen-
stern. This section makes heavy use of mathematical notation and, as mentioned in the
previous section, if the reader is not interested in understanding in what sense the theory
of expected utility captures the notion of rationality, he/she can skip it without affecting
his/her ability to understand the rest of this book.
Let O = {o1 , o2 , . . . , om } be the set of basic outcomes and L (O) the set of simple
lotteries, that is, the set of probability distributions over O. Let % be a binary relation on
L (O). We say that % is a von Neumann-Morgenstern ranking of L (O) if it satisfies the
following four axioms or properties.
Axiom 1 [Completeness and transitivity]. % is complete (for every two lotteries L and
L0 either L % L0 or L0 % L or both) and transitive (for any three lotteries L1 , L2 and L3 , if
L1 % L2 and L2 % L3 then L1 % L3 ).
As noted in the previous section, Axiom 1 implies that there is a complete and transitive
ranking of the basic outcomes. Recall that obest denotes a best basic outcome and oworst
denotes a worst basic outcome and that we are assuming that obest oworst , that is, that the
DM is not indifferent among all the basic outcomes.
! !
obest oworst obest oworst
Axiom 2 [Monotonicity]. % if and only if p ≥ q.
p 1− p q 1−q
!
x1 x2 ... xr
Definition 3.2.1 A compound lottery is a lottery of the form
p1 p2 ... pr
where each xi is either an element of O or an element of L (O).
!
o1 o2 o3 o4
For example, let m = 4. Then L = 2 is a simple lottery (an element of
5 0 15 25
64 Chapter 3. Expected Utility Theory
L (O)), while
! !
o1 o2 o3 o4 o1 o2 o3 o4
1 1 1 1 o1 1
0 15 53
C= 3 6 3 6 5
1 1 1
2 4 4
1 1
2 1 4
4
1 1 1 1 1 1 3
3 6 3 6 5 5 5
o1 o2 o3 o4 o1 o1 o3 o4
Next we define the simple lottery L(C) corresponding to a compound lottery C. Before
introducing the formal definition, we shall explain in an example how to construct such
a simple lottery. Continuing with the example of the compound lottery C given above
and illustrated in Figure 3.1, first we replace a sequence of edges with a single edge and
associate with it the product of the probabilities along the sequence of edges, as shown in
Figure 3.2.
1 1 1 1 1 1 1 3
6 12 6 12 4 20 20 20
o1 o2 o3 o4 o1 o1 o3 o4
Figure 3.2: Simplification of Figure 3.1 obtained by merging paths into simple edges and
associating with the simple edges the products of the probabilities along the path.
o1 o2 o3 o4 o1 o2 o3 o4
2 With r = 3, x1 = 1 1 1 1 , x2 = o1 , x3 = 1 1 3 , p1 = 21 , p2 = 1
4 and
3 6 3 6 5 0 5 5
p3 = 41 .
3.2 Expected utility: the axioms 65
Then we add up the probabilities of each outcome,as shown in Figure3.3. Thus, the
o1 o2 o3 o4
simple lottery L(C) that corresponds to C is L(C) = 28 5 13 14 , namely the
60 60 60 60
lottery shown in Figure 3.3.
28 5 13 14
60 60 60 60
o1 o2 o3 o4
Figure 3.3: Simplification of Figure 3.2 obtained by adding, for each outcome, the proba-
bilities of that outcome.
x1 x2 ... xr
Definition 3.2.2 Given a compound lottery C = the correspond-
p1 p2 ... pr
o1 o2 ... om
ing simple lottery L(C) = is defined as follows. First of all, for
q1 q2 ... qm
i = 1, . . . , m and j = 1, . . . , r, define
1 if x j = oi
0 if x j = ok with k 6= i
oi (x j ) =
o1 ... oi−1 oi oi+1 ... om
si if x j =
s1 ... si−1 si si+1 ... sm
r
Then qi = ∑ p j oi (x j ).
j=1
so that
o1 (x1 ) = 13 , o1 (x2 ) = 1, and o1 (x3 ) = 1
5
66 Chapter 3. Expected Utility Theory
Proof of Theorem 3.1.1. To simplify the notation, throughout this proof we will assume
that we have renumbered the basic outcomes in such a way that obest = o1 and oworst = o!
m.
o1 om
First of all, for every basic outcome oi , let ui ∈ [0, 1] be such that oi ∼ .
ui 1 − ui
The existence of such a value ui is guaranteed by the Continuity Axiom (Axiom 3); clearly
u1 = 1 and um = 0. Now consider an arbitrary lottery
!
o1 ... om
L1 = .
p1 ... pm
This is done through a repeated application of the Independence Axiom (Axiom 4), as
follows. Consider the compound lottery
!
o1 om
o1 o3 ... om
C2 = u2 1 − u2
p1 p2 p3 ... pm
!
o1 om
obtained by replacing o2 in lottery L1 with the lottery that the DM
u2 1 − u2
3.2 Expected utility: the axioms 67
Note that o2 is assigned probability 0 in L2 and thus we have omitted it. By Axiom 4,
L1 ∼ L2 . Now apply the same argument to L2 : let
!
o1 om
o1 ... om−1 om
C3 = u3 1 − u3
p1 + p2 u2 p3 ... pm−1 pm + p2 (1 − u2 )
!
o1 ... om
L3 = .
p1 + p2 u2 + p3 u3 ... pm + p2 (1 − u2 ) + p3 (1 − u3 )
and
m m−1 m m−1
p2 (1 − u2 ) + ... + pm−1 (1 − um−1 ) + pm = ∑ pi − ∑ pi ui = p1 + ∑ pi − ∑ pi ui − p1
i=2 i=2 i=2 i=2
m m−1 m
= (since u1 =1 and um =0) ∑ pi − ∑ piui − p1u1 − pmum =
m
since ∑ pi =1
1 − ∑ pi ui .
i=1 i=2 i=1
i=1
68 Chapter 3. Expected Utility Theory
o1 om
Thus, Lm−1 =
, which proves (3.3). Now define the following
m m
∑ pi ui 1 − ∑ pi ui
i=1 i=1
utility function U : {o1 , ..., om } → [0, 1]: U(oi ) = ui , where, as before, for every basic
o1 om
outcome oi , ui ∈ [0, 1] is such that oi ∼
. Consider two arbitrary lotteries
ui 1 − ui
o1 ... om o1 ... om
and L0 = . We want to show that L % L0 if and only
L=
p1 ... pm q1 ... qm
m m
if E [U(L)] ≥ E [U(L0 )], that is, if and only if ∑ pi ui ≥ ∑ qi ui . By (3.3), L ∼ M, where
i=1 i=1
o1 om o1 om
and also L0 ∼ M 0 , where M 0 =
M=
.
m m m m
∑ pi ui 1 − ∑ pi ui ∑ qi ui 1 − ∑ qi ui
i=1 i=1 i=1 i=1
Thus, by transitivity of %, L % L0 if and only if M % M0; by the Monotonicity Axiom
m m
(Axiom 2), M % M 0 if and only if ∑ pi ui ≥ ∑ qi ui .
i=1 i=1
The following example, known as the Allais paradox, suggests that one should view
expected utility theory as a “prescriptive” or “normative” theory (that is, as a theory
about how rational people should choose) rather than as a descriptive theory (that is,
as a theory about the actual behavior of individuals). In 1953 the French economist
Maurice Allais published a paper regarding a survey he had conducted in 1952 concerning
a hypothetical decision problem. Subjects “with good training in and knowledge of the
theory of probability, so that they could be considered to behave rationally” were asked to
rank the following pairs of lotteries:
! !
$5 Million $0 $1 Million $0
A= 89 11 versus B = 90 10
100 100 100 100
and
! !
$5 Million $1 Million $0 $1 Million
C= 89 10 1 versus D = .
100 100 100 1
3.2 Expected utility: the axioms 69
Most subjects reported the following ranking: A B and D C. Such ranking violates
the axioms of expected utility. To see this, let O = {o1 , o2 , o3 } with o1 = $5 Million,
o2 = $1 Million and o3 = $0. Let us assume that the individual in question prefers more
money to less, so that o1 o2 o3 and has a von Neumann-Morgenstern ranking ! of
$5 Million $0
the lotteries over L (O) . Let u2 ∈ (0, 1) be such that D ∼ (the
u2 1 − u2
existence of such u2 is guaranteed by the Continuity Axiom). Then, since D C, by
transitivity
!
$5 Million $0
C. (3.4)
u2 1 − u2
89 10
Hence, by the Monotonicity Axiom, u2 > 100 + 100 u2 , that is,
89
u2 > 90 . (3.5)
Let B0 be the simple lottery corresponding to the following compound lottery, constructed
!
$5 Million $0
from B by replacing the basic outcome ‘$1 Million’ with :
u2 1 − u2
!
$5 Million $0
$0
u2 1 − u2 .
90 10
100 100
Then
!
$5 Million $0
B0 = 90 90 .
100 u2 1 − 100 u2
Another well-known paradox is the Ellsberg paradox. Suppose that you are told that an
urn contains 30 red balls and 60 more balls that are either blue or yellow. You don’t know
how many blue or how many yellow balls there are, but the number of blue balls plus the
number of yellow ball equals 60 (they could be all blue or all yellow or any combination
of the two). The balls are well mixed so that each individual ball is as likely to be drawn as
any other. You are given a choice between bets A and B, where
A = you get $100 if you pick a red ball and nothing otherwise,
B = you get $100 if you pick a blue ball and nothing otherwise.
Many subjects in experiments state a strict preference for A over B: A B. Consider now
the following bets:
C = you get $100 if you pick a red or yellow ball and nothing otherwise,
D = you get $100 if you pick a blue or yellow ball and nothing otherwise.
Do the axioms of expected utility constrain your ranking of C and D? Many subjects in
experiments state the following ranking: A B and D % C. All such people violate the
axioms of expected utility. The fraction of red balls in the urn is 30 1
90 = 3 . Let p2 be the
fraction of blue balls and p3 the fraction of yellow balls (either of these can be zero: all
we know is that p2 + p3 = 60 2
90 = 3 ). Then A, B,C and D can be viewed as the following
lotteries:
! !
$100 $0 $100 $0
A= 1 , B=
3 p 2 + p3 p2 13 + p3
! !
$100 $0 $100 $0
C= 1 , D= 2 1
3 + p3 p2 p2 + p3 = 3 3
Let U be the normalized von Neumann-Morgenstern utility function that represents the
individual’s ranking; then U($100) = 1 and U(0) = 0. Thus,
E [U(A)] = 13 , E [U(B)] = p2 , E [U(C)] = 31 + p3 , and E [U(D)] = p2 + p3 = 23 .
Hence, A B if and only if 13 > p2 , which implies that p3 > 31 , so that E [U(C)] = 13 + p3 >
E [U(D)] = 32 and thus C D (similarly, B A if and only if 13 < p2 , which implies that
E [U(C)] < E [U(D)] and thus D C).
3.3 Exercises
The solutions to the following exercises are given in Section 3.4 at the end of this chapter.
Exercise 3.1 Ben is offered a choice between the following two money lotteries:
! !
$4, 000 $0 $3, 000
A= and B = .
0.8 0.2 1
He says he strictly prefers B to A. Which of the following two lotteries, C and D, will
Ben choose if he satisfies the axioms of expected utility and prefers more money to
less?
! !
$4, 000 $0 $3, 000 $0
C= , D= .
0.2 0.8 0.25 0.75
Exercise 3.2 There are three basic outcomes, o1 , o2 and o3 . Ann satisfies the axioms of
expected utility theory and her preferences over lotteries involving these three outcomes
can be represented by the following von Neumann-Morgenstern utility function:
Jennifer says that she is indifferent between lottery L1 and getting $2,000 for certain.
She is also indifferent between lottery L2 and getting $1,000 for certain. Finally, she
says that between L3 and L4 she would chose L3 . Is she rational according to the theory
of expected utility? [Assume that she prefers more money to less.]
72 Chapter 3. Expected Utility Theory
Exercise 3.6 There are five basic outcomes. Jane has a von Neumann-Morgenstern
ranking of the set of lotteries over basic outcomes
that can be represented
by either of
o1 o2 o3 o4 o5
the following utility functions U and V : U : 44 170 −10 26 98 .
V : 32 95 5 23 59
(a) Show how to normalize each of U and V and verify that you get the same
normalized utility function.
(b) Show how to transform U into V with a positive affine transformation of the form
x 7→ ax + b with a, b ∈ R and a > 0.
3.3 Exercises 73
! !
$28 $10 $50
Exercise 3.7 Consider the following lotteries: L3 = , L4 = 1 1 .
1 2 2
(a) Ann has the following von Neumann-Morgenstern utility function: UAnn ($m) =
√
m. How does she rank the two lotteries?
(b) Bob has the following von Neumann-Morgenstern utility function: UBob ($m) =
m4
2m − 100 3 . How does he rank the two lotteries?
(c) Verify that both Ann and Bob are risk averse, by determining what they would
choose between lottery L4 and its expected value for certain.
Exercise 3.8 Let O = {o1 , o2 , o3 , o4 }. Find the simple lottery corresponding to the
following compound lottery
! ! !
o1 o2 o3 o4 o1 o3 o4 o2 o3
2 1 3 1 o2 1 1 3 1 2
5 10 10 5 5 5 5 3 3
1 1 1 1
8 4 8 2
Exercise 3.9 Let O = {o1 , o2 , o3 , o4 }. Suppose that the DM has a von Neumann-
Morgenstern ranking of L (O) and states the following indifference:
! !
o2 o4 o3 o4
o1 ∼ 1 3 and o2 ∼ 3 2 .
4 4 5 5
Do not add any information to what is given above (in particular, do not make any
assumptions about which outcome is best and which is worst).
74 Chapter 3. Expected Utility Theory
Exercise 3.10 — More difficult. Would you be willing to pay more in order to reduce
the probability of dying within the next hour from one sixth to zero or from four sixths
to three sixths? Unfortunately, this is not a hypothetical question: you accidentally
entered the office of a mad scientist and have been overpowered and tied to a chair. The
mad scientist has put six glasses in front of you, numbered 1 to 6, and tells you that
one of them contains a deadly poison and the other five contain a harmless liquid. He
says that he is going to roll a die and make you drink from the glass whose number
matches the number that shows up from the rolling of the die. You beg to be exempted
and he asks you “what is the largest amount of money that you would be willing to
pay to replace the glass containing the poison with one containing a harmless liquid?”.
Interpret this question as “what sum of money x makes you indifferent between
(1) leaving the poison in whichever glass contains it and rolling the die, and
(2) reducing your wealth by $x and rolling the die after the poison has been replaced by
a harmless liquid”. Your answer is: $X.
Then he asks you “suppose that instead of one glass with poison there had been four
glasses with poison (and two with a harmless liquid); what is the largest amount of
money that you would be willing to pay to replace one glass with poison with a glass
containing a harmless liquid (and thus roll the die with 3 glasses with poison and 3 with
a harmless liquid)?”. Your answer is: $Y .
Show that if X > Y then you do not satisfy the axioms of Expected Utility Theory.
[Hint: think about what the basic outcomes are; assume that you do not care about
how much money is left in your estate if you die and that, when alive, you prefer more
money to less.]
Solution to Exercise 3.3 We can take the set of basic outcomes to be {$3000, $2000, $1000,
$500}. Suppose that there is a von Neumann-Morgenstern utility function U that represents
Jennifer’s preferences. We can normalize it so that U($3000) = 1 and U($500) = 0. Since
Jennifer is indifferent between L1 and $2000, U($2000) = 56 (since the expected utility
of L1 is 56 (1) + 61 (0) = 56 ). Since she is indifferent between L2 and $1000, U($1000) = 23
(since the expected utility of L2 is 23 (1) + 31 (0) = 23 ). Thus, E[U(L3 )] = 14 (1) + 14 65 +
1 2
1 5 1 5
1 2 3 3 5
4 3 + 4 (0) = 8 and E[U(L 4 )] = 2 6 + 2 3 = 4 . Since 4 > 8 , Jennifer should prefer
1 1 1 1 2
(b) The expected utility of L1 is 5 ln(30) + 10 ln(28) + 10 ln(24) + 5 ln(18) + 5 ln(8) =
1 2 1
2.741 while the expected utility of L2 is 10 ln(30) + 5 ln(28) + 2 ln(8) = 2.713. Thus,
o1 o2 o3 o4 o5
54 180 0 36 108
Ū : 180 = 0.3 180 = 1 180 = 0 180 = 0.2 180 = 0.6
To normalize V first subtract 5 from each value and then divide by 90. Denote the
normalization of V by V̄ . Then
o1 o2 o3 o4 o5
27 90 0 18 54
V̄ : 90 = 0.3 90 =1 90 =0 90 = 0.2 90 = 0.6
(b) The transformation is of the form V (o) = aU(o) + b. To find the(values of a and
44a + b = 32
b plug in two sets of values and solve the system of equations .
170a + b = 95
The solution is a = 21 , b = 10. Thus, V (o) = 12 U(o) + 10.
28 4
(b) Bob prefers L4 to L3 (L4 Bob L3 ). In fact, E [UBob (L3 )] = 2(28) − 1003 = 55.3853
h 4
i h 4
i
10
while E [UBob (L4 )] = 21 2(10) − 100 1 50
3 + 2 2(50) − 1003 = 56.87.
lottery L4 .
!
o1 o2 o3 o4
Solution to Exercise 3.8 The simple lottery is 18 103 95 24 . For example, the
240 240 240 240
1 1
+ 14 (1) + 18 (0) + 12 1 103
probability of o2 is computed as follows: 8 10 3 = 240 .
3.4 Solutions to Exercises 77
Solution to Exercise 3.9 Using the stated indifference, use lottery L to construct the
compound lottery
! !
o2 o4 o3 o4
1 3 3 2 o3 o4
4 4 5 5
,
1 2 1 1
3 9 9 3
!
o1 o2 o3 o4
whose corresponding simple lottery is L0 = 1 11 121 . Then, by the Indepen-
0 12 45 180
dence Axiom, L ∼ L0 .
Solution to Exercise 3.10 Let W be your initial wealth. The basic outcomes are:
1. you do not pay any money, do not die and live to enjoy your wealth W (denote this
outcome by A0 ),
2. you pay $Y , do not die and live to enjoy your remaining wealth W − Y (call this
outcome AY ),
3. you pay $X, do not die and live to enjoy your remaining wealth W − X (call this
outcome AX ),
4. you die (call this outcome D); this could happen because (a) you do not pay any
money, roll the die and drink the poison or (b) you pay $Y , roll the die and drink the
poison; we assume that you are indifferent between these two outcomes.
The maximum amount $P that you are willing to pay is that amount that makes you
indifferent between (1) rolling the die with the initial number of poisoned glasses and (2)
giving up $P and rolling the die with one less poisoned glass.
Thus – based on your answers – you are indifferent between the two lotteries
! !
D A0 AX
1 5 and
6 6 1
78 Chapter 3. Expected Utility Theory
Thus,
1
U(D) + 56 U(A0 ) = U(AX ) and 4
U(D) + 26 U(A0 ) = 36 U(D) + 36 U(AY ) .
|6 {z } |6 {z } | {z }
1 5 5 4 2 2 3 3
= 6 0+ 6 1= 6 = 6 0+ 6 1= 6 = 6 0+ 6 U(AY )
5 2
Hence, U(AX ) = 6 and U(AY ) = 3 = 46 , so that U(AX ) > U(AY ), contradicting (3.6).
4. Money lotteries revisited
It is clear that one utility function that satisfies (4.1) is the identity function U(m) = m. In
fact, when U is the identity function
In Chapter 3 we assumed that the set of possible monetary outcomes (over which lotteries
were defined) was finite. If we allow for any non-negative amount of money then the vNM
utility-of-money function of a risk-neutral individual is represented by the 45o line, as
shown in Figure 4.1.
utility
U
45o line
0 m
money
Figure 4.1: The identity function U(m) = m represents the vNM preferences of a risk-
neutral individual.
such that
E[L] L if and only if U (E[L]) > E[U(L)], that is, if and only if,
(4.3)
U p1 m1 + ... + pn mn
| {z }
> p1U(m1 ) + ... + pnU(mn ) .
| {z }
E[L] E[U(L)]
The inequality in (4.3) is shown graphically in Figure 4.2 when n = 2, that is, with reference
$m1 $m2
to the lottery L = with m1 < m2 and 0 < p < 1.
p 1− p
utility
U
U(m2 )
U(m1 )
$m1 $m2
L=
p 1− p 0 m1 m2 m
money
pm1 + (1 − p)m2
| {z }
E[L]
Figure 4.2: A graphical representation of the inequality U (E[L]) > E[U(L)], that is
U (pm1 + (1 − p)m2 ) > pU(m1 ) + (1 − p)U(m2 ).
In Figure 4.2, the point pm1 + (1 − p)m2 is a point on the horizontal axis between m1
and m2 (the closer to m1 , the closer p is to 1). Since the individual is assumed to prefer
more money to less, U(m1 ) < U(m2 ). The point pU(m1 ) + (1 − p)U(m2 ) is a point on the
vertical axis between U(m1 ) and U(m2 ). To find that point, draw a straight-line segment
from the point (m1 ,U(m1 )) to the point (m2 ,U(m2 )) (the dashed line in Figure 4.2) and
go vertically up from the point pm1 + (1 − p)m2 on the horizontal axis to the dashed line
and from there horizontally to the vertical axis. By (4.3) this point must be below the point
U (pm1 + (1 − p)m2 ).
82 Chapter 4. Money lotteries revisited
It follows from the above discussion that, if we draw a continuous vNM utility-of-
money function for a risk-averse individual, the corresponding curve must lie above the
straight-line segment joining any two points on the graph, as shown in Figure 4.3. This
property is know as strict concavity.
utility
U
U(m2 )
U (pm1 + (1 − p)m2 )
pU(m1 ) + (1 − p)U(m2 )
U(m1 )
0 m1 m2 m
money
pm1 + (1 − p)m2
An argument similar to the one used in the previous section leads to the conclusion that,
if we draw a continuous vNM utility-of-money function for a risk-loving individual, the
graph must lie below the straight-line segment joining any two points on the graph, as
shown in Figure 4.4. This property is know as strict convexity.
utility
U
U(m2 )
U(m1 )
0 m1 m2 m
money
pm1 + (1 − p)m2
Definition 4.1.2 A function f : R+ → R is strictly convex if, for every x, y ∈ R and for
every p ∈ (0, 1),
f (px + (1 − p)y) < p f (x) + (1 − p) f (y).
utility
U
0 m1 m2 m
money
Figure 4.5: The utility function of an individual with different attitudes to risk for different
lotteries.
4.1 von Neumann Morgenstern preferences over money lotteries 85
4.1.5 Attitude to risk and the second derivative of the utility function
If the vNM utility-of-money function of an individual is a smooth function (or, at least,
twice differentiable) then we can relate the attitude to risk (that is, the shape of the graph
of the function) to the second-derivative of the utility function, using the following result
from calculus.
Let f : R+ → R be a twice differentiable function, then
d2 f
• f is strictly concave if and only if (x) < 0, for every x ∈ R+ ,
dx2
d2 f
• f is strictly convex if and only if (x) > 0, for every x ∈ R+ ,
dx2
d2 f
• the graph of f is a straight line if and only if (x) = 0, for every x ∈ R+ .
dx2
√
For example, an individual whose vNM utility-of-money function is U(m) = m is risk
1 d 2U √1
averse, since dU
dm = 2 m and thus dm2 = − 4 m3 which is negative for every m > 0. Figure
√
4.6 shows the graph of this function.
utility
√
U(m) = m
0 m
money
√
Figure 4.6: The graph of the utility function U(m) = m.
86 Chapter 4. Money lotteries revisited
2
On the other hand, an individual whose vNM utility-of-money function is U(m) = m8
d 2U 1
is risk loving, since dU m
dm = 4 and thus dm2 = 4 > 0 . Figure 4.7 shows the graph of this
function.
utility
2
U(m) = m8
0 m
money
m2
Figure 4.7: The graph of the utility function U(m) = 8 .
(given our assumption about the size of potential losses, W + xi ≥ 0, for every i = 1, . . . , n),
whose expected value is
E[L] = W + E[M].
If the individual has vNM preferences represented by the utility-of-money function U(m),
the expected utility of L is
The risk premium associated with lottery L and utility function U is that amount of
money RLU such that the agent is indifferent between lottery L and the sum of money
$(E[L] − RLU ) for sure:
U (E[L] − RLU ) = E[U(L)].
Thus RLU is the maximum amount that an individual with vNM utility function U is willing
to forego to exchange the risky prospect L for a non-risky (i.e. certain) one with the same
expected value.
For example, consider the money lottery
$11 $20
M= 3 2
5 5
whose expected value is E[M] = 35 11 + 25 20 = $14.6, and an individual whose initial wealth
is W = $5√and whose vNM preferences can be represented by the utility-of-money function
U(m) = m. Then, in terms of wealth levels, the individual is facing the lottery
$16 $25
L= 3 2
5 5
whose expected value√ is E[L√0 ] = $(14.6 + 110) = $124.6; the expected utility of lottery
L0 is E[U(L0 )] = 53 121 + 25 130 = 11.1607. Hence, √ for this lottery and this individual,
the risk premium is the solution to the equation 124.6 − RL0U = 11.1607, which is
RL0U = $0.0388.
Thus – as measured by the risk premium– the degree of risk aversion (incorporated
in a given vNM utility function U) towards a given money lottery M, varies with the
individual’s initial wealth. In the above example, when the individual’s initial wealth is
only $5, she is prepared to avoid the risky prospect M by reducing the expected value of
the corresponding wealth lottery by an amount of up to 24 cents, but if her initial wealth is
$110 then she is only prepared to reduce the expected value of the corresponding wealth
lottery by up to 4 cents.
88 Chapter 4. Money lotteries revisited
R From now on we will use the expression wealth lottery to refer to a lottery whose
outcomes are levels of wealth for the individual (such as lotteries L and L0 above,
constructed from the money lottery M by adding the individual’s initial wealth to
every outcome in M).
The reader should try to prove the following: suppose that, given a wealth lottery L
and a vNM utility function U, the risk premium is $r; then the risk premium remains
$r if the utility-of-money function U is replaced by a positive affine transformation
V of U, that is, for every m, V (m) = aU(m) + b, with a > 0. In other words, if, for
every m, V (m) = aU(m) + b, with a > 0, then RLV = RLU .
Instead of comparing, for a fixed utility function, the risk premium of a given money
lottery across different levels of initial wealth, we can also compare the risk premium,
for a fixed wealth lottery, for different utility functions (representing different prefer-
ences, hence different individuals).
Figure 4.8 shows the risk premium for the wealth
$m1 $m2
lottery L = and two different utility functions, U and V . Let RLU be the
p 1− p
risk premium associated with U and RLV be the risk premium associated with V ; then
RLV = [pm1 + (1 − p)m2 ] − m̂V > RLU = [pm1 + (1 − p)m2 ] − m̂U , where m̂V is the cer-
tainty equivalent of lottery L for V and m̂U is the certainty equivalent of lottery L for U
(see Chapter 2, Section 2.3 for the notion of certainty equivalent).
utility
U(m2 ) = V (m2 )
V U
E[U(L)] = E[V (L)]
= U(m̂U ) = V (m̂V )
risk premium for V
wealth
m1 m̂V m̂U m2
pm1 + (1 − p)m2
| {z }
E[L]
Figure 4.8: The graphs of two utility functions U and V and the risk premia corresponding
to the wealth lottery that gives $m1 with probability p and $m2 with probability (1 − p).
4.2 Measures of risk aversion 89
Let us now focus on the issue of comparing different utility functions in terms of the
the extent to which they express risk aversion. Using the notion of risk premium, one
possibility is given in the following definition.
Since we identified risk aversion with concavity of the vNM utility function, it seems
that the more concave the utility function, the more risk averse the individual is. This
intuition is confirmed in Figure 4.8 (on page 88): V is more concave than U and indeed it
incorporates more risk aversion, as measured by the size of the risk premium.
Since we identified concavity with negativity of the second derivative of the utility function,
one might be tempted to conclude that an individual with vNM utility-of-money function V
is more risk-averse than an individual with vNM utility-of-money function U if, in absolute
d 2V d 2U
value, the second derivative of V is larger than the second derivative of U: dm 2 > dm2 .
Unfortunately, this is not correct, because it violates the requirement that, if V is a positive
affine transformation of U, then V and U represent the same preferences and thus the same
degree of risk aversion (as was pointed out in the remark on page 88). For example, let
d 2V d 2U d 2U
V (m) = 2U(m), for every m ≥ 0. Then, for every m, dm 2 (m) = 2 dm2 (m) > dm2 (m)
U 00 (m)
AU (m) = − .
U 0 (m)
The minus sign makes this expression positive (since U 0 (m) > 0, because the individual is
assumed to prefer more money to less, and U 00 (m) < 0, since the individual is assumed to
be risk averse).
Note that the Arrow-Pratt measure of absolute risk aversion is a local measure, since it
varies with the amount of money considered; that is, typically, if m1 6= m2 then AU (m1 ) 6=
AU (m2 ).
Let us verify that the Arrow-Pratt measure of risk aversion is invariant to affine
transformations. Let a and b be real numbers, with a > 0, and let V (m) = aU(m) + b, for
(m) 00
(m) U (m) 00 00
every m ≥ 0. Then V 0 (m) = aU 0 (m) and V 00 (m) = aU 00 (m), so that VV 0 (m) = aU
aU 0 (m) = U 0 (m)
and thus AV (m) = AU (m).
Using the Arrow-Pratt measure of risk aversion we can introduce a second definition
of “more risk averse”.
√
For example, according to Definition 4.2.2, which of m and ln(m) incorporates
greater risk aversion? Let us compute the Arrow-Pratt measure of risk aversion for these
two functions.
d √ d2 √
Since dm m = − 2√1 m and dm 2 m = − √1 3 , A√ (m) = 2m 1
.
4 m
1 d2
On the other hand, since d
dm ln(m) = m and dm2
ln(m) = − m12 , Aln (m) = m1 .
1 1
Thus, since, for every m > 0, m > 2m we have that, for every m > 0, Aln (m) > A√ (m)
and thus, according to Definition√ 4.2.2, the utility function ln(m) incorporates more risk
aversion than the utility function m.
Note that, in general, there may be utility functions U and V that cannot be ranked
according to Definition 4.2.2. For example, it may be that case AU (m) > AV (m) for values
of m in some interval and AU (m) < AV (m) for values of m in some other interval: see
Exercise 4.11.
Yet a third definition of “more risk averse” relies on the intuition that “more concave”
means “more risk averse”:
Definition 4.2.3 Let U and V be two concave vNM utility-of-money functions. We say
that V incorporates more risk aversion than U if there exists a strictly increasing and
concave function f : R → R such that, for every m ≥ 0, V (m) = f (U(m)). In this case
we say that V is a concave transformation of U.
√
increasing, concave function, V (m) = ln ( m)
For example, since ln(x) is a strictly √
is a concave transformation of U(m) = m and thus, according to Definition 4.2.3, V
incorporates more risk aversion than U.
Of course, having three different definitions of greater risk aversion is rather confusing:
which of the three is the “correct” definition? Furthermore, while the condition in Definition
4.2.2 is somewhat easier to verify, the condition in Definition 4.2.1 is not very practical,
since it would require considering all possible wealth lotteries, and the condition in
Definition 4.2.3 is also hard to verify: how can one tell if one function is a concave
transformation of another? Luckily, it turns out that the three definitions are in fact
equivalent. The following theorem was proved by John Pratt in 1964.3
3 John W. Pratt, Risk aversion in the small and in the large, Econometrica, Vol. 32, No. 1/2, 1964, pp.
122-136.
4.2 Measures of risk aversion 91
Theorem 4.2.1 Let U(m) and V (m) be two functions. Then the following conditions
are equivalent:
1. RLV ≥ RLU , for every non-degenerate wealth lottery L.
2. AV (m) ≥ AU (m), for every m.
3. There exists a strictly increasing and concave function f : R → R such that
V (m) = f (U(m)), for every m ≥ 0.
4 In other words, rU (m) is the absolute value of the elasticity of marginal utility, U 0 (m), with respect to m.
92 Chapter 4. Money lotteries revisited
utility
U(m) = 1 − e−m
0 m
money
−m ) = e−m d2
Since d
dm (1 − e and dm2
(1 − e−m ) = −e−m it follows that the Arrow-Pratt
measure of absolute risk aversion for this function is a constant:
−e−m
A(m) = − = 1.
e−m
5 On the other hand, the Arrow-Pratt measure of relative risk aversion is constant: r√ (m) = mA√ (m) =
1
m 2m = 12 .
6 And the Arrow-Pratt measure of relative risk aversion is constant: rln (m) = mAln (m) = m m1 = 1.
4.4 Higher risk 93
Indeed, this is a special case of the class of CARA (Constant Absolute Risk Aversion)
utility functions, which is the class of functions of the form
1
U(m) = 1 − e−λ m = 1 −
eλ m
d2
where λ is a positive constant. In fact, d
dm 1 − e−λ m = λ e−λ m and dm2
1 − e−λ m =
−λ 2 e−λ m so that the Arrow-Pratt measure of absolute risk aversion is equal to λ .
We saw above that the utility function ln(m) is characterized by constant relative risk
aversion. That is not the only function with this property. The class of CRRA (Constant
Relative Risk Aversion) utility functions contains, besides the natural logarithm function,7
the functions of the form
m(1−λ )
U(m) = with λ > 0, λ 6= 1.
1−λ
λ
For these functions the Arrow-Pratt measure of absolute risk aversion is AU (m) = m so
that the Arrow-Pratt measure of relative risk aversion is rU (m) = mAU (m) = λ .
A lottery over {m1 , m2 , . . . , mn } coincides with a probability distribution over this set.
7 And, more generally, the logarithmic functions loga (m) with a > 1.
94 Chapter 4. Money lotteries revisited
Let L be one such lottery, whose probabilities are {p1 , p2 , . . . , pn } (pi is the probability of
prize $mi , for every i = 1, 2, . . . , n)8 and let M be another lottery, whose probabilities are
{q1 , q2 , . . . , qn }:
$m1 $m2 . . . $mn $m1 $m2 . . . $mn
L= and M= .
p1 p2 . . . pn q1 q2 . . . qn
R Note that we allow for the possibility that some of the pi ’s and qi ’s are zero.
Pi = p1 + · · · + pi and Qi = q1 + · · · + qi
For example,
$26 $40 $58 $80 $96 $26 $40 $58 $80 $96
L= 7 3 1 9 >FSD M = 7 4 9
20 0 20 20 20 20 0 20 0 20
Theorem 4.4.1 Let L and M be two money lotteries (over the same set of prizes). Then
L >FSD M
if and only if
9 The theorem can be proved using Abel’s Lemma, which says that if a1 , . . . , an and b1 , . . . , bn are real
n n−1
numbers, then, letting Ai = a1 + · · · + ai and Bi = b1 + · · · + bi , ∑ ai bi = ∑ Ai (bi − bi+1 ) + An bn (see
i=1 i=1
https://planetmath.org/abelslemma ). To prove Theorem 4.4.1 using Abel’s Lemma, let ai = qi − pi
and bi = U(mi ).
96 Chapter 4. Money lotteries revisited
Now let us construct a new lottery M by taking the probability assigned to the prize $4,
namely 13 , and spreading it equally between the prizes $3 and $5, as shown inn Figure 4.10.
It is easy to check that the expected value of M is the same as the expected value of L,
namely 5.
$2 $3 $4 $5 $9
L: 13 0 31 0 13
M: 1 1 1 1
3 6 0 6 3
Intuitively, a risk-averse person should dislike the change from L to M because it involves
more risk: the prize of $4 has been replaced with a non-degenerate “sub-lottery” with
expected value of $4. By definition of risk aversion, the sub-lottery is worse than its
expected value.
To perform a ‘worsening” of lottery L it is not even necessary to spread out the entire
probability of prize $4; for example, we could merely take away half that probability,
namely 16 , and spread it equally between the prizes $3 and $5 thus obtaining the alternative
lottery
0 $2 $3 $4 $5 $9
M = 1 1 1 1 1 .
3 12 6 12 3
It is easy to check that also the expected value of M0
is 5. In Exercise 4.18√the reader
is asked to verify that an individual with utility-of-money function U(m) = m strictly
prefers L to M 0 and M 0 to M.
In fact, according to the next definition, M is a mean-preserving spread of M 0 , which,
in turn, is a mean-preserving spread of L.
Before giving the formal definition of a mean-preserving spread, let us gain some
intuition, as follows. Start with a lottery
$m1 $m2 . . . $mn
L=
p1 p2 . . . pn
and fix three monetary prizes mi , m j and mk with mi < m j < mk and assume that m j has
positive probability in L, that is, p j > 0. Since m j is strictly between mi and mk , there is a
δ ∈ (0, 1) such that m j = (1 − δ )mi + δ mk , in fact
m j − mi
δ= . ()
mk − mi
4.4 Higher risk 97
Now focus on the part of lottery L that involves the three prizes mi , m j and mk :
. . . $mi . . . $m j . . . $mk . . .
. . . $pi . . . $p j . . . $pk . . .
Let α ∈ (0, 1) and let us reduce the probability of m j from p j to p j − α p j and spread the
probability α p j between mi and mk in the proportions (1 − δ ) and δ , respectively, where
δ is given by (). This is shown in Figure 4.11.
. . . pi +(1 − δ )α p j . . . p j −α p j . . . pk +δ α p j . . .
Let M be the new lottery so constructed. Thus L and M differ only in the probabilities
assigned to prizes mi , m j and mk . The contribution of these prizes to the calculation of the
expected value of the initial lottery L is
pi mi + p j m j + pk mk
while the contribution of these prizes to the calculation of the expected value of the new
lottery M is
pi + (1 − δ )α p j mi + p j − α p j m j + pk + δ α p j mk
= pi mi + p j m j + pk mk + α p j (1 − δ )mi + δ mk − m j
| {z }
=0 because (1−δ )mi +δ mk =m j
= pi mi + p j m j + pk mk .
Hence E[L] = E[M].
We are now ready to give the definition of a mean-preserving spread.10
10 The notion of a mean preserving spread of a probability distribution was introduced by Michael
Rothschild and Joseph Stiglitz in “Increasing risk I: A definition”, Journal of Economic Theory, 1970, Vol.
2, pp. 225-243. Their definition involved re-assigning probabilities across four different points, but that
definition is equivalent to one that is based on reassigning probabilities across three different points, as
shown by Eric Rasmusen and Emmanuel Petrakis, “Defining the mean-preserving spread: 3-pt versus 4-pt”,
in: Decision making under risk and uncertainty: new models and empirical findings, edited by John Geweke,
Amsterdam: Kluwer, 1992. Rasmusen and Petrakis’ definition is different from ours, but equivalent to it.
Their definition is as follows. Take three points a1 , a2 , a3 with a1 < a2 < a3 ; a mean-preserving spread
is a triple of probabilities γ1 , γ2 , γ3 such that (1) γ1 + γ3 = γ2 ≤ p2 (where p2 is the initial probability of
a2 ) and (2) γ1 a1 − γ2 a2 + γ3 a3 = 0. The initial probability of ai is pi and the modified probabilities are
p1 + γ1 , p2 − γ2 , p3 + γ3 . To convert it into our definition let α = γ1p+γ
2
3
and δ = aa23 −a1
−a1 . Invert the operations
to go from our definition to theirs.
98 Chapter 4. Money lotteries revisited
Definition 4.4.2 Let
! !
$m1 $m2 . . . $mn $m1 $m2 . . . $mn
L= and M= .
p1 p2 ... pn q1 q2 ... qn
m j − mi
qi = pi + (1 − δ ) α p j , q j = p j − α p j, qk = pk + δ α p j with δ = .
mk − mi
Theorem 4.4.2 Let L and M be two money lotteries (over the same set of prizes). Then
L >SSD M
if and only if
E[U(L)] > E[U(M)], for every strictly increasing and concave utility function U.
4.5 Exercises
The solutions to the following exercises are given in Section 4.6 at the end of this chapter.
4.5.1 Exercises for Section 4.1: vNM preferences over money lotteries
11 This result is best known to economists from the 1970 paper by Rothschild and Stiglitz mentioned in
Footnote 10. However, in a later article (Michael Rothschild and Joseph Stiglitz, Addendum to ‘Increasing
risk I: A definition’, Journal of Economic Theory, 1972, Vol. 5, p. 306) the authors themselves acknowledged
that their main result could have been derived from earlier contributions by mathematicians.
100 Chapter 4. Money lotteries revisited
Exercise 4.2 Consider again the wealth lottery of Exercise 4.1, but a different agent:
√
Jim, whose vNM utility-of-money function is U(m) = m. Answer the same questions
as in Exercise 4.1 but referring to Jim.
Exercise 4.3 What attitude to risk is incorporated in the following vNM utility-of-
money functions? Base your answer on the sign of the second derivative of the utility
function.
(a) ln(m + 1)
(b) 8 + m1.65
(c) 2 + 7m.
Exercise 4.4 Let m denote the amount of money (measured in millions of dollars) and
suppose that it varies in the interval [0, 1]. John’s utility-of-money function is given by:
U(m) = −m2 + 2m − 4.
V (m) = −3 m2 − 2m .
Exercise 4.6 As in Exercise 4.2, consider Jim, whose vNM utility-of-money function
√
is V (m) = m, and the lottery
$8 $18 $24 $28 $30
L= 2 1 1 1 1
5 5 10 10 5
Exercise 4.7 As in Exercise 4.4, let m denote the amount of money, measured in
millions of dollars, and suppose that it varies in the interval [0, 1]. John’s utility-of-
2
by: U(m) = −m + 2m − 4 while Jenny’s utility function is:
money function is given
2
V (m) = −3 m − 2m . Calculate the Arrow-Pratt measures of absolute and relative
risk aversion for John and Jenny and compare them.
$24 $12 $48
Exercise 4.8 Amy faces the wealth lottery 2 3 1 and tells you that she
6 6 6
considers it equivalent to getting $18 for sure.
(a) Calculate the risk premium for lottery L for Amy.
(b) What is Amy’s attitude to risk?
√
(c) Could Amy’s vNM utility-of-money function be U(m) = m ?
(b) What is the risk premium associated with lottery L1 for Bill?
(c) What is the risk premium associated with lottery L2 for Bill?
102 Chapter 4. Money lotteries revisited
Exercise 4.10 Consider the following money lottery, where the outcomes are changes
in wealth:
−$50 $120
M= 1 3 .
4 4
Berta’s vNM utility-of-money function is U(m) = ln(m).
(a) Suppose that Berta’s inital wealth is $80. Write the wealth lottery corresponding
to lottery M above and calculate the risk premium for this lottery for Berta.
(b) Suppose that Berta’s inital wealth is $200. Write the wealth lottery corresponding
to lottery M above and calculate the risk premium for this lottery for Berta.
$120 $180 $260
Exercise 4.11 Consider the wealth lottery L = 2 2 1 and the fol-
5 5 5
lowing vNM utility-of-money functions defined for m ∈ [0, 300]:
√ m 2 m
U(m) = m and V (m) = − − 36 + + 1, 400.
10 20
(a) Write an equation whose solution gives RLU (the risk premium for lottery L and
utility function U) and verify that the solution is RLU = 3.6949.
(b) Write an equation whose solution gives RLV (the risk premium for lottery L and
utility function V ) and verify that the solution is RLV = 6.848.
(c) Using the Arrow-Pratt measure of absolute risk aversion, which of U and V
incorporates greater risk aversion?
Exercise 4.12 Plot the following utility functions in the same diagram:
2
Exercise 4.14 Consider the quadratic utility-of-money function U(m) = cm − m2 ,
where c is a positive constant and m ∈ [0, c). For this function is the Arrow-Pratt
measure of absolute risk aversion decreasing, constant or increasing?
4.5 Exercises 103
Exercise 4.17 Consider the lotteries of Exercise 4.16. Since it is not the case that M
dominates L in terms of first-order stochastic dominance, by Theorem 4.4.1 there must
be an increasing utility-of-money function U such that E[U(L)] > E[U(M)]. Construct
such a function. Note that you don’t need to define a function over the entire set of non-
negative real numbers: it is enough to define a function over the set {26, 40, 58, 80, 96}.
Exercise 4.18 Consider the following lotteries, which were discussed at the beginning
of Section 4.4.2:
$2 $3 $4 $5 $9
L= 1
3 0 31 0 13
0 $2 $3 $4 $5 $9 $2 $3 $4 $5 $9
M = 1 1 1 1 1 and M = 1 1 .
3 12 6 12 3 3 6 0 16 13
√
Show that an individual with utility-of-money function U(m) = m strictly prefers L
to M 0 and M 0 to M.
104 Chapter 4. Money lotteries revisited
= 77.88.
√
d
(c) dm (20 m − 4) = 20 2√1 m = 10
√
m
.
d2 √ 3
(20 m − 4) = 10 − 12 m− 2 = − √5 3 < 0, for every m > 0.
(d) dm2 m
(e) Jennifer is risk-averse since the second derivative of her utility function is negative
for every m > 0.
4.6 Solutions to Exercises 105
d2
(b) dm2
(8 + m1.65 ) = 1.0725
m0.35
> 0, for every m > 0. Thus risk love.
d2
(c) dm2
(2 + 7m) = 0. Thus risk neutrality.
1 1
Thus A(900) = 1,800 and A(1, 600) = 3,200 .
106 Chapter 4. Money lotteries revisited
1
V 00 (m) − √ 3 1
A(m) = − 0 = − 41 m =
V (m) √ 2m
2 m
the same as for Jennifer (as it should be, since they have the same preferences). Thus
1 1
A(900) = 1,800 and A(1, 600) = 3,200 .
1 m
AU (m) = AV (m) = and rU (m) = rV (m) = .
1−m 1−m
(a) When
Berta’s initial
wealth is $80, the corresponding wealth lottery is
$30 $200
L= 1 3 , whose expected value is $157.5. The risk premium is given
4 4
by the solution to
which is $33.0304.
(b) When
Berta’s initial
wealth is $200, the corresponding wealth lottery is
$150 $320
L= 1 3 , whose expected value is $277.5. The risk premium is given
4 4
by the solution to
which is $12.7199.
$120 $180 $260
Solution to Exercise 4.11 We are considering the wealth lottery L = 2 2 1 .
5 5 5
The expected value of L is 172.
√ √ √ √
(a) RLU is the solution to 172 − R = 25 120 + 25 180 + 15 260. The solution is
RLU = 3.6949.
172−R
2
− 10 − 36 + 172−R
20 + 1, 400 =
h 2 120 i
2 120
5 − 10 − 36 + 20 + 1, 400
h i
2
+ 52 − 180 180
10 − 36 + 20 + 1, 400
h 2
i
1 260 260
+ 5 − 10 − 36 + 20 + 1, 400 .
utility
1 − e−3m
1 − e−m
money
0
By constructing the corresponding cumulative distribution functions one can see that
M >FSD L (Qi ≤ Pi for every i = 1, . . . , 5 and Q3 < P3 ):
4.6 Solutions to Exercises 109
By constructing the corresponding cumulative distribution functions one can see that,
according to the criterion of first-order dominance, it is neither the case that L dominates
6 5
M (since, for example, P1 = 20 > Q1 = 20 ) nor the case that M dominates L (since
13 12
Q4 = 20 > P4 = 20 ):
Solution to Exercise 4.17
The lotteries are:
$26 $40 $58 $80 $96 $26 $40 $58 $80 $96
L= 6 4 2 8 and M= 5 4 2 2 7
20 20 20 0 20 20 20 20 20 20
1
Since L assigns an additional probability of 20 to $96 (relative to M), it is sufficient to have
a “big jump” in utility going from $80 to $96. For example, consider the following utility
function:
$26 $40 $58 $80 $96
Utility U : 1 2 3 4 10
6 4 2 8 5 4 2 2
Then E[U(L)] = 20 1 + 20 2 + 20 3 + 20 10 = 5 and E[U(M)] = 20 1 + 20 2 + 20 3 + + 20 4+
7
20 10 = 4.85. Thus an individual with this (strictly increasing) utility function prefers
lottery L to lottery M. One can also easily construct a strictly increasing utility function
according to which lottery M is preferred to lottery L (big jump at $80, small jump
at $96).
110 Chapter 4. Money lotteries revisited
3 9 18 8 2
(a) E[L] = 40 4 + 40 16 + 40 25 + 40 36 + 40 49 = 24.8.
23 9 3 8 17
(b) E[M] = 200 4 + 40 16 + 8 25 + 40 36 + 200 49 = 24.8.
√ 3 9
√ √ √ √
(c) E[U(L)] = 4 + 40
40 16 + 18 8 2
40 25 + 40 36 + 40 49 = 4.85
23
√ 9
√ √ √ √
(d) E[U(M)] = 200 4 + 40 16 + 38 25 + 40
8 17
36 + 200 49 = 4.8.
$6 $23 $44 $51 $70
M0 = .
77 1 1 83
192 12 0 12 192
$6 $23 $44 $51 $70
M= .
77 11 4349
192 94 0 0 9024
must be decreased.
We want to relate the shape of the indifference curves of an individual to her attitude
towards risk.
y
(probability 1 − p)
yA A
yA −yB p
xA −xB = − 1−p
rise
yB B
| {z }
run
x
0 xA xB (probability p)
p
slope: − 1−p
We now show that, if we take two lotteries A and B that yield the same expected utility
(so that they lie on the same indifference curve), then all the lotteries on the line segment
joining A and B (apart from A and B themselves) correspond to higher levels of expected
utility than A and B . Hence, since the utility function is assumed to be strictly increasing,
it follows that the indifference curve to which A and B belong, must lie below the line
segment that joins A and B, that is, the indifference curve must be convex towards the
origin.
As before, fix an arbitrary p ∈ (0, 1) and consider all the lotteries of the form
$x $y
p 1− p
which can be identified with points in the positive quadrant of the cartesian plane (x, y).
Let A = (xA , yA ) and B = (xB , yB ) lie on the same indifference curve, that is,
Fix an arbitrary t ∈ (0, 1) and consider the point C = tA + (1 − t)B on the line segment
joining A and B, which represents the lottery
txA + (1 − t)xB tyA + (1 − t)yB
C= .
p 1− p
Then
By (5.1),
= tE[U(A)] + (1 − t)E[U(B)]
= t û + (1 − t)û = û.
y
(probability 1 − p)
yA A
yB B
EU = û
x
0 xA xB (probability p)
txA + (1 − t)xB
Now consider the case of a risk-loving individual. Recall from Chapter 4 that the utility-of-
money function U(m) of a risk-loving individual is strictly convex, that is, for every x > 0
and y > 0 and for every t ∈ (0, 1),
With an argument similar to the one used in the previous section, one can show that, if
we take two lotteries A and B that yield the same expected utility – so that they lie on the
same indifference curve – all the lotteries on the line segment joining A and B (apart from
A and B themselves) correspond to lower levels of expected utility than A and B . Hence,
since the utility function is assumed to be strictly increasing, it follows that the indifference
curve to which A and B belong, must lie above the line segment that joins A and B, that is,
the indifference curve must be concave towards the origin, as shown in Figure 5.3.
y
(probability 1 − p)
A
yA
tyA + (1 − t)yB
tA + (1 − t)B = C
yB E[U(C)] < û
B
EU = û
x
0 xA xB (probability p)
txA + (1 − t)xB
Since B is close to A (that is, δ and ε are small), we can approximate the values of
U(xA + δ ) and U(yA + ε) using the derivative of U (that is, using a first-order Taylor
expansion):
that is,
rise
p U 0 (xA )
z}|{
ε
=− .
δ
|{z} 1 − p U 0 (yA )
run
p U 0 (xA )
− (5.9)
1 − p U 0 (yA )
1 Alternatively, one can derive the slope of an indifference curve at a point by using the implicit func-
tion theorem, which says the following. Let F : R2 → R be a continuously differentiable function and
(x0 , y0 ) ∈ R2 a point such that F(x0 , y0 ) = c; if ∂∂Fy (x0 , y0 ) 6= 0 then there is an interval (x0 − ε, x0 + ε) and a
differentiable function f : (x0 − ε, x0 + ε) → R such that (1) F(x0 , f (x0 )) = y0 , (2) F(x, f (x)) = c for every
∂F
(x0 ,y0 )
x ∈ (x0 − ε, x0 + ε) and (3) f 0 (x0 ) = − ∂∂Fx . To apply the implicit function theorem in this context, let
∂ y (x0 ,y0 )
F(x, y) = pU(x) + (1 − p)U(y) and let A = (xA , yA ) be a point where pU(xA ) + (1 − p)U(yA ) = û.
5.1 Binary lotteries and indifference curves 119
(xA ) 0
In the case of risk neutrality U 0 is constant and thus U 0 (xA ) = U 0 (yA ) so that U
U 0 (yA ) = 1;
p
hence the slope becomes − 1−p at every point, consistently with what we saw above.
Now let us see what (5.9) implies for a concave utility-of-money function, that is, for
the case of risk aversion. When the utility function is concave, the second derivative is
negative (U 00 (m) < 0), which means that the first derivative is decreasing, that is,
U 0 (m1 )
0 0
if m1 < m2 then U (m1 ) > U (m2 ) or 0 >1 ,
U (m2 )
as shown in Figure 5.4.
U 0 (m1 )
U 0 (m2 )
money money
0 m m1 m2 m
Figure 5.4: When the utility function is concave, marginal utility is decreasing.
U 0 (x)
R At a point A = (x, y) above the 45o line (where x < y) we have that U 0 (y) > 1 so that
p U 0 (x) p p U 0 (x) p
− <− or > .
1 − p U 0 (y) 1− p 1 − p U 0 (y) 1 − p
p
Hence the indifference curve is steeper than the straight line with slope − 1−p .
Conversely, at a point A = (x, y) below the 45o line (where x > y) we have that
U 0 (x)
U 0 (y) < 1 so that
p U 0 (x) p p U 0 (x) p
− 0
>− or 0
< .
1 − p U (y) 1− p 1 − p U (y) 1 − p
p
Hence the indifference curve is less steep than the straight line with slope − 1−p .
U 0 (x)
Finally at a point on the 45o line (where x = y) we have that U 0 (y) = 1 so that
p U 0 (x) p
− =− ;
1 − p U 0 (y) 1− p
p
hence the straight line with slope − 1−p is tangent to the indifference curve.
120 Chapter 5. Insurance: Part 2
2 √
Example 5.1 This example is illustrated in Figure 5.5. Let p = and U(m) = m and
5
√
$x $y 0 (m) = √ 1 U 0 (x) y
consider all the lotteries of the form 2 3 . Since, U ,
2 m U 0 (y)
= √
x
(for
5 5
x > 0 and y > 0).
Consider three points: (25,100), (64,64) and (121,36). The expected utility of these three
lotteries is the same, namely 8; hence these three points belong to the same indifference
curve.2
• Point (64,64) is on the 45o line and the slope of the indifference curve at that point is
√ !
p 64 p 2
− √ =− =− .
1− p 64 1− p 3
• Point (25,100) is above the 45o line and the slope of the indifference curve at that
point is
√ !
p 100 2 10 4
− √ =− =− .
1− p 25 3 5 3
• Point (121,36) is below the 45o line and the slope of the indifference curve at that
point is
√ !
p 36 2 6 4
− √ =− =− .
1− p 121 3 11 11
y
(probability 35 )
45o line
EU = 8
100
64
36 4
slope: − 11
x
0 (probability 25 )
25 64 121
slope:
slope: p
= − 1−p
− 43
= − 32
We will omit the case of risk love (convex utility function, concave indifference curves).
The reader should convince himself/herself that in this case an indifference curve is less
p
steep than the line of slope − 1−p at a point above the 45o line and steeper at a point below
o
the 45 line.
Using the tools developed in this chapter, we can draw the individual’s indifference curve
122 Chapter 5. Insurance: Part 2
that goes through the no-insurance point NI: it will be a decreasing and convex curve; we
shall call it the reservation indifference curve. Points below the reservation indifference
curve represent contracts that would yield lower expected utility than the no-insurance
option; thus the individual would reject any such contracts, if offered to her. Only contracts
represented by points on or above the reservation indifference curve will be acceptable to
the individual.
Thus the set of mutually beneficial insurance contracts is given by the area bounded
below by the reservation indifference curve, bounded above by the zero-profit line and
bounded on the right by the 450 line; it is shown as a shaded area in Figure 5.6.
Wealth in
good state
(probability 1 − p)
W2
NI 45o line
W0
C zero-profit
line
reservation
indifference
curve
W1 Wealth in
W0 − ` bad state
0 (probability p)
Figure 5.6: The shaded area is the set of mutually beneficial insurance contracts.
Contract C in Figure 5.6 is above the reservation indifference curve and thus yields
higher expected utility than the no-insurance option (that is, C is strictly preferred to NI by
the potential customer) and is below the zero-profit line and thus yields positive profit to
the insurer.
In Figure 5.6 the reservation indifference curve that goes through the no-insurance (NI)
point is steeper at that point than the zero-profit line. This follows from the analysis in
Section 5.1.4. Indeed this is true of any point that lies above the 45o line. Let A = (W1A ,W2A )
be a contract that lies above the 45o line. Then, by (5.9) the slope, at point A, of the
indifference curve that goes through A is equal to
0 A
p U (W1 )
− (5.10)
1 − p U 0 (W2A )
5.2 Back to insurance 123
Recall that the slope of the isoprofit line that goes through any point in the wealth diagram
p
is − 1−p . By the remark on page 119,
• At any point above the 45o line the indifference curve is steeper than the isoprofit
line that goes through that point.
• At any point on the 45o line the indifference curve is tangent to (has the same slope
as) the isoprofit line that goes through that point.
This is shown in Figure 5.7. Any contract that lies in the area above the indifference curve
that goes through contract A and below the isoprofit line through A, such as point B
in Figure 5.7, represents a contract that is better than A for the potential customer
(B yields higher expected utility than A) and is better than A for the insurance company (B
yields higher profits than A).
Wealth in
good state
(probability 1 − p)
W2
NI
W0 45o line
A
W2A
B
zero-profit
line
iso-profit
line
W1 Wealth in
bad state
0 W1A
(probability p)
W0 − `
Figure 5.7: The relative slope of an indifference curve and an isoprofit line.
124 Chapter 5. Insurance: Part 2
Wealth in
good state
(probability 1 − p)
W2
reservation
NI indifference 45o line
W0 curve
hmax
W0 − hmax C
W1 Wealth in
W0 − ` W0 − hmax bad state
0 (probability p)
Figure 5.8: Contract C is the full-insurance contract that would be offered by a monopolist.
3 Recall that, at any point on the 45o line, the indifference curve is tangent to the isoprofit line.
5.2 Back to insurance 125
hmax = p` + RNI
W0 − ` W0
where NI = is the no-insurance lottery. That is, hmax is equal to the
p 1− p
expected loss plus the risk premium of the no-insurance lottery.
1 √
For example, if W0 = 1, 600, ` = 700, p = 10 and U(m) = m then hmax is given by
the solution to the equation
p 1p 9p
1, 600 − h = 1, 600 − 700 + 1, 600
10 10
2. there is no unexploited profit opportunity in the industry, that is, there is no currently-
not-offered contract that would attract some custmers and yield positive profit to a
firm that offered that contract.
126 Chapter 5. Insurance: Part 2
By the zero-profit condition (Point 1), any equilibrium contract must be on the zero-
profit line. By the no-profitable-opportunity condition (Point 2), it cannot be a partial-
insurance contract, such as contract A in Figure 5.9, because a new entrant (or an existing
firm) could offer a contract in the region above the indifference curve through point A
and below the iso-profit line through point A, such as contract B in Figure 5.9; such a
contract would induce all those customers who were purchasing contract A to switch to B
and would yield positive profits to the insurance firm offering it. The only contract that is
immune to this is the contract at the intersection of the zero-profit line and the 45o line
(contract D in Figure 5.9).
Wealth in
good state
(probability 1 − p)
W2
NI
W0 45o line
A
B D
zero-profit
line
W1 Wealth in
bad state
0 W0 − ` (probability p)
Figure 5.9: Contract D is the full-insurance contract that would be offered at a free-entry
competitive equilibrium.
premium deductible
Contract 1: $82 0
Contract 2: $62 $100
Contract 3: $40 $200
This if he decides to insure, he will choose Contract 3. To see if he does decide to insure
we need to compare the expected utility of the √ best contract, namely√ Contract 3, with
the expected utility of no insurance, which is 0.2 1, 000 − 400 + 0.8 1, 000 = 30.1972.
Since Contract 3 (the best of the three offered contracts) is better than no insurance, he
will purchase Contract 3.
128 Chapter 5. Insurance: Part 2
deductible premium
0 $820
$100 $800
$140 $792
$260 $768
... ...
The set of possible choices is infinite, since any d ∈ [0, 4100] can be chosen by the
individual. Thus we can think of (5.12) as a line, similar to the budget line faced by a
consumer. We shall call it the insurance budget line.
It is useful to translate the line of equation (5.12) – which is expressed in terms of premium
and deductible – into a line in the wealth diagram (W1 ,W2 ). This is easily done by recalling
that h = W0 −W2 and d = W2 −W1 :
1 5W0 W1
W0 −W2 = 820 − (W2 −W1 ) , that is, W2 = − 1025 − . (5.13)
5 4 4
For example, if W0 = 6, 000 then (5.13) becomes W0 −W2 = 820 − 15 (W2 −W1 ), that is,
W1
W2 = 6, 475 − . (5.14)
4
Note that the line (in the wealth space) corresponding to equation (5.14) goes through the
no-insurance point NI = (1900, 6000); in fact, replacing W1 with the value 1900 in (5.14)
we get W2 = 6000.
5.3 Choosing from a menu of contracts 129
Wealth in
good state
(probability 1 − p)
W2
45o line
insurance budget line
W2 = 6475 − 41 W1
NI
6,000
5,180
W1 Wealth in
bad state
0 1,900 5,180
(probability p)
which – translated into the wealth space (by replacing h with (W0 − W2 ) and d with
(W2 −W1 )) – becomes
W0 − a b
W2 = − W1 . (5.16)
1−b 1−b
If (5.15) is such that a − b ` = 0 then the insurance budget line in the wealth space (defined
by (5.16)) goes through the no-insurance point NI = (W0 − `,W0 ),4 while if a − b ` > 0
then the insurance budget line in the wealth space (defined by (5.16)) goes through a point
vertically below NI.5
What contract, if any, would the individual choose from the insurance budget line?
We start with the case where the insurance budget line goes through the NI point. There
are three cases to consider.
Case 1: the reservation indifference curve is, at NI, as steep as, or less steep than, the
insurance budget line, as shown in Figure 5.11. It follows that the entire insurance budget
line lies below the reservation indifference curve and thus the individual will choose not
to insure.
Wealth in
good state
(probability 1 − p)
W2
NI 45o line
W0
reservation
insurance indifference curve
budget line
W1 Wealth in
W0 − ` bad state
0 (probability p)
Figure 5.11: Case 1: the insurance budget line lies below the reservation indifference
curve.
In the example above, where W0 = 6, 000, ` = 4, 100 and the insurance budget line is
given by the equation W2 = 6, 475 − W41 , we will be in Case 1 if and only if
0
p U (1, 900) 1
0
≤ .
1 − p U (6, 000) 4
19
For instance, if U(m) = ln(m), then Case 1 occurs if and only if p ≤ 259 = 0.0734.
5.3 Choosing from a menu of contracts 131
Case 2: the indifference curve that goes through the point at the intersection of the 45o
line and the insurance budget line is steeper than, or as steep as, the insurance budget line
at that point, as shown in Figure 5.12.6
Wealth in
good state
(probability 1 − p)
W2
45o line
NI
W0
W1 Wealth in
bad state
0 W0 − `
(probability p)
In this case, insurance is better than no insurance and the best contract (that is, the
contract that yields the highest expected utility) is the full insurance contract (the point at
the intersection of the 45o line and the insurance budget).
In the example above, where the insurance budget line is given by the equation
W2 = 6, 475 − W41 , we will be in Case 2 if and only if (recall that the slope of any in-
p
difference curve at any point on the 45o line is − 1−p )
p 1 1
≥ i.e. p≥ .
1− p 4 5
6 The indifference curve will be, at that point, as steep as the insurance budget line if and only if the
p
insurance budget line is the zero-profit line, since that slope will be − 1−p . It is steeper if and only if
the insurance budget line is less steep than the zero-profit line, which implies that all the contracts on the
insurance budget line – with the exception of the NI point – yield negative profits; thus it is unlikely that an
insurance company would offer such menu (unless it is subsidized by the government).
132 Chapter 5. Insurance: Part 2
Case 3: the indifference curve that goes through the point at the intersection of the 45o
line and the insurance budget line is less steep than the insurance budget line at that point,
as shown in Figure 5.13.
Wealth in
good state
(probability 1 − p)
W2
45o line
NI
W0
W1 Wealth in
bad state
0 W0 − `
(probability p)
In this case, there are partial-insurance contracts on the insurance budget line that are
better than full insurance (and than no insurance). Thus the individual will choose a partial
insurance contract.
Which of the many partial-insurance contracts will she choose? It cannot be a contract
where the indifference curve through it is either steeper, or less steep, than the insurance
budget line: in the former case there would be contracts on the budget line to the right of
that contract that would be better and in the latter case there would be contracts on the
budget line to the left of that contract that would be better. Hence the best contract is the
one at which the slope of the indifference curve at that point is equal to the slope of the
insurance budget line, that is, it is a contract at which the indifference curve through it is
tangent to the insurance budget line: it is that contract C = W1C ,W2C such that, letting
W2 = α − βW1 be the equation of the budget line (with α > 0 and β > 0),
0 C
p U (W1 )
W2C = α − βW1C and = β. (5.17)
1 − p U 0 (W2C )
5.3 Choosing from a menu of contracts 133
Wealth in
good state
(probability 1 − p)
W2
NI 45o line
W0
C
W1 Wealth in
W0 − ` bad state
0 (probability p)
In the example above, where W0 = 6, 000, ` = 4, 100 and the insurance budget line is
given by the equation W2 = 6, 475 − W41 , we will be in Case 3 if and only if7
0
p U (1, 900) 1 p 1
0
> and < .
1 − p U (6, 000) 4 1− p 4
19
For instance, if U(m) = ln(m), then Case 3 occurs if and only if p > 259 and p < 15 , that
19 1
is, if and only if p ∈ 259 , 5 . Fix a value of p in this range and continue to assume that
U(m) = ln(m). Then the optimal contract C = (W1C ,W2C ) is given by the solution to
W1 p W2 1
W2 = 6, 475 − and = .
4 1 − p W1 4
For example, if p = 17 then the optimal contract is C = (3700, 5550), that is, the individual
will choose a deductible of 5, 550 − 3, 700 = $1, 850 with a corresponding premium of
6, 000 − 5, 550 = $450.
7 The first inequality says that the reservation indifference curve is steeper at NI than the insurance budget
line. The second inequality says that the indifference curve that goes through the point on the 45o line that
lies on the insurance budget line is less steep at that point than the budget line.
134 Chapter 5. Insurance: Part 2
So far we have assumed that the insurance budget line goes through the no-insurance
point NI. If it does not then, given the assumptions stated in (5.15) on page 129, it will go
through a point vertically below the no-insurance point. There are several possibilities.
A first possibility is that the insurance budget line lies entirely below the reservation
indifference curve (this situation is similar to Case 1 considered above). In such a case
the individual will choose not to insure, since any of the offered contracts yields a lower
expected utility than no insurance.
A second possibility is that there are points on the insurance budget line that are
above the reservation indifference curve, as well as points that are below the reservation
indifference curve. This case can be further subdivided into two sub-cases.
The first sub-case is that there is only one point of intersection between the insurance
budget line and the reservation indifference curve, as shown in Figure 5.15. In this case
the entire segment of the budget line to the right of the intersection point lies above the
reservation indifference curve.
Wealth in
good state
(probability 1 − p)
W2
NI 45o line
W0
W1 Wealth in
W0 − ` bad state
0 (probability p)
Figure 5.15: The case where the insurance budget line does not go through NI.
5.3 Choosing from a menu of contracts 135
In this case,
• if the indifference curve that goes through the point at the intersection of the 45o
line and the insurance budget line is - at that point - steeper than, or as steep as,
the insurance budget line (as shown in the left panel of Figure 5.16), then the best
contract for the individual is the full-insurance contract;
• if the indifference curve that goes through the point at the intersection of the 45o
line and the insurance budget line is - at that point - less steep than the insurance
budget line (as shown in the right panel of Figure 5.16), then the best contract for the
individual is a partial-insurance contract, namely a point at which there is a tangency
between the budget line and the indifference curve that goes through that point (point
C in the right panel of Figure 5.16).
W2 W2
NI 45o NI 45o
W0 line W0 line
W1 W1
0 W0 − ` 0 W0 − `
Figure 5.16: In the case on the left the individual chooses full insurance, in the case on the
right partial insurance.
For example, let W0 = 3, 600, ` = 1, 100 and p = 15 . Suppose that the insurance
company is willing to offer any contact that yields a profit of $10. Then premium and
deductible are related by the equation h − p(` − d) = 10, that is, h = 230 − d5 . Let us
translate it into a line in the wealth space: 3, 600 −W2 = 230 − 15 (W2 −W1 ), that is,
W1
W2 = 4, 212.5 − .
4
This is the equation of the isoprofit line corresponding to a profit-level of 10. Call B the
point of intersection of the budget line and the 45o line. Then the slope of the isoprofit
line is equal to the slope, at point B, of the indifference curve that goes through point B.
Hence we are in the subcase shown in the left panel of Figure 5.16 and the individual will
choose the full-insurance contract, with a premium of $230 (assuming, of course, that
the utility-of-money function is such that the reservation indifference curve crosses the
insurance budget line).
On the other hand, if the insurance budget line is steeper than an isoprofit line, then
we are in the case shown in the right panel of Figure 5.16 and the individual will choose a
partial-insurance contract.
136 Chapter 5. Insurance: Part 2
The second sub-case is that there are two points of intersection between the insurance
budget line and the reservation indifference curve, as shown in Figure 5.17.
Wealth in
good state
(probability 1 − p)
W2
NI 45o line
W0
W1 Wealth in
W0 − ` bad state
0 (probability p)
In this case the individual will choose a partial-insurance contract, namely the contract
on the insurance budget line at which there is a tangency between the budget line and the
indifference curve that goes through that point (shown as point C in Figure 5.17).
We will not consider cases where the insurance budget line does not cover the entire
range [0, `] of possible deductibles and is thus a smaller segment than considered so far; it
should be clear, however, that the method would be the same, namely based on comparing
the relative slopes of indifference curves and the budget line. An example of this is given
in Exercise 5.16.
5.3 Choosing from a menu of contracts 137
such that
0 C
C C p U (W1 )
W2 = α − βW1 and = β.
1 − p U 0 (W2C )
8 The reader should convince herself/himself that, indeed, the following steps cover all the cases considered
in this section.
9 Here and elsewhere we are implicitly assuming that if the individual is indifferent between insuring and
facing mounting losses, some property insurers are pulling back from selling
policies in California and other western states where wildfire risk is elevated.
In California alone, damages had mounted to at least $12 billion by early
2018.
Clearly, the unavailability of insurance makes a risk-averse individual worse off. Is there
anything that he/she can do mitigate the welfare loss due to the inability to but insurance?
Imagine that Ann and Bob are friends who live in high-risk areas that insurers have
decided to no longer cover. To make things simple, imagine that Ann and Bob have the
same initial wealth $W0 and their homes are of equal value, so that they both face the same
potential loss of $` if a fire occurs; furthermore, assume that they face the same probability
p of a fire occurring. Without insurance they both face the same money lottery, namely
W0 W0 − `
NI =
1− p p
One possible course of action for Ann and Bob is to resort to mutual-insurance, that is, to
insure each other by signing the following contract:
Would such a contract make them better off relative to no insurance? Let us assume
they leave in locations that are far apart, so that the event of a fire in Ann’s area can
plausibly be treated as independent of the event of a fire in Bob’s area. When two events
are independent, the probability of them jointly occurring is equal to the product of the
individual probabilities. Thus the probabilities can be computed as follows:
Thus, each individual will suffers a loss of $ 2` (either in the form of a payment to a less
lucky friend or in the form of a loss of ` followed by a reimbursement, in the amount of 2` ,
10 https://www.cbsnews.com/news/california-wildfires-property-insurers-cancel-policies-because-of-risk/
5.4 Mutual insurance 139
from the luckier friend) with probability 2p(1 − p), so that the contract will give rise to
the following money lottery for each of them (‘MI’ stands for ‘Mutual Insurance’):
W0 − 2`
W0 W0 − `
MI =
(1 − p)2 2p(1 − p) p2
Assuming that Ann has vNM preferences over money lotteries and prefers more money to
less (a similar argument applies to Bob), we can represent her preferences by means of a
normalized vNM utility function as follows, with 0 < a < 1:
money: W0 W0 − 2` W0 − `
utility: 1 a 0
Then the expected utilities of the two lotteries are:
E[U(NI)] = 1 − p, and
Consider first the case where Ann is risk neutral. Then it must be that a = 12 . In fact,
W0 − 2`
W0 W0 − `
the lottery and the lottery 1 1 have the same expected value,
1 2 2
namely W0 − 2` and thus Ann must be indifferent between them. The expected utility of the
former is a and the expected utility of the latter is 12 ; thus a = 12 . When a = 12 , (2a − 1) = 0
and thus E[U(NI)] = E[U(MI)] so that Ann does not gain from signing the contract: with
the contract she is as well off as without the contract.
Next consider first the case where Ann is risk averse. Then it must be that a > 21 . In
W0 − 2`
fact, the lottery , whose expected utility is a, gives for sure the expected value
1
W0 W0 − `
of the lottery 1 1 , whose expected utility is 12 . By definition of risk aversion,
2 2
Ann prefers the former lottery to the latter. Hence a > 21 . When a > 12 , (2a − 1) > 0 and
thus (1 − p)[1 + p(2a − 1)] > 1 − p so that E[U(MI)] > E[U(NI)], that is, Ann is better
off with the contract than without the contract.
Thus we have shown that two risk-averse individuals can make themselves better off by
signing a mutual-insurance agreement, according to which they share the losses equally .
The astute reader will have realized that there was no need for a detailed proof of
the fact that a risk-averse individual will prefer lottery MI to lottery NI, since it follows
from the analysis of Chapter 4 (Section 4.4.2): lottery NI can be obtained from lottery
MI by means of a mean-preserving spread. First of all, the reader should verify that
E[NI] = E[MI] = W0 − p`. Secondly, by taking the probability of outcome W0 − 2` ,
namely 2p(1 − p), and spreading it equally to each of the other outcomes we obtain lottery
NI; in fact, (1 − p)2 + 21 2p(1 − p) = 1 − p and p2 + 12 2p(1 − p) = p.
5.5 Exercises
The solutions to the following exercises are given in Section 5.6 at the end of this chapter.
5.5.1 Exercises for Section 5.1: Binary lotteries and indifference curves
$x $y
Exercise 5.1 Consider all the lotteries of the form 1 2 with x ≥ 0 and y ≥ 0.
3 3
Assume that the individual in question is risk neutral.
(a) Write the equation of the indifference curve that goes through point
(x = 6, y = 10).
(b) Write the equation of the indifference curve that goes through point (10,8).
(c) Write the equation of the indifference curve that goes through point (4,9).
$x $y
Exercise 5.2 Consider all the lotteries of the form 1 2 with x ≥ 0 and y ≥ 0.
3 3
Consider an individual with von Neumann-Morgenstern utility-of-money function
U(m) = ln(m).
$10 $40
(a) Calculate the expected utility of lottery A = 1 2 .
3 3
$10 $10
(b) Calculate the expected utility of lottery B = 1 2 .
3 3
(c) Calculate the slope of the indifference curve at point A = (10, 40).
(d) Calculate the slope of the indifference curve at point B = (10, 10).
(e) In the (x, y)-plane draw the indifference curve that goes to point A = (10, 40) and
the indifference curve that goes to point B = (10, 10).
Exercise 5.3 Repeat Parts (a)-(e) of the previous question for the case of an individual
who is risk neutral.
5.5 Exercises 141
$x $y
Exercise 5.4 Consider all the lotteries of the form 2 1 with x ≥ 0 and y ≥ 0.
3 3
Consider an individual with von Neumann-Morgenstern utility-of-money function
U(m) = ln(m).
(a) Write an equation whose solutions give the setof lotteries that the individual
$4 $4
considers just as good as lottery A = 2 1 .
3 3
(b) Solve the equation of Part (a) and obtain a function y = f (x) whose graph is the
indifference curve that goes through point (4, 4).
(c) Write an equation whose solutions give the setof lotteries that the individual
$9 $4
considers just as good as lottery B = 2 1 .
3 3
(d) Solve the equation of Part (c) and obtain a function y = g(x) whose graph is the
indifference curve that goes through point (9, 4).
(e) Calculate the slope of the indifference curve at point A = (4, 4).
(f) Calculate the slope of the indifference curve at point B = (9, 4).
$x $y
Exercise 5.5 Consider all the lotteries of the form 1 4 with x ≥ 0 and y ≥ 0.
5 5
Let A = (100, 25), B = (4, 49) and C = (40, 40).
(a) Draw the indifference curves that go through points A, B and C for an√individual
with von Neumann-Morgenstern utility-of-money function U(m) = m.
(b) Draw the indifference curves that go through points A, B and C for a risk-neutral
individual.
1
Exercise 5.7 Frank has a wealth of $W0 . With probability p = 10 he faces a loss of
$`. The maximum he is willing to pay for full insurance is $800. The risk premium
associated with the lottery corresponding to no insurance is $500.
(a) What is the value of `?
(b) What is the maximum profit that a monopolist can make by selling insurance to
Frank?
Exercise 5.8 Bob owns a house. The value of the land is $75, 000 while the value of
the building is $110, 000. The rest of his wealth consists of the balance of his bank
account, which is $10, 000. Thus his current wealth is $195, 000. Bob lives in an area
where there is a 5% probability that a fire will completely destroy his house during any
year (while the land will not be affected by a fire). Bob’s utility function is given by:
where m ∈ [0, 20] denotes money measured in $10, 000 (thus, for example, m = 11
means $110, 000).
(a) What is Bob’s expected loss if he does not insure?
(b) What is Bob’s expected wealth if he does not insure?
(c) What is Bob’s expected utility if he does not insure?
(d) What is Bob’s expected utility if he purchases an insurance contract with premium
$1, 200 and deductible $20, 000?
(e) What is the slope of Bob’s reservation indifference curve at the no-insurance
point?
(f) Let A be the point in the wealth diagram that corresponds to the insurance contract
with premium $1, 200 and deductible $20, 000. What is the slope, at point A, of
Bob’s indifference curve that goes through point A?
(g) Does the indifference curve that goes through point A of Part (f) lie above or
below the reservation indifference curve?
(h) What is the maximum premium that Bob would be willing to pay for full insur-
ance?
Exercise 5.10 Barbara has a wealth of $80,000 and faces a potential loss of $20,000
√
with probability 10%. Her utility-of-money function is U(m) = m . An insurance
company offers her the following menu of contracts:
premium deductible
Contract 1: $2,340 $500
Contract 2: $2,280 $1,000
Contract 3: $2,220 $1,500
Contract 4: $2,160 $2,000
Exercise 5.11 You have the following vNM utility-of-money function: U(m) = ln(m).
Your initial wealth is $10,000 and you face a potential loss of $4,000 with probability
1
6 . An insurance company offers you the following menu of choices: if you choose
deductible d (with 0 ≤ d ≤ 4, 000) then your premium is h = 800 − 0.2 d.
(a) Translate the equation h = 800 − 0.2d into an equation in terms of wealth levels.
(b) Compare the slope of the reservation indifference curve at the no-insurance point
NI to the slope of the insurance budget line. Are there contracts that are better for
you than no insurance?
(c) Which contract will you choose from the menu?
(d) Compare expected utility if you do not insure with expected utility if you purchase
the best contract from the menu.
(e) What is the insurance company’s expected profit from the contract of Part (c)?
(f) Prove the result of Part (c) directly by expressing expected utility as a function of
the deductible d and by maximizing that expression.
√
Exercise 5.12 You have the following vNM utility-of-money function: U(m) = m.
1
Your initial wealth is $576 and you face a potential loss of $176 with probability 16 . An
insurance company offers you the following menu of choices: if you choose deductible
d (with 0 ≤ d ≤ 176) then your premium is h = 19 (176 − d).
(a) Translate the equation h = 91 (176 − d) into an equation in terms of wealth levels.
(b) Compare the slope of the reservation indifference curve at the no-insurance point
NI to the slope of the insurance budget line. Are there contracts that are better for
you than no insurance?
144 Chapter 5. Insurance: Part 2
Exercise 5.13 [Note: in this exercise the data is the same as in Exercise 5.12, but we
1
have changed the probability of loss from to 17 .]
16
√
Your vNM utility-of-money function is U(m) = m; your initial wealth is $576 and
you face a potential loss of $176 with probability 17 . An insurance company offers you
the following menu of choices: if you choose deductible d (with 0 ≤ d ≤ 176) then
your premium is h = 91 (176 − d).
(a) Translate the equation h = 91 (176 − d) into an equation in terms of wealth levels.
(b) Compare the slope of the reservation indifference curve at the no-insurance point
NI to the slope of the insurance budget line. Are there contracts that are better for
you than no insurance?
(c) Which contract will you choose from the menu?
(d) Compare expected utility if you do not insure with expected utility if you purchase
the best contract from the menu.
(e) What is the insurance company’s expected profit from the contract of Part (c)?
(f) Confirm the result of Part (c) by expressing expected utility as a function of the
deductible d and by finding the maximum of that function in the interval [0, 400].
choose the amount (of your potential loss) that you would like to be covered
(thus a number in the range from 0 to 3,000); for every dollar of coverage
you will pay $γ as premium (with 0 < γ < 1).
(c) Write an equation that expresses the premium in terms of the deductible. [Recall
that the deductible is that part of the loss that is not covered.].
(d) Translate the equation of Part (c) into an equation in terms of David’s wealth
levels.
(e) Does the insurance budget line of Part (d) go through the no-insurance point NI?
(f) For what values of γ will David choose not to insure?
(g) For what values of γ will Davis purchase full insurance?
(h) Assuming that the value of γ is in the range where David chooses partial insurance,
write a system of two equations whose solution gives the contract that David will
choose.
5.5 Exercises 145
Exercise 5.15 Anna’s vNM utility-of-money function is U(m) = ln(m). Her initial
wealth is $3,600 and she faces a potential loss of $2,700 with 25% probability. An
insurance company is offering Anna any contract such that premium h and deductible d
3
satisfy the following equation: h = 810 − 10 d.
3
(a) Translate the equation h = 810 − 10 d into an equation in terms of wealth levels.
(b) Does the equation found in Part (a) correspond to an isoprofit line?
(c) Does the insurance budget line of Part (a) go through the no-insurance point?
(d) Are there any contracts on the insurance budget line that Anna prefers to no
insurance?
(e) What is the best contract on the insurance budget line for Anna?
(f) Calculate Anna’s expected utility at the following points:
(1) NI (no insurance),
(2) the full-insurance contracts that belongs to the budget line, and
(3) the contract found in Part (e).
(g) Prove the result of Part (e) directly by expressing expected utility as a function of
the deductible d and by maximizing that expression.
Exercise 5.16 Kate has an initial wealth of W0 = $1, 600 and faces a potential loss
of ` = $576 with probability
√ 20%. Her von Neumann-Morgenstern utility-of-money
function is U(m) = m. An insurance company is offering the following menu of
contracts:
3
h = 152 − d.
10
However, the deductible is restricted to the interval [0, 360] (hence the largest deductible
is 360 rather than 576).
Let A = W1A ,W2A be the point in wealth space corresponding to the contract
(h = 44, d = 360).
3
(a) Translate the insurance budget line h = 152 − 10 d (d ∈ [0, 360]) into a budget
line in wealth space (give the range of values).
(b) Does contract A lie on, below or above the reservation indifference curve?
(c) Calculate the slope, at point A, of the indifference curve that goes through A,
compare it to the slope of the insurance budget line and deduce which contract
from the offered menu will be chosen by Kate.
146 Chapter 5. Insurance: Part 2
Exercise 5.17 Carla and Don have the same initial wealth, namely $32,400, and face
the same potential loss, namely $18,000, with the same probability, namely 15 .
Carla’s vNM utility-of-money function is
√
UC (m) = m
while Don’s is
1
UD (m) = 1 − m .
10,000 +1
They are unable to obtain insurance on the market, so they have decided to write a
mutual insurance contract, according to which any losses are shared equally between
them.
Assuming that the event that Carla suffers a loss is independent of the event that Don
suffers a loss, show that signing the mutual insurance contract has made each of them
better off relative to no insurance.
Exercise 5.18 Ann, Carla and Dana have the same initial wealth, namely $40,000,
and face the same potential loss, namely $30,000, with the same probability, namely
1
5 . They have the √same vNM preferences, represented by the vNM utility-of-money
function U(m) = m. They are unable to obtain insurance on the market and have
decided to sign a mutual insurance contract, according to which any losses suffered by
any of them will be shared equally by all three of them. Assume that the event that any
of them suffers a loss is independent of the event(s) that the other(s) also suffer a loss.
(a) Calculate the probabilities of the following events:
1. All three of them suffer a loss.
2. Exactly two of them suffer a loss.
3. Exactly one of them suffers a loss.
4. None of them suffers a loss.
(b) Show that each of them is better off with the mutual insurance contract relative to
no insurance.
5.6 Solutions to Exercises 147
Exercise 5.19 In this exercise we consider the benefit of mutual insurance when
independence fails to hold: one person suffering a loss makes it more likely that the
other person would also suffer a loss.
We consider two individuals, Albert an Ben, who have the same initial wealth,
namely $40,000, and face the same potential loss of $30,000 due to wildfire, with the
same probability, namely 15 . They have the√ same vNM preferences, represented by the
vNM utility-of-money function U(m) = m.
Since they live not far apart from each other if a fire occurs at one of the two
properties then the probability that there will be a fire at the other property is greater
than 15 . If the events were independent then the probabilities would be as follows:
Suppose that Albert and Ben are unable to obtain insurance on the market. Are they
better off with no insurance or with a mutual insurance agreement (according to which
any losses suffered by either of them will be shared equally by both)?
(c) The indifference curve that goes through A = (10, 40) is the set of lotteries that
yield an expected utility of 3.227. It is a convex curve since the utility function U
incorporates risk aversion. The slope of the indifference curve at point A is equal to
1
!
0 1
p U (10) 3 10
− =− 2 = −2.
1 − p U 0 (40) 3
1
40
(d) Similarly, the indifference curve that goes through B = (10, 10) is the set of lotteries
that yield an expected utility of 2.303. It is a convex curve. The slope of the
indifference curve at point B is equal to
1 1
p U 0 (10) 1
− 0
= − 32 10
1
=− .
1 − p U (10) 3 10
2
y
(probability 32 )
A
40
B
10
x
0 (probability 13 )
10
slope: − 12 slope: −2
(c) The indifference curve that goes through A = (10, 40) is the set of lotteries that yield
an expected utility (= value) of 30. It is a straight line because of risk neutrality. The
slope of the indifference curve at point A is equal to
0 1
p U (10) 3 1 1
− =− =− .
1 − p U 0 (40) 2
3
1 2
(d) Similarly, the indifference curve that goes through B = (10, 10) is the set of lotteries
that yield an expected utility (= value) of 10. It is a straight line because of risk
neutrality. The slope of the indifference curve at point B is equal to
0 1
p U (10) 3 1
− =− 1=− .
1 − p U 0 (10) 2
3
2
y
(probability 23 )
A
40
B
10 slope: − 12
x
0 (probability 13 )
10
slope: − 12
2 1
(a) 3 ln(x) + 3 ln(y) = ln(4).
(b) First of all, rewrite the above equation as 2 ln(x) + ln(y) = 3 ln(4), from which we get
2
that e(2 ln(x)+ln(y)) = e3 ln(4) . Now, e(2 ln(x)+ln(y)) = e2 ln(x) eln(y) = eln(x) y = x2 y.
Similarly, e3 ln(4) = 43 = 64. Thus the equation becomes x2 y = 64 from which we
64
get y = 2 .
x
2 1 2 1
(c) 3 ln(x) + 3 ln(y) = 3 ln(9) + 3 ln(4).
(d) Repeating the steps of Part (b): from 2 ln(x) + ln(y) = 2 ln(9) + ln(4) we get
324
e(2 ln(x)+ln(y)) = e(2 ln(9)+ln(4)) , which becomes x2 y = 92 4 which yields y = 2 .
x
(e) The slope of the indifference curve at point A = (4, 4) is equal to11
0 2
p U (4) 3
− =− 1 = −2.
1 − p U 0 (4) 1
3
(f) The slope of the indifference curve at point B = (9, 4) is equal to12
2 1
0 !
p U (9) 3 9 8
− =− =− .
1 − p U 0 (4) 1
3
1
4
9
√ √ √ √
(a) E[U(A)] = 15 100 + 45 25 = 2 + 4 = 6 and E[U(B)] = 15 4 + 54 49 = 6. Thus A
√
and B lie on the same indifference curve. On the other hand, E[U(C)] = 40 =
6.3246. Thus C lies on a higher indifference curve. See Figure 5.20.13
(b) The expected value of A, B and C is the same, namely 40. Thus the three points lie
on the same indifference curve, which is a straight line with slope − 14 .
y
(probability 45 )
B
49
C
35
A EU = 6.3246
25 EU = 6 x
0 (probability 15 )
4 60 100
13 Note that the scale of the axes has been distorted to make the qualitative properties of the graph easier to
see.
152 Chapter 5. Insurance: Part 2
1
!
1
20 50,000 8
− 19 1
=− = −0.0842.
20 80,000
95
1
20 1
(b) The slope of every isoprofit line is − 19 = − 19 = 0.0526.
20
(c) The maximum premium that Adam is willing to pay for full insurance is given by
the solution to the equation
1 19
ln(80, 000 − h) = ln(50, 000) + ln(80, 000)
20 20
1
800 = ` + 500,
10
5
(a) Bob’s expected loss is 100 110, 000 = $5, 500.
(b) Bob’s expected wealth if he does not insure is 195, 000 − 5, 500 = $189, 500.
5 95
800 − (20 − 8.5)2 + 800 − (20 − 19.5)2 = 793.15.
100 100
(d) The insurance contract with premium $1, 200 and deductible $20, 000 corresponds to
the following point in the wealth diagram: (173,800, 193,800). The corresponding
expected utility is
5 95
800 − (20 − 17.38)2 + 800 − (20 − 19.38)2 = 799.29.
100 100
(e) U 0 (m) = 40 − 2m. Thus the slope of Bob’s reservation indifference curve at the
no-insurance point is
5
100 40 − 2(8.5)
− 95 = −1.2105.
100
40 − 2(19.5)
(f) From Part (d) we have that A = (173, 800, 193, 800). The slope, at point A, of Bob’s
indifference curve that goes through point A is
5 40 − 2(17.38)
− = −0.2224.
95 40 − 2(19.38)
(g) Since E[U(A)] = 799.29 > 793.15 = E[U(NI)], the indifference curve that goes
through point A lies above reservation indifference curve.
(h) The maximum premium that Bob would be willing to pay for full insurance is given
by the solution to the equation U(W0 − h) = E[U(NI)], that is,
800 − [20 − (19.5 − h)]2 = 793.15, which is 2.1173, that is, $21, 173 (slightly less
than four times the expected loss).
154 Chapter 5. Insurance: Part 2
h i
e−W0 eh = e−W0 pe` + 1 − p ,
eh = pe` + 1 − p.
None of the contracts gives her higher expected utility than no insurance. Hence she
will not buy insurance.
5.6 Solutions to Exercises 155
Thus the reservation indifference curve is steeper, at NI, than the insurance budget
line and, therefore, there are contracts that are better than no insurance.
(c) Since the slope of any indifference curve at any point on the 45o line is equal to
1
6
− 5 = − 51 , the indifference curve that goes through the point at the intersection of
6
the insurance budget line and the 45o line is less steep than the insurance budget line
and thus we are in Case 3 of Section 5.3.2 and the optimal contract is given by the
solution to the following two equations:
1 1
!
1 6 W1 1
W2 = 11, 500 − W1 and − 5 1
=−
4 6 W
4
2
√1
!
1
2 400 2
− 16 = − .
15 √1 25
16 2 576
Thus the reservation indifference curve is less steep, at NI, than the insurance budget
line and, therefore, the insurance budget line lies below the reservation indifference
curve, that is, there are no contracts that are better than no insurance. Thus we are in
Case 1 of Section 5.3.2 and your best decision is not to insure.
1 √1
!
7 2 400 1
− 6 1
=− .
7
√ 5
2 576
Thus the reservation indifference curve is steeper, at NI, than the insurance budget
line and, therefore, there are contracts that are better than no insurance.
(c) Since the slope of any indifference curve at any point on the 45o line is equal to
1
− 76 = − 16 , the indifference curve that goes through the point at the intersection of
7
the insurance budget line and the 45o line is steeper than the insurance budget line
and thus we are in Case 2 of Section 5.3.2 and the optimal contract is given by the
full-insurance contract (the point of intersection between the insurance budget line
and the 45o line), that is, by the solution to the following two equations:
1
W2 = 626 − W1 and W2 = W1
8
which is W1 = W2 = 5,008 9 = 556.44. Thus the chosen deductible is zero and the
premium is h = 576 − 556.44 = $19.56.√ √
(d) Expected utility from no insurance is 17 400 + 67 576 = 23.4286, while expected
√
utility from the contract of Part (c) is 556.44 = 23.589.
(e) Expected profits from the contract of Part (c) is 19.56 − 71 400 = −$37.58, thus a
loss.
5.6 Solutions to Exercises 157
√ √
(f) Expected utility from contract (h, d) is: 17 576 − h − d + 76 576 − h. Replacing h
with 91 (176 − d) we get the following function:
q q
f (d) = 71 576 − 176
9 + 1
9 d − d + 6 176 1
7 576 − 9 + 9 d
q q
= 7 556.44 − 9 d + 7 556.44 + 19 d.
1 8 6
We know from Part (c) that the maximum of the function f (d) is achieved at a corner
(that is, not in the interior of the interval [0, 400]) and thus it cannot be found by
solving the equation f 0 (d) = 0 (indeed, the solution to this equation is −398.36
which is outside the interval [0, 400]). Another way to see that the solution is at a
corner, is to calculate the value f 0 (d) at d = 0: f 0 (0) = −0.0007 < 0, indicating that
increasing the deductible from 0 reduces expected utility.
(c) Since coverage = loss − deductible, the equation is h = γ(` − d) = γ(3000 − d).
(d) Replacing h with W0 −W2 = 8, 000 −W2 and d = (W2 −W1 ) in the equation of Part
(c) we get
8, 000 − 3, 000γ γ
W2 = − W1 .
1−γ 1−γ
(e) Yes: replacing W1 with 5, 000 (= W0 − ` = 8, 000 − 3, 000, the horizontal coordinate
of NI) in the above equation we get W2 = 8, 000 (the vertical coordinate of NI).
(f) David will choose not to insure when the insurance budget line is steeper than the
reservation indifference curve at the NI point, that is, when (recall that money is
measure in thousands of dollars)
0
1 (8 + 1)2
γ p U (5) 9 1
> = = = .
1−γ 1 − p U 0 (8) 9 (5 + 1)2 36 4
1
Solving γ
1−γ > 4 we get γ > 15 .
(g) David will choose full insurance when the slope of the indifference curve at the
p
offered full-insurance contract (which is − 1−p = − 91 ) is, in absolute value, greater
than or equal to the slope of the budget line (which is − 1−γγ
). Solving 19 ≥ 1−γ
γ
we
1
get γ ≤ 10 .
(h) David will choose partial insurance when
1. he prefers insurance to no insurance (that is, as seen in Part (f), when γ < 51 )
and
2. the slope of the indifference curve at the offered full-insurance contract is, in
absolute value, less than the slope of the budget line (that is, as seen in Part (g),
1
when γ > 10 ).
158 Chapter 5. Insurance: Part 2
1 1
<γ < .
10 5
1
Assume that 10 < γ < 51 . Then David will choose that contract
C C
C = W1 ,W2 at which there is a tangency between the budget line and the in-
difference curve through C, that is, C must satisfy the following equations (recall
that wealth levels are expressed in dollars while the argument of the utility function
is expressed in thousands of dollars):
2
W2C
8, 000 − 3, 000γ γ 1 +1
1,000 γ
W2 = − W1 and 2
= .
1−γ 1−γ 9 1−γ
C
W1
1,000 + 1
20,925
The solution is: W1 = 2, 325 and W2 = 7 = 2, 989.29. Thus the premium is h =
3, 600 − 2, 989.29 = 610.71 and the deductible is d = 2, 989.29 − 2, 325 = 664.29.
5.6 Solutions to Exercises 159
Thus
1
• E[U(NI)] = 4 ln(900) + 34 ln(3, 600) = 7.8421.
• E[U(F)] = ln(2, 790) = 7.9338.
1
• E[U(B)] = 4 ln(2, 325) + 34 ln(2989.29) = 7.94.
(g) Expected utility from contract (h, d) is: 14 ln(3, 600 − h − d) + 34 ln(3, 600 − h).
3
Replacing h with 810 − 10 d we get the following function:
0.225 0.175
− =0
2, 790 + 0.3d 2, 790 − 0.7d
Since 0.2852 < 37 = 0.4286, the indifference curve is less steep, at point A, than the
insurance budget line; thus the insurance budget line lies below the indifference
curve that goes through point A and, therefore, the best contract on the budget line is
contract A.
160 Chapter 5. Insurance: Part 2
Thus they are both better off with mutual insurance than with no insurance.
1p 4p
E[U(NI)] = 10, 000 + 40, 000 = 180
5 5
while expected utility from mutual insurance is
1 p 12
q
E[U(MI)] = 10, 000 + 40, 000 − 13 60, 000
125 125
48 64 p
q
+ 40, 000 − 13 30, 000 + 40, 000
125 125
= 183.29.
1p 4p
E[U(NI)] = 10, 000 + 40, 000 = 180.
5 5
3p 7 20 p
q
E[U(MI)] = 10, 000 + 40, 000 − 12 30, 000 + 40, 000 = 180.23.
30 30 30
• The owner of a store (the Principal) hires a sales assistant (the Agent) to run the
store. The surplus is the profit generated during a specified period of time. The size
of the profit is typically affected by random factors which are not under the control
of either party, such as the number of customers that will show up during that period
of time. Again, many contractual possibilities arise: the employee (the Agent) could
be hired at a fixed salary, or could be offered a percentage of the profits generated by
the store, or could be guaranteed a base salary as well as a percentage of the profits,
etc.
• A landowner (the Principal) gives his land to a farmer (the Agent) to cultivate. The
surplus is the sale value of the crop. The size and the value of the crop depend on
random factors, such as the weather and the market price. A possible contractual
arrangement is to have the Agent give a part of the crop to the Principal as rent
(this is called sharecropping). Another possible contractual arrangement is for the
Principal to pay a fixed salary to the Agent and keep the crop for himself. There are,
of course, several alternative contracts that could be agreed upon.
• The shareholders of a firm (the Principal) hire a manager (the Agent) to run the firm.
The surplus is the profit generated by the firm, which can vary depending on such
factors as the cost of inputs, the intensity of competition, the state of the economy,
etc. Possible compensation schemes are: a fixed salary to the manager, stock options,
bonuses, etc.
We assume that the Principal and the Agent are able to agree on the possible levels
of surplus, denoted by X1 , X2 , . . . , Xn (n ≥ 2) and are able to assign probabilities to them,
denoted by p1 , p2 , . . . , pn . Thus if they enter into a contractual relationship they will,
together, generate the money lottery
$X1 $X2 ... $Xn
p1 p2 ... pn
0 ≤ wi ≤ Xi .
• while for the Principal the contract corresponds to the money lottery
$(X1 − w1 ) ... $(Xn − wn )
.
p1 ... pn
6.2 The Edgeworth box 167
As in previous chapters, we use the following notation: if C and D are two contracts
and j is one of the parties to the contract (thus j ∈ {P, A}, where P denotes the Principal
and A the Agent),
C j D means that party j strictly prefers C to D.
C ∼j D means that party j is indifferent between C and D.
C %j D means that party j considers C to be at least as good as D
(that is, either C j D or C ∼ j D).
Definition 6.1.1 Let C and D be two possible contracts. We say that C Pareto dominates
D (or that D is Pareto dominated by C) if one party prefers C to D and the other considers
C to be at least as good as D, that is, if
- either C P D and C %A D,
- or C %P D and C A D.
We say that a contract is Pareto efficient if there is no other contract that Pareto dominates
it.a
a The notions of Pareto dominance and Pareto efficiency are named after the Italian economist Vilfredo
Pareto (1848-1923) who introduced them.
Thus a contract C is Pareto efficient if and only if, for every other contract D, if one of
party prefers D to C then the other Party prefers C to D. Clearly, it would be irrational for
the parties to sign a Pareto dominated contract, since there is another contract that one of
the parties prefers and the other also prefers or considers to be just as good. Our objective
is to characterize the set of Pareto efficient contracts.
X g > X b > 0.
A contract is specified as a pair (wg , wb ) where wg is what the agent gets in the good state
(so that the Principal gets the residual amount X g − wg ) and wb is what the Agent gets in the
bad state (so that the Principal gets the difference: X b − wb ). The set of possible contracts
can be represented graphically by means of an Edgeworth box, which is a rectangle whose
long side is equal to X g and whose short side is equal to X b , as shown in Figure 6.1.1
1 The Edgeworth box is named after the English economist Francis Ysidro Edgeworth (1845-1926) who
introduced it as a way of representing possible distributions of resources between two individuals.
168 Chapter 6. Risk Sharing and Pareto Efficiency
origin for
the Agent
wg 0A
The
length
of
this
side
is X b − wb wb
Xb C
0P X g − wg
origin for
the Principal
Figure 6.1: The Edgeworth box representing the set of possible divisions of the uncertain
surplus between the Principal and the Agent.
The lower left-hand corner of the box is taken to be the origin for the Principal and the
upper right-hand corner to be the origin for the Agent. Any point in the box represents
a possible contract. Consider, for example, point C in Figure 6.1. Going vertically up
from C to the top horizontal side of the box one obtains the amount wg (measured from
the origin 0A for the Agent) that the Agent receives if the surplus turns out to be X g , while
going vertically down from point C to the bottom horizontal side of the box one obtains
the amount (X g − wg ) (measured from the origin 0P for the Principal) that goes to the
Principal if the surplus turns out to be X g . Similarly, going horizontally from point C to
the right vertical side of the box one can read the amount wb that goes to the Agent if the
surplus turns out to be X b , while going horizontally to the left vertical side one can read
the amount (X b − wb ) that goes to the Principal when the surplus is X b .
6.2 The Edgeworth box 169
What about points outside the Edgeworth box? Do they represent feasible con-
tracts? For example, suppose that X g = 1, 000 and X b = 600 and consider the contract
(wg = 300, wb = −200) which is shown as point D in Figure 6.2.
D
800 −200
origin for
the Agent
0A
300
The
length
of
this
side
is
600
0P 700
origin for
the Principal
How is such a contract to be interpreted? If the outcome is $1,000 then the Agent will get
$300 and the rest, namely $700, will go to the Principal; on the other hand, if the outcome
is $600 then the Agent will give $200 to the Principal so that the Principal will get the
entire surplus of $600 plus the additional sum of $200 contributed by the Agent, for a total
of $800. Is such a contractual arrangement feasible? Only if the Agent has an initial wealth
which is greater than or equal to $200 so that she can draw from that wealth to make a
payment to the Principal in case the outcome turns out to be $600.
For the time being we shall assume that neither the Principal nor the Agent have
independent funds from which they can draw to make payments to the other party:
Two possible types of contracts in the Edgeworth box are worth highlighting.
• The first type are contracts that guarantee the Agent a fixed income, no matter what
the surplus turns out to be: these are contracts of the form (wg , wb ) with wg = wb .
The set of such contracts is represented by the 45o line out of the origin for the
Agent. Contract D in Figure 6.3 is an example of a fixed-income, or income-certainty,
contract for the Agent.
• The second type are contracts that guarantee the Principal a fixed income, no matter
what the surplus turns out to be: these are contracts of the form (wg , wb ) with
X g − wg = X b − wb . The set of such contracts is represented by the 45o line out of
the origin for the Principal. Contract E in Figure 6.3 is an example of a fixed-income,
or income-certainty, contract for the Principal.
origin for
the Agent
wgD 0A
45o
income certainty
for the Principal
The
length
of D wbD
this
side X b − wbE E
is
Xb
income certainty
45o for the Agent
0P X g − wgE
origin for
the Principal
Figure 6.3: Contract D = (wgD , wbD ) is such that wgD = wbD and thus guarantees income
certainty to the Agent. Contract E = (wgE , wbE ) is such that X g − wgE = X b − wbE and thus
guarantees income certainty to the Principal.
We shall assume that both the Principal and the Agent have preferences over money
lotteries that satisfy the axioms of Expected Utility Theory (see Chapter 5) and can thus
be represented by von Neumann-Morgenstern utility-of-money functions: UP (m) for the
Principal and UA (m) for the Agent.
6.2 The Edgeworth box 171
√
For example, suppose that the Principal is risk averse with utility function UP (m) = m,
while the Agent is risk neutral, so that we can take her utility function to be the identity
function: UA (m) = m. Suppose that in the good state the surplus is X g = $1, 000 and in the
bad state the surplus is X b = $600; furthermore the probability of the good state is pg = 13
(so that probability of the bad state is pb = 32 ). Would it be rational for the these two
individuals to sign the contract C = (wg = 400, wb = 400) that guarantees income certainty
to the Agent? √ If they did sign such
√ a contract, the Principal’s expected utility would be
E[UP (C)] = 13 1, 000 − 400 + 23 600 − 400 = 17.5931 and the Agent’s utility would be
UA (400) = 400. Consider the alternative contract D = (wg = 676, wb = 276). With this
contract, the Principal gets the same amount of money no matter what the state (namely,
1, 000 − 676
√ = 324 in the good state and 600 − 276 = 324 in the bad state) and thus his
utility is 324 = 18, while the Agent has an expected utility of 13 676 + 23 276 = 409.33.
Hence both parties strictly prefer contract D to contract C and thus it would be irrational
for them to agree on contract C. In other words, contract C is not a rational choice because
there is another contract, namely D, that Pareto dominates it.
In order to identify the set of Pareto efficient contracts, we need to add indifference
curves to the Edgeworth box.
Recall from Chapter 5 that the indifference curves of a risk-averse individual are convex
towards the origin. Figure 6.4 illustrates the case of a risk-averse Principal.
origin for
the Agent
0A
direction of
increasing utility
for the Principal
0P
origin for
the Principal
Figure 6.4: Indifference curves of a risk-averse Principal. Higher indifference curves (that
is, indifference curves farther away from the origin for the Principal) correspond to higher
levels of utility for the Principal.
172 Chapter 6. Risk Sharing and Pareto Efficiency
In Chapter 5 we also saw that the indifference curves of a risk-neutral individual are
straight lines. Figure 6.5 illustrates the case of a risk-neutral Agent.
origin for
the Agent
0A
direction of
increasing utility
for the Agent
0P
origin for
the Principal
Figure 6.5: Indifference curves of a risk-neutral Agent. Lower indifference curves (that is,
indifference curves farther away from the origin for the Agent) correspond to higher levels
of utility for the Agent.
6.2 The Edgeworth box 173
0A
Principal’s indifference curve
A
E
C
D B
0P
We now show that any contract at which the indifference curve of the Principal and the
indifference curve of the Agent cross cannot be Pareto efficient. Consider again the case
where the Principal is risk averse and the Agent is risk neutral. Contract C in Figure 6.6 is
a contract at which the indifference curves of the two parties cross.
Let us compare alternative contracts to contract C. The terms "above" and "below" are
to be understood as taking the point of view of the origin for the Principal (the bottom,
left-hand side of the box).
• Any contract, such as contract A, that lies above the Principal’s indifference curve
through C and above the Agent’s indifference curve through C, is preferred to C by
the Principal, but considered to be worse than C by the Agent: A P C and C A A.
• Any contract, such as contract B, that lies below the Principal’s indifference curve
through C and above the Agent’s indifference curve through C, is considered to be
worse than C by both Principal and Agent: C P B and C A B.
• Any contract, such as contract D, that lies below both indifference curves, is consid-
ered to be worse than C by the Principal but better than C by the Agent: C P D and
D A C.
• Finally, any contract, such as contract E, that lies between the two indifference
curves, is preferred to contract C by both parties: E P C and E A C.
Thus, whenever the indifference curves of the two parties that go through a contract C
cross at point C, the area between the two curves (the shaded area in Figure 6.6) represents
contracts that are preferred by both the Principal and the Agent to contract C. Hence C is
Pareto dominated by any of those contracts, implying that C is not Pareto efficient.
174 Chapter 6. Risk Sharing and Pareto Efficiency
origin for
the Agent
0A
Principal’s indifference curve
0P
origin for
the Principal
On the other hand, a contract at which the two indifference curves are tangent must
be Pareto efficient. Contract C in Figure 6.7 is an example of such a contract: any other
contract must be either above the Principal’s indifference curve, in which case it is worse
than C for the Agent, or below the Agent’s indifference curve, in which case it is worse
than C for the Principal, or between the two indifference curves, in which case it is worse
than C for both parties.
In the next section we identify the set of points where the indifference curves of
Principal and Agent are tangent.
Let us begin with the case where the Principal is risk averse, so that his indifference curves
are convex towards the origin for the Principal, and the Agent’s indifference curves are
straight lines.
Recall from Chapter 5 that the indifference curves of a risk-neutral individual are
straight lines with slope (since we are measuring the outcome of the good state on the
horizontal axis):
pg
− (6.1)
1 − pg
where pg is the probability of the good state. On the other hand, the slope of the Principal’s
indifference curve at a contract where the Agent gets wg in the good state (so that the
Principal gets X g − wg ) and wb in the bad state (so that the Principal gets X b − wb ) is (recall
that UP (·) is the utility-of-money function of the Principal and UP0 (·) is its first derivative):
pg UP0 (X g − wg )
− . (6.2)
1 − pg UP0 (X b − wb )
If the two indifference curves are tangent at point C = (wg , wb ) then they have the same
slope there, that is, it must be that
pg pg UP0 (X g − wg )
− = − (6.3)
1 − pg 1 − pg UP0 (X b − wb )
UP0 (X g − wg )
= 1, that is, that UP0 (X g − wg ) = UP0 (X b − wb ).
UP0 (X b − wb )
Since the Principal is risk averse, the second derivative of his utility function is negative,
which means that the first derivative is strictly decreasing; hence
176 Chapter 6. Risk Sharing and Pareto Efficiency
origin for
the Agent
0A
Principal’s
indifference curve
45o Agent’s
indifference curve
0P
origin for
the Principal
Figure 6.8: When the Principal is risk averse and the Agent is risk neutral, the 45o line out
of the origin for the Principal is the set of points where there is a tangency between the
Principal’s indifference curve and the Agent’s indifference curve.
while the Agent’s indifference curves are convex towards the origin for the Agent. The
slope of the Agent’s indifference curve at contract C = (wg , wb ) is
pg UA0 (wg )
−
1 − pg UA0 (wb )
where UA (·) is the utility-of-money function of the Agent and UA0 (·) is its first derivative.
As before, if the two indifference curves are tangent at point C = (wg , wb ) then they have
the same slope there, that is, it must be that
pg pg UA0 (wg )
− = − (6.4)
1 − pg 1 − pg UA0 (wb )
6.3 Points of tangency 177
UA0 (wg )
0 b
= 1, that is, that UA0 (wg ) = UA0 (wb ).
UA (w )
Since the Agent is risk averse, the second derivative of her utility function is negative,
which means that the first derivative is strictly decreasing; hence
that is, if and only if point C = (wg , wb ) lies on the 45o line out of the origin for the Agent,
as shown in Figure 6.9.
origin for
the Agent
0A
Principal’s 45o
indifference
curve
Agent’s
indifference curve
0P
origin for
the Principal
Figure 6.9: When the Agent is risk averse and the Principal is risk neutral, the 45o line
out of the origin for the Agent is the set of points where there is a tangency between the
Principal’s indifference curve and the Agent’s indifference curve.
178 Chapter 6. Risk Sharing and Pareto Efficiency
When one party is risk neutral and the other is risk averse,
a contract C = (wg , wb ) such that 0 < wg < X g and 0 < wb < X b
is Pareto efficient if and only if all the risk is borne by the risk-neutral party,
so that the risk-averse party is guaranteed a fixed income.
The reason why the above principle requires the restriction that 0 < wg < X g and
0 < wb < X b will be explained in Section 6.4.
Recall from Chapter 5 that the absolute value of the slope of the indifference curve of a
risk-averse individual is
pg
grater than 1−pg at a point above the 45o line
pg
equal to 1−pg at a point on the 45o line (6.5)
pg
less than 1−pg at a point below the 45o line.
Consider a point which is above (such as point A in Figure 6.10) or on (such as point B
in Figure 6.10) the 45o line out of the origin for the Principal. Then, by (6.5), the slope of
the Principal’s indifference curve at that point is greater than (point A), or equal to (point
pg
B), 1−pg ; on the other hand, from the point of view of the Agent, that point is below her
45o line and thus the slope of her indifference curve at that point is, in absolute value, less
pg
than 1−pg . Hence the Principal’s indifference curve is steeper at that point than the Agent’s
indifference curve, so that the two curves are not tangent and thus the contract is not Pareto
efficient (any contract between the two indifference curves Pareto dominates it).
6.3 Points of tangency 179
0A
Pic
45o
Aic
A D
Pic
Pic
Aic Aic
B C
Pic
45o
Aic
0P
Figure 6.10: When both the Principal and the Agent are risk averse a tangency between
the Principal’s indifference curve and the Agent’s indifference curve cannot occur at points
that are not between the two 45o lines. Aic stands for "indifference curve of the Agent" and
Pic stands for "indifference curve of the Principal".
Similarly, consider a point which is below (such as point C in Figure 6.10) or on (such
as point D in Figure 6.10) the 45o line out of the origin for the Agent. Then, by (6.5), the
slope of the Agent’s indifference curve at that point is greater than (point C), or equal to
pg
(point D), 1−pg ; on the other hand, from the point of view of the Principal, that point is
below his 45o line and thus the slope of his indifference curve at that point is, in absolute
pg
value, less than 1−pg . Hence the Principal’s indifference curve is less steep at that point
than the Agent’s indifference curve, so that the two curves are not tangent and thus the
contract is not Pareto efficient (any contract between the two indifference curves Pareto
dominates it).
180 Chapter 6. Risk Sharing and Pareto Efficiency
A Pareto efficient contract is shown as point E in Figure 6.11. It lies between the two
45o lines and thus involves risk-sharing: neither party is guaranteed a fixed income.
origin for
the Agent
0A
45o
Pic
Aic
45o
0P
origin for
the Principal
Figure 6.11: Contract E is Pareto efficient and involves risk-sharing. Aic stands for
"indifference curve of the Agent" and Pic stands for "indifference curve of the Principal".
that is,
Equation (6.6) is necessary and sufficient for a contract E = (wg , wb ) with 0 < wg < X g and
0 < wb < X b to be Pareto efficient and provides us with a quick test for Pareto efficiency.
6.3 Points of tangency 181
pg X g − pg wgD + (1 − pg )X b − (1 − pg )wbD
(6.7)
> pg X g − pg wCg + (1 − pg )X b − (1 − pg )wCb ).
that is,
The left-hand side of (6.8) is the expected value of the money lottery for the Agent
corresponding to contract C and the right-hand side of (6.8) is the expected value of the
money lottery for the Agent corresponding to contract D. Thus we conclude that contract
D is preferred to contract C by the Principal if and only if the Agent has the opposite
ranking, that is, she prefers C to D.
182 Chapter 6. Risk Sharing and Pareto Efficiency
An alternative proof that any contract is Pareto efficient makes use of the tangency
argument: gwhen both parties are risk neutral, their indifference curves are straight line with
p
slope − 1−p g and thus the indifference curve of the Principal that goes through a point C
coincides with the indifference curve of the Agent that goes to point C and thus the two
are tangent.
6.3.6 Pareto efficiency for contracts in the interior of the Edgeworth box
A contract C = (wg , wb ) is in the interior of the Edgeworth box if and only if
0 < wg < X g and 0 < wb < X b .
Let us focus on contracts that are in the interior of the Edgeworth box. We showed in
Section 6.3.4 that, for such contracts, equality (6.6) (page 180) is necessary and sufficient
for Pareto efficiency when both parties are risk averse. We now show that, for contracts
in the interior of the Edgeworth box, equality (6.6) is necessary and sufficient for Pareto
efficiency no matter what the risk attitude of the parties, that is, in all cases.
First of all, note that when an individual is risk neutral, we can take the identity function
as his von Neumann-Morgenstern utility-of-money function, that is, we can take U(m) = m.
0 (m )
Hence U 0 (m) = 1 and, for any two sums of money m1 and m2 , U 1
U 0 (m ) = 1. Thus
2
- If the Principal is risk averse and the Agent is risk neutral (Section 6.3.1), equation
(6.6) reduces to
UP0 (X g − wg )
=1
UP0 (X b − wb )
which requires (as shown in Section 6.3.1) that X g − wg = X b − wb , that is, the
Principal (the risk-averse party) must be guaranteed a fixed income.
- If the Principal is risk neutral and the Agent is risk averse(Section 6.3.2), (6.6)
reduces to
U 0 (wg )
1 = A0 b
UA (w )
which requires (as shown in Section 6.3.2) that wg = wb , that is, the Agent (the
risk-averse party) must be guaranteed a fixed income.
- If both Principal and Agent are risk neutral, then equation (6.6) reduces to 1 = 1,
which is true at every point.
Hence we conclude this section with a general principle:
0A
C
Pic
Aic
45o
0P
Figure 6.12: Contract C satisfies Pareto efficiency subject to the constraint that only
contracts in the Edgeworth box are feasible. Aic stands for "indifference curve of the
Agent" and Pic stands for "indifference curve of the Principal".
efficiency subject to the constraint that only contracts in the Edgeworth box are feasible.2
Putting together the results of this Section and of Section 6.3.1 we conclude that when the
Principal is risk averse and the Agent is risk neutral, the set of Pareto efficient contracts is
given by the union of the 45o line out of the origin for the Principal and the portion of the
top side of the Edgeworth box from the point of intersection with the 45o line out of the
origin for the Principal to the origin for the Agent, as shown in Figure 6.13.
origin for
the Agent
0A
45o
0P
origin for
the Principal
Figure 6.13: The set of Pareto efficient contracts when the Principal is risk averse and the
Agent is risk neutral is given by the thick line (the union of the 45o line out of the origin
for the Principal and the portion of the top side of the Edgeworth box from the origin for
the Agent to the point of intersection with the 45o line out of the origin for the Principal).
curves are convex towards the origin for the Agent). The analysis in this case is symmetric
to that of the previous section.
Consider a point, such as point D in Figure 6.14, on the bottom side of the Edgeworth
box to the left of the point at which the 45o line out of the origin for the Agent intersects
that side. From (6.5) (page 178) we know that at such a point the indifference curve of
the Principal is steeper than the indifference curve for the Agent, as shown in Figure 6.14.
Thus there would potentially be contracts that are Pareto superior to contract D (all those
in the shaded area in Figure 6.14, that is, all those that lie between the two indifference
curves). However, such contracts are not feasible, because they lie outside the Edgeworth
box. Thus there are no contracts in the Edgeworth box that are Pareto superior to D;
in other words, contract D satisfies Pareto efficiency subject to the constraint that only
contracts in the Edgeworth box are feasible.3
origin for
the Agent
0A
45o
0P D
origin for
the Principal
Aic
Pic
Figure 6.14: Contract D satisfies Pareto efficiency subject to the constraint that only
contracts in the Edgeworth box are feasible. Aic stands for "indifference curve of the
Agent" and Pic stands for "indifference curve of the Principal".
3A contract (wg , wb ) in the shaded area in Figure 6.14 is such that wb > X b (while 0 < wg ≤ X g ) and thus
it would imply a payment from the Principal to the Agent in the bad state. Such a payment is feasible only if
the Principal has independent funds from which to draw if the state is bad. Since we assumed that neither the
Principal nor the Agent have independent funds at their disposal, contracts outside the Edgeworth box are
not feasible. The case where the parties do have independent funds will be discussed in Section 6.5.
186 Chapter 6. Risk Sharing and Pareto Efficiency
Putting together the results of this Section and of Section 6.3.2 we conclude that when
the Principal is risk neutral and the Agent is risk averse, the set of Pareto efficient contracts
is given by the union of the 45o line out of the origin for the Agent and the portion of
the bottom side of the Edgeworth box from the origin for the Principal to the point of
intersection with the 45o line out of the origin for the Agent, as shown in Figure 6.15.
origin for
the Agent
0A
0P
origin for
the Principal
Figure 6.15: The set of Pareto efficient contracts when the Principal is risk neutral and the
Agent is risk averse is given by the thick line (the union of the 45o line out of the origin for
the Agent and the portion of the bottom side of the Edgeworth box from the origin for the
Principal to the point of intersection with the 45o line out of the origin for the Agent).
The same logic applies to the case where both the Principal and the Agent are risk averse:
points on the top side of the Edgeworth box where the Principal’s indifference curve is less
steep than the Agent’s indifference curve and points on the bottom side of the Edgeworth
box where the Principal’s indifference curve is steeper than the Agent’s indifference curve
correspond to Pareto efficient contracts (subject to the constraint that only contracts in the
Edgeworth box are feasible).
6.5 The Edgeworth box when the parties have positive initial wealth 187
The thick curve in Figure 6.16 shows the set of Pareto efficient contracts for a situation
where both the Principal and the Agent are risk averse.
Aic
0A
Pic
Pic
Aic
0P
Aic
Pic
Figure 6.16: The set of Pareto efficient contracts when both the Principal and the Agent
are risk averse is shown as a thick curve. Aic stands for "indifference curve of the Agent"
and Pic stands for "indifference curve of the Principal".
6.5 The Edgeworth box when the parties have positive initial wealth
So far we have assumed that only contracts inside the Edgeworth box are feasible or,
equivalently, that both the Principal and the Agent have zero initial wealth. If, on the other
hand, one or both parties have a positive initial wealth then, as shown in Figure 6.2 on page
169, it is feasible for the parties to agree on a contract that lies outside the Edgeworth box.
As before, let the surplus in the good state be X g and the surplus in the bad state X b .
Let W P ≥ 0 be the initial wealth of the Principal and W A ≥ 0 the initial wealth of the
Agent. Construct an extended Edgeworth box as follows: the length of the horizontal side
is X g +W P +W A and the length of the vertical side is X b +W P +W A .
188 Chapter 6. Risk Sharing and Pareto Efficiency
R Whereas before we represented a contract as a pair (wg , wb ), where wg was the portion,
given to the Agent, of the surplus X g generated in the good state (thus 0 ≤ wg ≤ X g )
and wb was the portion, given to the Agent, of the surplus X b generated in the bad
state (thus 0 ≤ wb ≤ X b ), we now need to represent a contract differently in order to
allow for the possibility of payments from one party to the other. Thus in this more
general setting we represent a contract as a pair (zg , zb ) where zg is the wealth of the
Agent in the good state (thus 0 ≤ zg ≤ X g + W P + W A ) and zb is the wealth of the
Agent in the bad state (thus 0 ≤ zb ≤ +X b + +W P +W A ).4
Figure 6.17 shows the extended Edgeworth box. The dashed box inside the extended box
is the Edgeworth box that would apply if the parties had zero initial wealth.
WA 0A
WA
The D
The length length
of this of this
side is side is
b
X +W P +W A Xb C
WP
The length of this side is X g
0P WP
Figure 6.17: The extended Edgeworth box. The dashed box inside the extended box is the
Edgeworth box that would apply if the parties had zero wealth.
Point C in Figure 6.17 represents a contract where the Agent gets all the surplus in each
state and the Principal retains his initial wealth of W P . Note that a wealth of W P for sure is
an outside option for the Principal: he can guarantee himself such a wealth by refusing
to enter into an agreement with the Agent. Thus the indifference curve of the Principal
that goes through point C represents a reservation level of utility for the Principal: if the
Principal is rational, he will not agree to any contract that gives him an expected utility
less than UP (W P ).
4 For example, if X g = $1, 500, X b = $900 and the parties agree that the Agent will get one third of the
surplus in each state (thus $500 in the good state and $300 in the bad state) then we represent this contract
not as the pair (500, 300) but as the pair (zg = 500 +W A , zb = 300 +W A ), where W A is the Agent’s initial
wealth. Hence, from the Principal’s point of view, this contract will yield him a wealth of 1, 000 +W P in the
good state and 600 +W P in the bad state, where W P is the Principal’s initial wealth.
6.5 The Edgeworth box when the parties have positive initial wealth 189
Similarly, point D in Figure 6.17 represents a contract where the Principal gets all the
surplus in each state and the Agent retains her initial wealth of W A . A wealth of W A for
sure constitutes an outside option for the Agent: she can guarantee herself such a wealth
by refusing to enter into an agreement with the Principal. Thus the indifference curve of
the Agent that goes through point D represents a reservation level of utility for the Agent:
if the Agent is rational, she will not agree to any contract that gives her an expected utility
less than UA (W A ).
We call a contract that gives each party at least his/her reservation utility individually
rational. Thus the set of individually rational contracts is the set of points in the extended
Edgeworth box that lie between the two reservation indifference curves: the indifference
curve of the Principal that goes through point (W P ,W P ) (measured from the origin for the
Principal) and the indifference curve of the Agent that goes through the point (W A ,W A )
(if measured from the origin for the Agent, or (X g +W P , X b +W P ) if measured from the
origin for the Principal).
Figure 6.18 shows the reservation indifference curve of a of a risk-averse Principal and
the reservation indifference curve of a risk-neutral Agent. The area between the two curves
consist of the set of individually rational contracts.
WA 0A
Agent’s reservation
indifference curve
WA
D
The length
of this Xb
side is
b
X +W P +W A C Xg
WP
Principal’s reservation
indifference curve
0P WP
Figure 6.18: The reservation indifference curves of a risk-averse Principal and a risk-neutral
Agent.
To find the Pareto efficient contracts we proceed as in the previous sections. The only
difference is that rationality of the parties now requires not only Pareto efficiency but also
individual rationality. Thus we want to identify the set of contracts that are both Pareto
efficient and individually rational; let us call it the set of rational contracts. It is given by
the intersection of the set of Pareto efficient contracts and the set of individually rational
contracts.
190 Chapter 6. Risk Sharing and Pareto Efficiency
1. For the case where the Principal is risk averse and the Agent is risk neutral, the
set of rational contracts consists of the portion of the 45o line out of the origin for
the Principal from the point (W P ,W P ) to the top side of the extended Edgeworth
box, followed by the portion of the top side of the extended Edgeworth box from
the intersection with the 45o line to the point of intersection with the reservation
indifference curve of the Agent. The set of rational (i.e. Pareto efficient and
individually rational) contracts is shown as a thick line in Figure 6.19.
WA 0A
set of
rational contracts
WA
Xb
Xg
WP
C Agent’s reservation
Principal’s reservation indifference curve
indifference curve
0P WP
Figure 6.19: The thick line shows the set of rational (i.e. Pareto efficient and individually
rational) contracts when the Principal is risk averse and the Agent is risk neutral.
6.5 The Edgeworth box when the parties have positive initial wealth 191
2. For the case where the Principal is risk neutral and the Agent is risk averse,
the set of rational contracts consists of the portion of the 45o line out of
the origin for the Agent from the point (W A ,W A ) to the bottom side of the
extended Edgeworth box, followed by the portion of the bottom side of the
extended Edgeworth box from the intersection with the 45o line to the point of
intersection with the reservation indifference curve of the Principal. The set of
rational (i.e. Pareto efficient and individually rational) contracts is shown as a
thick line in Figure 6.20.
WA 0A
Principal’s reservation
indifference curve D
WA
set of
Xb rational contracts
Xg
WP
Agent’s reservation
indifference curve
0P WP
Figure 6.20: The thick line shows the set of rational (i.e. Pareto efficient and individually
rational) contracts when the Principal is risk neutral and the Agent is risk averse.
192 Chapter 6. Risk Sharing and Pareto Efficiency
3. For the case where both the Principal and the Agent are risk neutral, the set of
rational contracts coincides with the set of individually rational contracts (since
every contract is Pareto efficient). It is shown as the shaded area between the two
reservation indifference curves in Figure 6.21.
WA 0A
D
WA
Xb
Xg
WP Agent’s reservation
C indifference curve
Principal’s reservation
indifference curve
0P WP
Figure 6.21: The shaded area between the two reservation indifference curves is the set of
rational contracts when both parties are risk neutral.
6.6 More than two outcomes 193
4. For the case where both the Principal and the Agent are risk averse, the set of
rational contracts is a curve between the two 45o lines together with some segments
of the top and bottom sides of the extended Edgeworth box. It is shown as a thick
curve in Figure 6.22.
WA
0A
WA
Principal’s Agent’s
reservation reservation
indifference indifference
curve curve
WP
0P WP
Figure 6.22: The thick curve shows the set of rational (i.e. Pareto efficient and individually
rational) contracts when both the Principal and the Agent are risk averse.
Let there be n states (n ≥ 2) and let $Xi be the outcome in state i ∈ {1, . . . , n}, which
occurs with probability pi (0 < pi < 1, for every i ∈ {1, . . . , n}, and p1 + p2 + · · · + pn = 1).
We order the outcomes in such a way that
A contract is an n-tuple (w1 , w2 , . . . , wn ) where wi is what the Agent gets in state i (with
0 ≤ wi ≤ Xi ).
Proof. (A) Suppose that w j < wi (recall that Xi < X j ) and let
pi pj
ŵ = wi + w j. (6.10)
pi + p j pi + p j
Then w j < ŵ < wi .7 Modify contract C by replacing both wi and w j with ŵ and leaving
the other coordinates unchanged. Call D the resulting contract. Then the expected value of
D is equal to the expected value of C.8 Hence the Principal is indifferent between contract
C and contract D.9 On the other hand, the Agent prefers contract D to contract C, since C
is a mean-preserving spread of D (see Chapter 4, Section 4.4.2). Thus D Pareto dominates
C and therefore C is not Pareto efficient.
(B) Because of Part (A) we can assume that wi < w j (by hypothesis, wi 6= w j ). Suppose
that wi < Xi ; we want to show that C is not Pareto efficient. We need to consider two cases.
Case B.1: ŵ ≤ Xi where ŵ is defined in (6.10) (and thus wi < ŵ < w j ). Consider the
contract obtained from C by replacing both wi and w j with ŵ. Then, repeating the argument
developed in Part (A), the Principal is indifferent between C and D, because E[C] = E[D],
while the Agent prefers D to C because C is a mean-preserving spread of D. Hence D
Pareto dominates C and therefore C is not Pareto efficient.
Case B.2: ŵ > Xi (where ŵ is defined in (6.10); note that this implies that w j > Xi ).
Construct a new contract, call it D, obtained from C by replacing wi with Xi and w j with w̃˜
where
pi
w̃˜ = w j − (Xi − wi ) . (6.11)
pj
pi
Note that, since - by hypothesis - wi < Xi , pj (Xi − wi ) > 0 and thus w̃˜ < w j . Furthermore,
w̃˜ > Xi since
pi
wj − (Xi − wi ) > Xi if and only if p j w j − pi Xi + pi wi > p j Xi
pj
if and only if pi wi + p j w j > (pi + p j )Xi
pi pj
if and only if wi + w j > Xi
pi + p j pi + p j
if and only if (by (6.10)) ŵ > Xi
n n n n
∑ pk (Xk − wk ) = ∑ pk Xk − ∑ pk wk = ∑ pk Xk − E[C]
k=1 k=1 k=1 k=1
n
and, similarly, the expected utility of contract D is ∑ pk Xk − E[D].
k=1
196 Chapter 6. Risk Sharing and Pareto Efficiency
First we show that E[C] = E[D], so that the Principal is indifferent between C and D.
Clearly, E[C] = E[D] if and only if pi Xi + p j w̃ = pi wi + p j w j , which is true since
pi
pi Xi + p j w̃˜ = pi Xi + p j w j − (Xi − wi ) = pi Xi + p j w j − pi Xi + pi wi = pi wi + p j w j .
pj
Next we show that the Agent prefers D to C, so that D Pareto dominates C and therefore C
is not Pareto efficient. The Agent prefers D to C because D dominates C in the sense of
second-order stochastic dominance (see Chapter 4, Definition 4.4.3); indeed the lottery
corresponding to contract C can be obtained from the lottery corresponding to contract D
by a sequence of two mean-preserving spreads. To see
this, consider the following lotteries
wk
where the dots stand for entries of the form for k ∈ {1, 2, . . . , n} \ {i, j}.
pk
. . . wi Xi . . . w̃˜ w j . . .
LC = (6.12)
. . . pi 0 . . . 0 p j . . .
w̃˜ w j . . .
. . . wi Xi . . .
LD = (6.13)
. . . 0 pi . . . pj 0 ...
w̃˜
. . . wi Xi . . . wj ...
LE = (6.14)
. . . α pi 0 . . . p j (1 − α)pi . . .
Xi . . . w̃˜
... wi wj ...
LF = (6.15)
. . . α pi + β p j 0 . . . 0 (1 − α)pi + (1 − β )p j . . .
(6.16)
where
w j − Xi w j − w̃˜
α= and β= (6.17)
w j − wi w j − wi
LC is the lottery (for the Agent) corresponding to contract C and LD is the lottery (for the
Agent) corresponding to contract D. We want to show that LE is a mean-preserving spread
of LD , LF is a mean-preserving spread of LE and LF = LC .
• LE is a mean-preserving spread of LD because (1) wi < Xi (by hypothesis),
(2) w j > w̃˜ (as shown above) and (3) α pi wi + (1 − α)pi w j = pi Xi (so that the
expected value of E equals the expected value of D).10
10 In fact,
w j − Xi Xi − wi wi w j − wi Xi + w j Xi − wi w j
α pi wi + (1 − α)pi w j = pi wi + pi w j = pi =
w j − wi w j − wi w j − wi
w j − wi
= pi Xi = pi Xi .
w j − wi
6.6 More than two outcomes 197
11 In fact,
w j − w̃˜ w̃˜ − wi w j − wi
β p j wi + (1 − β )p j w j = p j wi + p jw j = p j w̃˜ = p j w̃˜.
w j − wi w j − wi w j − wi
12 In fact,
h i
w j − Xi w j − w̃˜ (w j − Xi ) pi + w j − w j − ppij (Xi − wi ) pj
α pi + β p j = pi + pj =
w j − wi w j − wi w j − wi
(w j − Xi ) pi + (Xi − wi ) pi w j − wi
= = pi = pi
w j − wi w j − wi
and
w j − w̃˜ w̃˜ − wi
w j − Xi Xi − wi
(1 − α) pi + (1 − β ) p j = 1 − pi + 1 − pj = pi + pj
w j − wi w j − wi w j − wi w j − wi
Xi − wi w j p j − (Xi − wi ) pi − wi p j w j − wi
= pi + = p j = p j.
w j − wi w j − wi w j − wi
198 Chapter 6. Risk Sharing and Pareto Efficiency
Let C = (w1 , w2 , . . . , wn ) be an interior contract, that is, a contract such that, for every
i = 1, 2, . . . , n, 0 < wi < Xi . If C is Pareto efficient then13
...
As in the case of two outcomes, there are also Pareto efficient contracts that are not interior
contracts, that is, contracts where, in some states, one of the parties gets the entire surplus.
We will omit a discussion of necessary conditions for Pareto efficiency for such contracts.
13 Note that if the n − 1 equations below are satisfied, then, for every i, j = 1, . . . , n with i 6= j,
UP0 (Xi −wi ) U 0 (w ) U 0 (X −w ) U 0 (w ) U 0 (X −w )
UP0 (X j −w j )
= U A0 (w i ) . For example, one can get UP0 (X1 −w1 ) = UA0 (w1 ) by multiplying the equation UP0 (X1 −w1 ) =
A j P 3 3 A 3 P 2 2
UA0 (w1 ) UP0 (X2 −w2 ) UA0 (w2 )
UA0 (w2 )
by the equation UP0 (X3 −w3 )
= UA0 (w3 )
.
6.7 Exercises 199
6.7 Exercises
The solutions to the following exercises are given in Section 6.8 at the end of this chapter.
Exercise 6.1 A Principal wants to hire an Agent to run his firm. The Principal’s utility-
of-money function is UP (m) = ln(m), while the Agent’s utility-of-money function is
UA (m) = 2m + 2. There are three possible profit levels: X1 = $2, 400, X2 = $1, 600 and
X3 = $900. The corresponding probabilities are p1 = 15 , p2 = 25 and p3 = 25 . Let w1
be the payment to the Agent if the outcome is X1 and similarly for w2 and w3 . Show
that contract A = (w1 = 700, w2 = 700, w3 = 700) is Pareto dominated by contract
B = (w1 = 1, 200, w2 = 1, 000, w3 = 200).
Exercise 6.3 Let the surplus in the good state be X g = $200 and the surplus in the bad
state be X b = $120. Consider the following contracts, where the first coordinate is the
amount of money that the Agent receives in the good state and the second coordinate
is the amount of money that the Agent receives in the bad state: C = (60, 80) and
D = (100, 60). The probability of the good state is 25% (so that the probability of the
bad state is 75%).
(a) Represent the two contracts as points in an Edgeworth box.
(b) If the Principal is risk neutral, how does he rank the two contracts?
(c) If the Agent is risk neutral, how does she rank the two contracts?
200 Chapter 6. Risk Sharing and Pareto Efficiency
0A
Pic Pic
Aic
A
E
D B
0P
Figure 6.23: The Edgeworth box for Exercise 6.4. Pic stands for "indifference curve of the
Principal" and Aic for "indifference curve of the Agent".
Exercise 6.5 Let the surplus in the good state be X g = $200 and the surplus in the bad
state be X b = $120. Consider the following contracts, where the first coordinate is the
amount of money that the Agent receives in the good state and the second coordinate
is the amount of money that the Agent receives in the bad state: C = (60, 80) and
D = (100, 60). The probability of the good state is 25% (so that the probability of the
bad state is 75%).
√ The Principal’s von Neumann-Morgenstern utility-of-money function
is UP (m) = m and the Agent’s von Neumann-Morgenstern utility-of-money function
is UA (m) = ln(m). Does one of the two contracts Pareto dominate the other?
6.7 Exercises 201
0A
45o
A D
B C
45o
0P
Exercise 6.7 Let the surplus in the good state be X g = $800 and the surplus in the
bad state be X b = $400. The Principal’s von Neumann-Morgenstern utility-of-money
m 2
function is UP (m) = 82 − 10 − 100 while the Agent’s von Neumann-Morgenstern
utility-of-money function is UA (m) = 2m + 4. Consider the following contract (where,
as usual, the first coordinate is the amount of money that the Agent receives in the good
state and the second coordinate is the amount of money that the Agent receives in the
bad state): C = (400, 200).
(a) Is contract C Pareto efficient? Explain.
(b) Represent contract C as a point in an Edgeworth box and sketch the indifference
curves of Principal and Agent that go through that point.
(c) In the Edgeworth box show a contract D such that D P C and D ∼A C (that is,
the Principal prefers D to C and the Agent finds D to be just as good as C).
202 Chapter 6. Risk Sharing and Pareto Efficiency
Exercise 6.8 Let the surplus in the good state be X g = $1, 206 and the surplus in the
bad state be X b = $676. The probability of the good state is 25%. The Principal’s von
Neumann-Morgenstern utility-of-money function is UP (m) = m √ while the Agent’s von
Neumann-Morgenstern utility-of-money function is UA (m) = m.
(a) Find a Pareto efficient contract in the interior of the Edgeworth box at which the
Principal’s expected utility is 232.5.
(b) Calculate the Agent’s expected utility at the contract of Part (a).
(c) Find a Pareto efficient contract on one of the sides of the Edgeworth box at which
the Agent’s expected utility is 27.
(d) Calculate the Principal’s expected utility at the contract of Part (c).
Exercise 6.9 The owner of a firm (the Principal) hires a manager (the Agent) to
run the firm. The Principal’s utility-of-money function
h is U(m)
i = ln(m), while the
x
−( 100 )
Agent’s utility-of-money function is V (m) = 100 1 − e (where e is the number
2.71828. . . ). The profit of the firm is affected by random events and can turn out to
be X g = $1, 000 (this is expected to happen with probability 14 ) or X b = $600 (with
probability 34 ). The Principal offers the following contract to the Agent: if the profit
turns out to be $1,000, I will pay you wg = $400, otherwise I will pay you wb = $200.
(a) What is the attitude to risk of the Principal?
(b) What is the attitude to risk of the Agent?
(c) Is the proposed contract Pareto efficient? Explain.
(d) If you were to propose a contract that both Principal and Agent prefer to the one
considered above, which of the following suggestions would you make? Explain
your answer by making use of an Edgeworth box.
1. Increase both wg and wb .
2. Decrease both wg and wb .
3. Increase wg and decrease wb .
4. Decrease wg and increase wb .
6.7 Exercises 203
1,000 0A
Pic
E
D
650 550
C
Aic
45o
0P 1,000
Exercise 6.10 Consider the Edgeworth box of Figure 6.25. Clearly, the Agent is risk
neutral while the Principal is risk averse.
√ The Principal’s von Neumann-Morgenstern
utility-of-money function is UP (m) = m. The probability of the good state is 25 .
(a) What is the value of X g , the surplus in the good state?
(b) What is the value of X b , the surplus in the bad state?
(c) Calculate the expected utility of both parties at contract C.
(d) Find the coordinates of contract D and calculate the expected utility of both
parties at that contract.
(e) Find the coordinates of contract E and calculate the expected utility of both parties
at that contract.
(f) Find three contracts that Pareto dominate contract C.
Exercise 6.11 Let the surplus in the good state be X g = $1, 000 and the surplus in the
bad state be X b = $400.
√ The Principal’s von Neumann-Morgenstern utility-of-money
function is UP (m) = m while the Agent’s von Neumann-Morgenstern utility-of-
money function is UA (m) = ln(m). Is contract C = (600, 300) (where, as usual, the first
coordinate is what the Agent gets in the good state and the second coordinate is what
the Agent gets in the bad state) Pareto efficient?
204 Chapter 6. Risk Sharing and Pareto Efficiency
6.7.4 Exercises for Section 6.4: Pareto efficient contracts on the sides of the
Edgeworth box
Exercise 6.12 Let the surplus in the good state be X g = $1, 000 and the surplus in the
bad state be X b = $600. The probability of the good state is 23 . The Agent is risk neutral,
while the Principal is risk averse with von Neumann-Morgenstern utility-of-money
function UP (m) = ln(m). Consider contract C = (250, 150) (where, as usual, the first
coordinate is what the Agent gets in the good state and the second coordinate is what
the Agent gets in the bad state).
(a) Is contract C Pareto efficient?
(b) Represent contract C in an Edgeworth box and sketch the indifference curves of
Principal and Agent through point C.
(c) Find a Pareto efficient contract, call it D, such that the Agent is indifferent between
C and D while the Principal prefers D to C.
(d) Represent contract D in the Edgeworth box of Part (b).
Exercise 6.13 Let the surplus in the good state be X g = $2, 000 and the surplus in the
bad state be X b = $1, 200. The probability of the good state is 79 . The Principal is risk
neutral, while the Agent
√ is risk averse with von Neumann-Morgenstern utility-of-money
function UA (m) = m. Consider contract C = (1559, 1004) (where, as usual, the first
coordinate is what the Agent gets in the good state and the second coordinate is what
the Agent gets in the bad state).
(a) Is contract C Pareto efficient?
(b) Represent contract C in an Edgeworth box and sketch the indifference curves of
Principal and Agent through point C.
(c) Find a Pareto efficient contract, call it D, such that the Agent is indifferent between
C and D while the Principal prefers D to C.
(d) Represent contract D in the Edgeworth box of Part (b).
Exercise 6.14 As in Exercise 6.13, let the surplus in the good state be X g = $2, 000
and the surplus in the bad state be X b = $1, 200. The probability of the good state is 79 .
However, consider now the case where both the Principal and the Agent are risk√averse
√ von Neumann-Morgenstern utility-of-money function: UP (m) = m and
with the same
UA (m) = m. Consider again contracts C = (1559, 1004) and D = (1493.04, 1200) of
Exercise 6.13.
(a) Is contract C Pareto efficient?
(b) How does the Principal rank the two contracts C and D?
(c) How does the Agent rank the two contracts C and D?
(d) Is contract D Pareto efficient?
6.7 Exercises 205
Exercise 6.15 Let the surplus in the good state be X g = $3, 000 and the surplus in the
bad state be X b = $1, 800. Both the Principal and the Agent are risk averse with the
same von Neumann-Morgenstern utility-of-money function:
m m
UP (m) = ln 1 + and UA (m) = ln 1 + .
1, 000 1, 000
(a) Find the range of values of wg for which contracts of the form (wg , 0) with
0 ≤ wg ≤ X g are Pareto efficient. [Hint: refer to Figure 6.15 on page 186.]
(b) Find the range of values of wg for which contracts of the form (wg , 1800) with
0 ≤ wg ≤ X g are Pareto efficient.
Exercise 6.16 Mr. P wants to hire Ms. A to run his firm. If Ms. A works for Mr.
P, one of two outcomes will occur: the profit of the firm will be $520 (this occurs
with probability 45
98 ) or it will be $200 (with probability
√
53
98 ). Mr. P’s von Neumann-
Morgenstern utility-of-money function is U(m) = m, while Ms. A is risk neutral.
Consider the following contract, call it C: Ms. A will get $144 if the profit of the firm
turns out to be $520, and she will get $90 if the profit of the firm turns out to be $200.
(a) What is Mr. P’s expected utility from this contract?
(b) What is Ms. A’s expected utility from this contract?
(c) Find a Pareto efficient contract in the Edgeworth box, call it D, that Mr. P
considers to be just as good as C and Ms. A prefers to C.
(d) Find a a Pareto efficient contract in the Edgeworth box, call it E, that Mr. P
prefers to C and Ms. A considers to be just as good as C.
(e) How does Mr. P rank the three contracts, C, D and E?
(f) How does Ms. A rank the three contracts, C, D and E?
206 Chapter 6. Risk Sharing and Pareto Efficiency
6.7.5 Exercises for Section 6.5: The Edgeworth box when the parties have positive
initial wealth
Exercise 6.19 Suppose that there are three possible outcomes as shown in the following
table:
outcome: X1 = $500 X2 = $600 X3 = $800
1 3 12
probability: 16 16 16
The Principal
√ is risk averse with von Neumann-Morgenstern utility-of-money function
UP (m) = m, while the Agent is risk neutral. Consider the contract, call it C, where
the Agent is given $400 no matter what the outcome.
(a) Calculate the expected utility from contract C for the two parties.
(b) Find a contract, call it D, that (1) Pareto dominates C and (2) is Pareto efficient.
The Principal
√ is risk averse with von Neumann-Morgenstern utility-of-money function
UP (m) = m, while the Agent is risk neutral. Suppose that the Principal offers the
following contract, call it C, to the Agent: the Agent will be paid $20 no matter what
the outcome.
(a) What is the Principal’s expected utility from contract C? What is the Agent’s
expected utility?
(b) Consider an alternative contract, call it D: the Agent will be paid nothing is the
outcome is $141 or $164 or $461, while she will be paid $60 if the outcome is
either $645 or $749. Is contract D Pareto superior to contact C?
For Exercises 6.21 and 6.22 make use of Propositions 6.6.1 and 6.6.2 and the logic used in
the proof of Proposition 6.22.
The Principal is risk neutral while the Agent is risk averse with von Neumann-Morgenstern
m
utility-of-money function UP (m) = ln 100 . Consider the following contract (where the
first coordinate is what the Agent gets if the the outcome is $141, the second coordinate
is what she gets if the outcome is $164, etc.): C = (141, 164, 400, 500).
Is contract C Pareto efficient? If your answer is ‘Yes’, then explain why, if your answer
is ‘No’ then find a contract that Pareto dominates C.
208 Chapter 6. Risk Sharing and Pareto Efficiency
The Principal
√ is risk averse with von Neumann-Morgenstern utility-of-money function
UP (m) = m and the Agent is risk averse
with von Neumann-Morgenstern utility-of-
m
money function UA (m) = ln 100 . Consider the following contracts:
C = (w1 = 100, w2 = 100, w3 = 100), D = (w1 = 164, w2 = 336, w3 = 450).
(a) Is contract C Pareto efficient?
(b) Is contract D Pareto efficient?
with corresponding expected utility of 23 V (2, 500)+ 13 V (0) = 23 2(1, 500)+ 31 2(0) =
2, 000. Thus Joanne is indifferent between the two contracts.
210 Chapter 6. Risk Sharing and Pareto Efficiency
origin for
the Agent
100 60 0A
60 60
D
40 80
C
0P 100 140
origin for
the Principal
(b) From
the point
of view of the Principal, contract C corresponds to the
money lottery
$140 $40 $100 $60
1 3 and contract D corresponds to the money lottery 1 3
4 4 4 4
The expected value of C is 14 140 + 43 40 = 65 and the expected value of D is
1 3
4 100 + 4 60 = 70. Thus a risk-neutral Principal prefers contract D to contract
C.
(c) From
the point
of view of the Agent, contract C corresponds to the
money lottery
$60 $80 $100 $60
1 3 and contract D corresponds to the money lottery 1 3
4 4 4 4
The expected value of C is 14 60 + 34 80 = 75 and the expected value of D is
1 3
4 100 + 4 60 = 70. Thus a risk-neutral Agent prefers contract C to contract D.
6.8 Solutions to Exercises 211
A P E P C P B P D.
D A E A C ∼A B A A.
For the Agent the expected utility of contract C is E[UA (C)] = 41 ln (60) + 43 ln (80) = 4.31
and the expected utility of contract D is E[UA (D)] = 41 ln (100) + 34 ln (60) = 4.22. Thus
the Agent prefers C to D.
Hence it is neither the case that C Pareto dominates D nor the case that D Pareto
dominates C.
(c) One such contract is contract D shown in Figure 6.27. There are many more: any
contract on the portion of the straight-line indifference curve of the Agent which lies
above the indifference curve of the Principal. Of all those contracts, contract D is
the only one that is Pareto efficient.
400 0A
200
C Pic
45o
Aic
0P
Figure 6.27: The Edgeworth box for Exercise 6.7. Pic stands for "indifference curve of the
Principal" and Aic for "indifference curve of the Agent".
6.8 Solutions to Exercises 213
(a) Since the Principal is risk neutral and the Agent is risk averse, a Pareto efficient
contract in the interior of the Edgeworth box must be on the 45o line out of the origin
for the Agent, that is, it must be a contract that guarantees a fixed amount w to the
Agent. Thus w is the solution to
1 3
(1, 206 − w) + (676 − w) = 232.5
4 4
which is w = 576.
√
(b) The Agent’s expected utility is 576 = 24.
√ √
(c) It must be a contract of the form C = (wg , 676) with 14 wg + 34 676 = 27. The
solution is wg = 900. Thus the contract is C = (900, 676).
(d) At contract C = (900, 676) the Principal’s expected utility is 14 (1, 206 − 900) +
3
4 (0) = 76.5.
1
U 0 (X g − wg ) U 0 (600) 600 400 2
= = = = = 0.67
U 0 (X b − wb ) U 0 (400) 1
400
600 3
and
(d) From the above calculations we see that the Principal’s indifference curve through
the proposed contract is steeper than the Agent’s indifference curve. Thus any
contract in the shaded region in Figure 6.28 is Pareto superior to the given contract.
Hence wg should be decreased and wb should be increased.
214 Chapter 6. Risk Sharing and Pareto Efficiency
400 0A
Pic
45o
200
Aic
45o
0P
Aic
325 250 0A
D
Pic
150
C
Pic
45o
0P
(a) Contract C is not Pareto efficient, because it is in the interior of the Edgeworth box
and yet does not lie on the 45o line out of the Agent’s origin.
point that satisfies these two inequalities is the point (wg = 1425.16, wb = 1425.16)
which lies outside the Edgeworth box (since wb = 1425.16 > X b = 1, 200). Thus
the point we are seeking must be on the bottom√side of√the Edgeworth box, that
is it must be a point D = (wg , 1200) such that 79 wg + 29 1200 = 37.7512; hence
D = (1493.04, 1200).
1,559 0A
45o
Aic
C
196 1,004
D
0P 441
Pic
√
UP0 (441) 196 2
0 =√ = = 0.667
UP (196) 441 3
while
√
UA0 (1, 559) 1, 004
0 =√ = 0.8025.
UA (1, 004) 1, 559
√ √
(b) For the Principal the expected utility of contract C is 79 441 + 29 196 = 175 =
7√ 2
√9
19.4444 and the expected utility of contract D is 9 2, 000 − 1, 493.04 + 9 0 =
17.5123. Thus the Principal prefers contract C to contract D.
(c) As calculated in Exercise 6.13, the Agent is indifferent between contract C and
contract D.
(d) It follows from Parts (b) and (c) that contract C Pareto dominates contract D; hence
contract D is not Pareto efficient.
6,000
The solution is wg ≤ 19 = 315.7895.
(b) A contract of the form (wg , 1800) lies on the bottom side of the Edgeworth box and
thus is Pareto efficient if and only if the Principal’s indifference curve at that point is
as steep as, or steeper than, the Agent’s indifference curve (see Figure 6.15 on page
186), that is, if and only if
51,000
The solution is wg ≥ 19 = 2, 684.2105.
218 Chapter 6. Risk Sharing and Pareto Efficiency
√1 1
2 651 450
which is not the case since = 0.961 and = 0.222. Thus contract C is not
√1 1
2 601 100
220 Chapter 6. Risk Sharing and Pareto Efficiency
300
D
200
600
0P 200 E
Figure 6.31: The Edgeworth box for Part (a) of Exercise 6.18.
Pareto efficient.
√ √
(d) For the Principal the expected utility of contract D is 12 501 + 12 301 = 19.866
√
which is greater than 201 = 14.177 (the utility of his initial wealth), hence the
Principal prefers contract D to keeping his initial wealth. For the Agent the expected
utility of contract D is 12 ln(6) + 12 ln(4) = 1.589 which is greater than ln(2) = 0.693
(the utility of her initial wealth), hence the Agent prefers contract D to keeping her
initial wealth. Hence contract D is individually rational.
(e) Contract D is Pareto efficient if and only if the indifference curve of the Principal at
point D has the same slope as the indifference curve of the Agent at that point, that
is, if and only if
√1 1
UP0 (500) UA0 (500) 2 501 600
= , that is, =
UP0 (300) UA0 (300) √1 1
400
2 301
√1 1
2 501 600
which is not the case since = 0.775 while = 0.667. Thus contract D is
√1 1
2 301 400
not Pareto efficient.
√ √
(f) For the Principal the expected utility of contract E is 12 301 + 12 1 = 9.175 which
√
is less than 201 = 14.177 (the utility of his initial wealth), hence the Principal
prefers keeping his initial wealth to contract E, so that contract E is not individually
rational.
(g) Contract E is Pareto efficient if and only if the indifference curve of the Principal
is steeper than, or as steep as, the indifference curve of the Agent at that point (see
6.8 Solutions to Exercises 221
√1 1
2 301 800
which is not the case since 1 = 0.058 and 1 = 0.875. Thus contract E is not
√
2 1 700
Pareto efficient.
(h) See Figure 6.32.
300
D
200
600
0P 200 E Pic
Aic
Figure 6.32: The Edgeworth box for Part (h) of Exercise 6.18.
1 3 6
E[UA (B)] = ln(1.41) + ln(1.64) + ln(4.5) = 1.0852
10 10 10
1 3 3 3
E[UA (C)] = ln(1.41) + ln(1.64) + ln(4) + ln(5) = 1.0815.
10 10 10 10
Thus the Agent prefers B to C. For the Principal we can take the identity function as his
utility-of-money function so that
1 3 3 3
E[UP (B)] = 0 + 0 + (461 − 450) + (645 − 450) = 61.8
10 10 10 10
1 3 3 3
E[UP (C)] = 0 + 0 + (461 − 400) + (645 − 500) = 61.8
10 10 10 10
Thus the Principal is indifferent between B and C and therefore contract B Pareto dominates
contract C.
6.8 Solutions to Exercises 223
3√ 3√ 4√
E[UP (C)] = 20 + 361 + 145 = 11.858,
10 10 10
we want x to be such that
3√ 7√
E[UP (E)] = 20 + x = 11.858.
10 10
Thus x = 225.70. It remains to show that the Agent prefers contract E to con-
tract C. E is the contract (w1 = 144, w2 = 461 − 225.70, w3 = 645 − 225.70) =
(144, 235.3, 419.3). Since the Agent is risk neutral, she ranks contracts according to
their expected value:
3 3 4
E[C] = 144 + 100 + 500 = 273.2 and
10 10 10
3 3 4
E[E] = 144 + 235.3 + 419.3 = 281.51.
10 10 10
Thus the Agent prefers E to C and hence E Pareto dominates C.
(b) Contract D is not Pareto efficient because, contrary to Proposition 6.6.2, although
X1 − w1 = 164 < X2 − w2 = 461 − 80 = 381 < X3 − w3 = 645 − 200 = 445, it is not
the case that w2 = 0. Let us find an alternative contract, call it F, that the Principal
considers just as good as contract D and gives the Principal the same income is states
2 and 3. Since
3√ 3√ 4√
E[UP (D)] = 164 + 381 + 445 = 18.1357,
10 10 10
we need to solve the equation
3√ 7√
164 + x = 18.1357.
10 10
The solution is x = 416.966. Thus F = (0, 461−416.966, 645−416.966) = (0, 44.034, 228.034).
Since the Agent is risk neutral, she ranks contracts according to their expected value:
3 4 3 4
E[D] = 10 80 + 10 200 = 104 and E[F] = 10 44.034 + 10 228.034 = 104.424. Thus
the Agent prefers contract F to contract D, so that D is Pareto dominated by F.
224 Chapter 6. Risk Sharing and Pareto Efficiency
.
The expression ‘adverse selection’, or ‘hidden type’, refers to situations in which one
party to a contract (e.g. a buyer) possesses information relevant to the contract that is
not available to the opposing party (e.g. a seller). Thus it is a situation of asymmetric
information.
For example, in the context of health insurance, the insurance company (the seller
of insurance) is typically aware of the fact that there are individuals who – because of
their family history – are at a higher risk of developing a condition that requires extensive
medical services, while other individuals represent a lower risk. If the insurance company
offers a contract that would, on average, be profitable if everybody (high-risk and low-risk
individuals) were to purchase that contract, it might discover that its costs are much higher
than expected, because only (or mostly) the high-risk individuals ended up purchasing
the contract.1 In a situation of adverse selection the uninformed party realizes that the
terms of a proposed contract determine the composition of the pool of individuals who
will find that contract acceptable and a change in the contract that would be beneficial to
the uninformed party, if everybody found the contract acceptable, might have undesirable
consequences because of an adverse change in the proportion of “bad types” within the
pool of applicants. Adverse selection in insurance markets will be discussed in the next
chapter. In this chapter we will discuss how adverse selection arises in other markets and
what its consequences are.
1 Thiswas the rationale behind the original provision of the Affordable Care Act that established a
mandate for individuals to purchase health insurance.
228 Chapter 7. Adverse Selection or Hidden Type
2 In
“The market for ‘lemons’: qualitative uncertainty and the price mechanism”, Quarterly Journal of
Economics, 1970, Vol. 84, pp. 488-500. The Nobel Memorial Prize in Economics was awarded in 2001
to George Akerlof, Michael Spence and Joseph Stiglitz "for their analyses of markets with asymmetric
information".
7.2 Conditional probability and belief updating 229
your 50 customers
Type L Type H Type H
from to .
50% 50% 100%
This simple example illustrates the logic of the adverse selection phenomenon.
In order to be able to develop a more general analysis, we need to take a side tour into
the topic of conditional probability and belief updating. To illustrate the reason for this,
consider te following, slightly more complex, version of the above example, where instead
of two types there are three: L, M and H. The types are characterized as follows (where
‘reservation price’ is the maximum price that the individual is willing to pay for a buffet):
type: L M H
2 3 1
fraction of total population: 6 6 6
reservation price: $6 $6.50 $7
cost of food consumed: $3 $6 $9
If you charge $6 for the buffet then all types will be willing to enter your restaurant.
Thus (assuming that the probability of any given customer being of a particular type is
equal to the fraction of that type in the population) from the point of view of cost admitting
a customer correspond to playing the following lottery:
cost of serving the customer: $3 $6 $9
2 3 1
probability: 6 6 6
so that the expected cost of serving each customer is 26 3 + 63 6 + 16 9 = $5.50. On the other
hand, if you charge $6.50 then the L-types will not enter your restaurant and thus, from
the from point of view of cost, admitting a customer correspond to facing the following
lottery:
cost of serving the customer: $6 $9
probability: p 1− p
What is the value of p? This question is answered in the following section.
factors (state e). Then we can represent the doctor’s state of uncertainty by the set
{a, b, c, d, e}, as shown in Figure 7.1.
Of all the states that are possible, the individual might consider some to be more likely
than others and might even dismiss some states as “extremely unlikely” or “implausible”.
To represent the individual’s probabilistic beliefs we use a probability distribution over the
set of states.
Continuing the doctor’s example, suppose that - based upon her past experience with
similar cases - she considers a bacterial infection (state a) to be twice as likely as a viral
infection (state b), she considers a food allergy (state d) to be three times as likely as a drug
allergy (state c), which - in turn - she considers to be as likely as environmental factors
(state e); that is, she has the following beliefs:
state: a b c d e
.
4 2 1 3 1
probability: 11 11 11 11 11
Suppose that the doctor can acquire further information by ordering a blood test. How
should she revise her beliefs when she learns the result of the blood test? In order to
answer this question we need to introduce the notion of conditional probability and be
more precise about what we mean by information.
Note that the probability conditional on an event E is defined only if P(E) > 0, that is, if
E is an event that has positive unconditional probability.
Conditional probability can also be extended to events, as follows. Let D and E be
events with P(E) > 0. Then the probability of D conditional on E, denoted by P(D|E) is
defined as3
P(D|E) = ∑ P(s|E). (7.2)
s∈D
suppose that the doctor can order a blood test: a positive blood test will reveal that there is
an infection and rule out states c, d and e, while a negative blood test will reveal that there is
no infection, thus ruling out states a and b. We can represent the information obtained from
a blood test as a set of states (an event): the set {a, b} represents the information conveyed
by a positive blood test, while the set {c, d, e} represents the information conveyed by a
negative blood test.
Suppose that the doctor orders the blood test and it comes back positive. How should
she revise her beliefs in light of this piece of information?
The issue of how to “rationally” modify one’s initial beliefs - expressed as a probability
distribution P on a finite set U - after receiving an item of information (represented by a
subset F of U) has been studied extensively by philosophers and logicians. Two different
situations may arise:
• In one case, the item of information F was not ruled out by the initial beliefs, in
the sense that event F was assigned positive probability (P(F) > 0). Information
might still be somewhat surprising, in case P(F) is small (close to zero), but it is not
completely unexpected. We call this case belief updating.
• The other case is where the item of information was initially dismissed, in the sense
that it was assigned zero probability (P(F) = 0). In this case the information received
is completely surprising or completely unexpected. We call this case belief revision.
In this book we will only address the issue of belief updating.4
It is generally agreed that the rational way to update one’s beliefs is by conditioning
the initial beliefs on the information received, that is, by using the conditional probability
formula (7.1).
3 Typically, conditional probability is defined directly for events as follows: if D, E ⊆ U and P(E) > 0
then P(D|E) = P(D∩E)P(E) where D ∩ E is the intersection of D and E, that is, the set of states that belong to
both D and E. The reader should convince himself/herself that the conjunction of (7.1) and 7.2 is equivalent
to this definition (interpreting P(s|E) as P({s}|E) and recalling that P(∅) = 0).
4 The issue of belief revision is discussed in my book Decision Making (see
http://faculty.econ.ucdavis.edu/faculty/bonanno/).
232 Chapter 7. Adverse Selection or Hidden Type
Definition 7.2.1 We use the expression belief updating or Bayesian updating to refer
to the modification of initial beliefs (expressed by an initial probability distribution
P) obtained by applying the conditional probability rule; this assumes that the belief
change is prompted by the arrival of new information, represented by an event F such
that P(F) > 0.
Thus, when receiving a piece of information F ⊆ U such that P(F) > 0, one would
change one’s initial probability distribution P into a new probability distribution Pnew by
• reducing the probability of every state in U \ F (the complement of F) to zero (this
captures the notion that the information represented by F is trusted to be correct),
and
• setting Pnew (s) = P(s|F) for every state s ∈ F.
0 if s ∈
/F
Pnew (s) = P(s|F) = (7.3)
P(s) if s ∈ F
P(F)
(recall the assumption that P(F) > 0). Thus, for every event E ⊆ U, Pnew (E) = ∑ Pnew (s) =
s∈E
∑ P(s|F) = P(E|F).
s∈E
For instance, in the doctor’s example, belief updating requires the following. Recall
that the doctor’s initial beliefs are:
state: a b c d e
.
4 2 1 3 1
probability: 11 11 11 11 11
6
Let + be the event that the blood test is positive (that is, + = {a, b} and thus P(+) = 11 ).
5
Let – be the event that the blood test is negative (that is, – = {c, d, e} and thus P(–) = 11 ).
Then
state a b c d e
4 2 1 3 1
initial beliefs 11 11 11 11 11
4 2
beliefs updated on information + 11
6 = 2
3
11
6 = 1
3 0 0 0
11 11
1 3 1
beliefs updated on information – 0 0 11
5 = 1
5
11
5 = 3
5
11
5 = 1
5
11 11 11
7.2 Conditional probability and belief updating 233
As a further example, suppose that there are only three students in a class: Ann, Bob
and Carla. The professor tells them that in the last exam one of them got 95 points (out of
100), another 78 and the third 54.
We can think of a state as a triple (a, b, c), where a is Ann’s score, b is Bob’s score and c is
Carla’s score. Then, based on the information given by the professor, the set of possible
states is:
U = {(95, 78, 54), (95, 54, 78), (78, 95, 54), (78, 54, 95), (54, 95, 78), (54, 78, 95)} .
Suppose that in all the previous exams Ann and Bob always obtained a higher score than
Carla and often Ann outperformed Bob. Then Ann might consider states (95,78,54) and
(78,95,54) much more likely than (78,54,95) and (54,78,95).
For example, suppose that Ann’s beliefs are as follows:
(95, 78, 54) (95, 54, 78) (78, 95, 54) (54, 95, 78) (78, 54, 95) (54, 78, 95)
.
16 8 4 2 1 1
32 32 32 32 32 32
Suppose that, before distributing the exams, the professor says “I was surprised to see that,
this time, Ann did not get the highest score”. Based on this information, how should Ann
revise her beliefs? The information is that Ann did not receive the highest score, which is
represented by the event
F = {(78, 95, 54), (54, 95, 78), (78, 54, 95), (54, 78, 95)}.
These updated beliefs can be represented more succinctly as follows, by not listing the
states that are ruled out by information F (that is, the states in the complement of F, which
are zero-probability states in the updated beliefs):
(78, 95, 54) (54, 95, 78) (78, 54, 95) (54, 78, 95)
. (7.5)
4 2 1 1
8 8 8 8
The belief updating rule can also be applied sequentially if one first receives information
F (with P(F) > 0) and later receives a further piece of information E (with E ⊆ F and
P(E|F) > 0 or, equivalently, P(E ∩ F) > 0). For instance, in the above example, suppose
that the professor first informs the students that Ann did not get the highest score and later
tells them that Carla received a higher score than Bob. Call F the first piece of information
and E the second piece of information. Then, as we saw above, F is the set of states where
it is in fact true that Ann did not get the highest score:
F = {(78, 95, 54), (54, 95, 78), (78, 54, 95), (54, 78, 95)}.
234 Chapter 7. Adverse Selection or Hidden Type
On the other hand, E is the set of states where it is in fact true that Carla received a higher
score than Bob:
E = {(95, 54, 78), (78, 54, 95), (54, 78, 95)}.
Ann’s updated beliefs after learning information F are given above in (7.4). Updating
those beliefs by conditioning on E yields
(95, 78, 54) (95, 54, 78) (78, 95, 54) (54, 95, 78) (78, 54, 95) (54, 78, 95)
1 1
0 0 0 0 2 2
Expressing all of this more succinctly, we have that the updated beliefs after learning F
are as given in (7.5) above, namely
(78, 95, 54) (54, 95, 78) (78, 54, 95) (54, 78, 95)
4 2 1 1
8 8 8 8
and the final beliefs are obtained by conditioning these beliefs on E or, equivalently, by
conditioning the initial beliefs on E ∩ F:
(78, 54, 95) (54, 78, 95)
.
1 1
2 2
Cars are of different qualities. Let there be n possible quality levels: q1 , q2 , . . . , qn with
qi denoting a lower quality level than qi+1 , for every i = 1, 2, . . . , n − 1. We express this as
follows:
q1 ≺ q2 ≺ · · · ≺ qn . (7.6)
We assume that each current owner of a car of quality qi (a Group-S individual) considers
owning that car to be just as good as having $Sqi ; thus we can think of Sqi as her reservation
price for the car, in the sense that she would be happy to sell her car at any price P > Sqi .5
We call Sqi the value of a car of quality qi to its initial owner. Given the assumed ranking
of possible qualities (see (7.6)), it is natural to postulate that higher-quality cars have a
higher value to their owners than lower-quality cars, that is,
On the other side of the market, we assume that each individual within Group B considers
becoming the owner of a car of quality qi to be just as good as having $Bqi ; thus we can
think of Bqi as his reservation price for such a car, in the sense that he would be happy to
buy a car of quality qi at any price P < Bqi .6 We call Bqi the value of a car of quality qi
to a Group-B individual. Given the assumed ranking of possible qualities (see (7.6)), it is
natural to postulate that higher-quality cars have a higher value to potential buyers than
lower-quality cars, that is,
This means that, for every car, there are potential gains from trade, in the sense
that for a car of quality qi there is a price P such that (1) an individual in Group B
would be willing to pay $P for that car and (2) the owner of such a car (a Group-S
individual) would be happy to sell at price P and (3) the trade would make each
party better off (indeed this would be true for every P such that Sqi < P < Bqi ).
5 For example, the utility-of-money function of the owner of a car of quality qi could be
m + Sq if she has a car of quality qi
i
UqSi (m) = . Let m be her initial endowment of money; then, since
m if she has no car
she owns a car of quality qi her current utility is m + Sqi , while if she sells her car for $P then her utility is
m + P. Thus she is better off selling her car if and only if P > Sqi (and she is indifferent if P = Sqi ).
6 For example, his utility-of-money function could be
m + Bq if he has a car of quality qi
i
UqBi (m) = . Let m be his initial endowment of money; then, since
m if he has no car
he currently does not own a car, his current utility is m, while if he buys a car of quality qi at price $P then
his utility is m − P + Bqi . Thus he is better off buying such a car if and only if P < Bqi (and he is indifferent
if P = Bqi ).
236 Chapter 7. Adverse Selection or Hidden Type
• Information about quality is asymmetric: each car-owner knows the quality of her
own car (because she has had extensive experience with it and knows how well she
took care of it), while a potential buyer is not able to determine the quality of a car
by mere inspection.7 Of course, a potential buyer can ask the owner what the quality
of her car is, but the owner would have an incentive to claim that it is higher than it
actually is, hoping to sell the car for a higher price.
By (7.9) the initial situation (where cars are currently in the hands of Group-S individuals)
is Pareto inefficient: it would be a Pareto improvement to have each car sold to a Group-
B individual (at an appropriate price). We now show that the postulated asymmetric
information prevents the potential gains from trade from being realized, in the sense that
some cars will remain in the "wrong hands", that is, in the hands of Group-S individuals.
The crucial observation is that, although Group-B individuals have different reservation
prices for different quality levels, there cannot be different prices on the market for cars of
different qualities, because buyers are unable to determine the quality of the cars that are
offered for sale and thus would not know if they should pay a high price or a low price;
furthermore, they realize that the owners of low-quality cars would have an incentive to
claim that their cars are of high quality in order to obtain a higher price and thus any claims
made by the sellers would have to be discarded as not credible. Hence there can only be
one price for all second-hand cars.
Let us begin with an example. Let there be six quality levels. For each quality level,
the fraction of cars of that quality and the value of a car of that quality to its current owner
and to a potential buyer is shown in the following table:
quality: q1 q2 q3 q4 q5 q6
1 3 4 1 2 1
proportion: 12 12 12 12 12 12
value to current owner Sqi : $900 $1, 800 $2, 700 $3, 600 $4, 500 $5, 400
value to potential buyer Bqi : $1, 000 $2, 000 $3, 000 $4, 000 $5, 000 $6, 000
7 Some features of a car, such as mileage, exterior damage, etc., can be ascertained by a potential buyer
upon inspection. Other features cannot: for example, whether it was driven with care, whether the car was
affected by a flood or involved in an accident, etc. These are hidden features and it is natural to assume that
they are known to the current owner, but cannot be ascertained by a potential buyer.
8 We make this assumption in order to show that the inefficiencies that arise have nothing to do with the
buyers being risk averse. An example where the buyers are risk averse is given in Exercise 7.10.
9 Without this information, the potential buyers would find themselves in an even worse position.
7.3 The market for used cars 237
A naïve potential buyer would reason as follows (recall the assumption of risk
neutrality):
“For me, picking a car at random corresponds to facing the money lottery
$1, 000 $2, 000 $3, 000 $4, 000 $5, 000 $6, 000
(7.10)
1 3 4 1 2 1
12 12 12 12 12 12
Thus, as long as the price P of a second-hand car is less than $3,250, I expect
to make a positive net gain by buying a car.”
Suppose that, in fact, the price of a second-hand car is $3,000. Should the potential
buyer decide to buy a car that is offered for sale at that price, expecting a gain of
$(3, 250 − 3, 000) = $250? The answer is negative, because when the price is $3,000
not every car will be offered for sale: the owners of cars of qualities q4 , q5 and q6 will not
be willing to sell.10 Thus only cars of qualities q1 , q2 and q3 will be offered for sale when
the price is $3,000, so that buying a car at that price means facing the following lottery,
where the probabilities are obtained from (7.10) by conditioning on the event {q1 , q2 , q3 }:
$1, 000 $2, 000 $3, 000
(7.11)
1 3 4
8 8 8
Thus the buyer would be paying $3,000 expecting a benefit of only $2,375! That is, buying
a car for $3,000 means facing an expected loss of $625.
Might paying a price somewhat below $2,375 be a good idea? For example, should the
buyer be willing to pay $2,000 for a second-hand car? The answer is again negative: when
the price is $2,000 the owners of cars of quality q3 will drop out of the market (because
they value their cars at $2,700: more than the price) and thus buying a car for $2,000
means facing the lottery11
$1, 000 $2, 000
(7.12)
1 3
4 4
Thus the buyer would be paying $2,000 expecting a benefit of only $1,750! That is, buying
a car for $2,000 means facing an expected loss of $250.
It is only when the price P is in the range between $900 and $1,000, that buying a
car at that price is rational for the buyer: when 900 < P < 1, 000 then only cars of the
lowest quality, namely q1 , are offered for sale and the buyer knows this (such cars are
referred to as “lemons”; hence the expression ‘the market for lemons’ used by Akerlof in
his seminal paper). Thus in this example we have an extreme inefficiency: of all the used
cars in existence, only the lowest-quality cars are being traded in the market, despite the
fact that for every car that is not traded there is a potential buyer who would be willing to
pay a price that exceeds the reservation price of the owner of that car.
We can now define what an equilibrium price in the market for used cars is in the
presence of adverse selection.
The first condition in Definition 7.3.1 says that at price P all the qualities q1 , . . . , qm are
offered for sale and none of the qualities above qm are offered for sale.12 Thus, conditioning
on the fact that the qualities offered for sale are precisely {q1 , . . . , qm } the buyer realizes
that buying a used car at price P means facing the following money lottery in terms of
potential net gain:
$(Bq1 − P) . . . $(Bqm − P)
p1 pm
(7.14)
m ... m
∑ pi ∑ pi
i=1 i=1
12 Note that, by (7.7), (1) Sqm < P implies that, for every i = 1, . . . , m − 1, Sqi < P and
(2) Sqm+1 > P implies that, for every i = m + 2, . . . , n, Sqi > P.
7.3 The market for used cars 239
The example analyzed above illustrates the logic of the adverse selection phenomenon,
which is as follows. In order for all the used cars to be put up for sale, the price P must
exceed the value of the top-quality car for the owner of such a car, that is, it must be that
P > Sqn . When all the used cars are offered for sale, a potential buyer faces the following
lottery in terms of potential net gain:
$(Bq1 − P) . . . $(Bqn − P)
(7.15)
p1 ... pn
whose expected value is p1 Bq1 + · · · + pn Bqn − P. Note that
p1 Bq1 + · · · + pn Bqn < Bqn
(since it is a convex combination of all the values Bqi , of which Bqn is the largest).
Our hypothesis is that Bqn > Sqn and thus it is, in principle, possible that
p1 Bq1 + · · · + pm Bqn > Sqn , in which case one can find a price P such that
Sqn < P < p1 Bq1 + · · · + pn Bqn .
If that is the case, then at such a price all the cars would be traded and thus Pareto efficiency
would be achieved. In the above example, on the other hand, we had that
p1 Bq1 + · · · + pn Bqn < Sqn (7.16)
and thus at any price P that would induce all the car-owners to offer their cars for sale
(that is, at any P such that P > Sqm ) no potential buyer would be willing to buy, so that the
highest-quality cars would not be traded, leading to a Pareto inefficient situation.
Then the reasoning can repeated for the case of all qualities but the top one being
offered for sale. This would require a price P such that Sqn−1 < P < Sqn . Then the lottery
faced by a potential buyer contemplating buying a used car offered for sale would be
$(Bq1 − P) . . . $(Bqn−1 − P)
(7.17)
p1 ... pn−1
whose expected value is p1 Bq1 + · · · + p n−1 Bq n−1 − P. Again, the important observation
is that p1 Bq1 + · · · + pn−1 Bqn−1 < Bqn−1 . Given our assumption that Bqn−1 > Sqn−1 it is,
in principle, possible that p1 Bq1 + · · · + pn−1 Bqn−1 > Sqn−1 , in which case one can find
a price P such that Sqn−1 < P < p1 Bq1 + · · · + pn−1 Bqn−1 , in which case all the qualities
except for the top one would be traded. However, it is also possible - as in our example
above - that p1 Bq1 + · · · + pn−1 Bqn−1 < Sqn−1 , in which case there cannot be a price at
which all the qualities up to qn−1 are traded. This argument can be repeated for every
quality level. In the extreme case - as in our example above - the only possible equilibrium
is one where only the lowest-quality cars are traded. Note that, given our assumptions
(namely (7.9), in particular Sq1 < Bq1 ), an equilibrium whereonly the lowest-quality cars
are traded always exists: any price P such that Sq1 < P < min Bq1 , Sq2 would give rise to
such an equilibrium.
240 Chapter 7. Adverse Selection or Hidden Type
Depending on the difference between the seller’s value and the buyer’s value of each
quality many possibilities arise:
- If the difference is small - as in our example above - then it may be that the only
possible equilibrium is one where only the lowest quality cars are traded. Thus this
is a situation where the bad-quality cars drive the good-quality cars out of the market
and the lack of trading for good-quality cars gives rise to a Pareto inefficiency.
- If the difference is somewhat large then there may be several possible equilibria
that can be “Pareto ranked” in terms of efficiency (this is illustrated in the example
below).
- If the difference is substantially large then among the possible equilibria there is also
the most efficient one where all the cars are traded.
The following is an example of the second possibility listed above. Let there be three qual-
ities: L (for ‘low’), M (for ‘medium’) and H (for ‘high’). The corresponding proportions
and valuations are as shown in the following table:
quality: L M H
1 3 1
fraction: 5 5 5
value to current owner: $10, 000 $14, 000 $16, 000
value to potential buyer: $12, 000 $15, 000 $18, 000
Let us see what values of P (the market price of a used car) give rise to an equilibrium.13
1. If P > 16, 000 then all the cars will be offered for sale. A potential buyer then faces
the following net-gain lottery if he buys a car:
$(12, 000 − P) $(15, 000 − P) $(18, 000 − P)
1 3 1
5 5 5
whose expected value is 15, 000 − P < 0. Thus there is no equilibrium at any such
price.
2. If 14, 000 < P ≤ 16, 000 then only cars of qualities L and M are offered for sale. A
potential buyer then faces the following net-gain lottery if he buys a car:
$(12, 000 − P) $(15, 000 − P)
1 3
4 4
whose expected value is 14, 250 − P. This is positive if and only if P < 14, 250. Thus
- If 14, 250 ≤ P ≤ 16, 000 there is no equilibrium at such price.
- If 14, 000 < P < 14, 250 then, for every such price there is an equilibrium where
all the L-quality and all the M-quality cars are traded, while the H-quality cars
are not.
13 We assume that, if indifferent between offering her car for sale and not offering it, the owner chooses
not to offer it and, if indifferent between buying and not buying, the potential buyer chooses not to buy.
7.3 The market for used cars 241
3. If 10, 000 < P ≤ 14, 000 then only the L-cars are offered for sale, so that a potential
buyer realizes that he would buy a car worth $12,000 to him. Thus
- If 12, 000 ≤ P ≤ 14, 000 there is no equilibrium at such price.
- If 10, 000 < P < 12, 000 then, for every such price there is an equilibrium
where all and only the L-quality cars are traded.
4. If P ≤ 10, 000 then no cars are offered for sale and thus there is no trading.
Clearly, the equilibria under Point 2 are Pareto superior to the equilibria under Point 3,
since more cars are traded. None of them, however, is Pareto efficient, since the H-quality
cars are not traded and remain “in the wrong hands”.
offer for sale the lower-quality ones); in other words, in the market for new cars there is
no asymmetric information between buyer and seller. When buying a new car a buyer is
just as likely to obtain an above-average-quality car as a below-average-quality car; hence
new cars sell for a market price that reflects the average quality of all new cars: there is no
adverse selection in the market for new cars.
One could object to the analysis of this section by pointing out that there are observable
characteristics of used cars, such as mileage and age, that are easily verified by the buyer
and can be the basis for price differences: typically, a 2014 Toyota Corolla will sell for a
lower price than a 2018 Toyota Corolla, or a 2014 Toyota Corolla with 30,000 miles will
sell for a higher price than a 2014 Toyota Corolla with 85,000 miles. The answer to this
objection is that, indeed, observable differences can be associated with different prices, but
within each category of car, e.g. a 2014 Toyota Corolla with approximately 80,000 miles,
there typically are better cars and worse cars and information about the car’s quality is
asymmetric: the seller has that information, the buyer does not. Thus the “lemons problem”
arises within each well-defined category of used car.
The adverse selection problem arises in many different markets. In this chapter we
considered the market for used car, or - more generally - the market for durable goods.
In the next chapter we will analyze adverse selection in the market for insurance. Stglitz
and Weiss14 suggested that adverse selection could be the cause of credit rationing in the
market for unsecured loans, that is, loans for which the borrower cannot offer a collateral.
Consider, for example, a bank that offers loans not knowing the credit worthiness of the
borrowers: some borrowers are “good quality” in that they are low risk, either because
they only borrow if they expect to be able to repay the loan and/or they plan to invest in a
low-return but reasonably safe project, while other borrowers are “low quality” in that they
are high risk, either because they borrow without worrying about defaulting and/or they
plan to invest in high-return but high-risk projects. The low-risk types are more sensitive
to increases in the rate of interest and drop out of the market when the rate of interest is
high, while the high-risk types are less sensitive to increases in the rate of interest and
continue to borrow even if the rate is high. In this case lenders might prefer to keep the rate
of interest low, so as to have a better mix of “good” and “bad” types, even if at that low
rate there is excess demand for loans. That is, if the bank faces excess demand for loans, it
might prefer not to raise its interest rate to clear the market, because by doing so it would
drive away some low-risk borrowers and increase the proportion of high-risk borrowers in
the pool of applicants, thus facing a higher probability of defaulting borrowers.
14 Stiglitz,
Joseph and Andrew Weiss, “Credit rationing in markets with imperfect information”, The
American Economic Review, 1981, Vol. 71, pp. 393-410.
7.4 Exercises 243
7.4 Exercises
The solutions to the following exercises are given in Section 7.5 at the end of this chapter.
7.4.1 Exercises for Section 7.2.2: Conditional probability and belief updating
Exercise 7.4 There is an urn with 40 balls: 4 red, 16 white, 10 blue and 10 black. You
close your eyes and pick a ball at random. Let E be the event “the selected ball is either
red or white”.
(a) What is the probability of E?
(b) Now somebody tells you: “the ball in your hand is not black”. How likely is it
now that you picked either a red or a white ball?
244 Chapter 7. Adverse Selection or Hidden Type
Exercise 7.5 Suppose there are 3 individuals. It is known that one of them has a virus.
A blood test can be performed to test for the virus. If an individual does have the virus,
then the result of the test will be positive. However, the test will be positive also for an
individual who does not have the virus but has a particular defective gene.
It is known that exactly one of the three individuals has this defective gene: it could be
the same person who has the virus or somebody who does not have the virus. A positive
test result will come up if and only if either the patient has the virus or the defective
gene (or both).
Suppose that Individual 1 takes the blood test and the result is positive. Assuming that
all the states are equally likely, what is the probability that he has the virus? [Hint: think
of the universal set (or sample space) U as a list of states and each state tells you which
individual has the virus and which individual has the defective gene.]
Exercise 7.6 In a remote rural clinic with limited resources, a patient arrives complain-
ing of low-abdomen pain. Based on all the information available, the doctor thinks that
there are only four possible causes: a bacterial infection (b), a viral infection (v), cancer
(c), internal bleeding (i). Of the four, only the bacterial infection and internal bleeding
are treatable at the clinic. In the past the doctor has seen 600 similar cases and they
eventually turned out to be as follows:
7.4.2 Exercises for Section 7.3: The market for used cars
Exercise 7.7 There are 200 cars in total. Cars are of different quality. Quality is
measured in terms of the expected number of years of residual life of the car. Make use
of the following information:
Quality: 4 8 10 16
1 3 2 2
Fraction of total number: 8 8 8 8
Value of car to current owner: $2, 000 $3, 000 $4, 000 $5, 000
Exercise 7.8 There are two groups of individuals. Group 1 individuals own cars, while
Group 2 do not. There are four possible quality levels of cars as shown in the following
table, together with the total number of cars of each quality.
Quality: A B C D
Number of cars: 1, 000 2, 000 1, 000 4, 000
The value that a Group-1 individual attaches to a car of a given quality is lower than the
value of the same quality car to a Group-2 individual, as shown in the following table:
Quality: A B C D
Value to Group 1 individuals: $5, 400 $4, 500 $3, 600 $2, 700
Value to Group 2 individuals: $6, 000 $5, 000 $4, 000 $3, 000
The quality of a car is known to the owner but not to the prospective buyer. All
individuals are risk-neutral.
(a) Write the lottery for a Group-2 individual that corresponds to picking a car at
random from the entire pool of cars and calculate the expected value.
(b) Suppose that the price of a second-hand car is $3,800. Should a Group-2 individ-
ual be willing to buy a car at that price? Explain your answer.
(c) Let P be the price at which second-hand cars are traded. What are the possible
values of P such that, at that price, 4,000 cars are traded?
246 Chapter 7. Adverse Selection or Hidden Type
Exercise 7.9 Making use of the information in Exercise 7.8, for every positive price P
determine if there is trading of second-hand cars at that price. Assume that, if indifferent
between selling and not selling, a Group-1 individual would decide to sell and, if
indifferent between buying and not buying, a Group-2 individual would decide to buy.
Exercise 7.10 Let the quality of a second-hand motorcycle be denoted by θ ∈ {1, 2, 3},
where θ is the number of comprehensive tune-ups that the motorcycle received in the
past. The owner knows the value of θ but a potential buyer does not. The value of
a motorcycle of quality θ to the seller is $800θ . Each potential buyer has an initial
wealth
√ of $9,025 and the utility of purchasing a motorcycle √ of quality θ at price P
is 9, 025 − P + 1, 000θ , while the utility of not buying is 9, 025 = 95. Let the
proportion of motorcycles of each quality be as follows (where q is a number strictly
between 0 and 31 ):
θ: 1 2 3
.
2 1
proportion: q 3 −q 3
Suppose that the price of a second-hand motorcycle is P = $1, 744. [In the following
assume that, if indifferent between selling and not selling, the owner of a motorcycle
would sell and, if indifferent between buying and not buying, a potential buyer would
buy.]
(a) Are there values of q such that ALL motorcycles are traded?
(b) Are there values of q such that all motorcycles of quality θ = 1 and θ = 2 are
traded?
(c) Are there values of q such that only motorcycles of quality θ =1 are traded?
Exercise 7.11 Suppose that you have just opened an all-you-can-eat buffet restaurant.
The capacity of your restaurant is 50, that is, you can accommodate at most 50 customers.
Suppose it costs $2 to provide one serving of food (e.g. a standard-size plate). Assume
that you are risk neutral. Each potential customer can be described by a pair (r, c)
where r is the reservation price (the customer will come to your restaurant if and only
if the admission price p is less than or equal to r) and c is the number of servings of
food he/she consumes. There is a total of 240 potential customers. Their types and
corresponding numbers are given in the following table:
Customer type: ($6, 1) ($6, 1.5) ($6.50, 1.5) ($6.50, 2.2) ($7, 2.2) ($7, 3.5)
Number: 20 20 40 60 40 60
What price should you charge if you want to maximize your profits?
7.5 Solutions to Exercises 247
z1 z2 z3 z4 z5 z6 z7 z8
3 5 3 2 6 3 1 1
24 24 24 24 24 24 24 24
a b c d e f g h i
.
11 7 9 16 5 4 8
60 0 60 60 60 60 60 60 0
1
P(V1 ∩ 1+ ) P(V1 ) 3 3
P(V1 |1+ ) = = = 5
= 5 = 60%.
P(1+ ) P(1+ ) 9
Solution to Exercise 7.6. The probabilities are as follows:
b v c i
140 14 110 11 90 9 260 26
600 = 60 600 = 60 600 = 60 600 = 60
(a) The event that the patient has a treatable disease is {b, i}.
14 26
P({b, i}) = P(b) + P(i) = 60 + 60 = 23 .
(b) A negative result of the ultrasound is represented by the event {b, v, c}.
A non-treatable disease is the event {v, c}. Thus,
11 9
P ({v, c} ∩ {b, v, c}) P ({v, c}) 60 + 60
P ({v, c}|{b, v, c}) = = = 14 11 9
= 10
17 = 58.82%.
P ({b, v, c}) P ({b, v, c}) 60 + 60 + 60
9
P (c) 60 9
P (c|{b, v, c}) = = 14 11 9
= 34 = 26.47%.
P ({b, v, c}) 60 + 60 + 60
(c) A positive blood test is represented by the event {b, v}. A treatable disease is the
event {b, i}. Thus,
14
P ({b, i} ∩ {b, v}) P (b) 60 14
P ({b, i}|{b, v}) = = = 14 11
= 25 = 56%.
P ({b, v}) P ({b, v}) 60 + 60
7.5 Solutions to Exercises 249
(e) We are conditioning on {b, v} ∩ {b, v, c} = {b, v}; thus, we want P ({b, i}|{b, v})
which was calculated in Part (c) as 14
25 = 56%.
• If 2, 700 ≤ P < 3, 600 then only cars of qualities D will be offered for sale and thus
a potential buyer realizes that she would be buying a car worth $3,000 to her. Hence
– If 3, 000 < P < 3, 600 then potential buyers are not willing to buy and thus
there is no trading.
– If 2, 700 ≤ P ≤ 3, 000 then all (and only) the cars of quality D are traded.
• If P < 2, 700 no cars will be offered for sale and thus there is no trading.
3
p p
2q 8, 281 + 1 − 23 q 9, 281 = 136.5q + 96.34 − 144.51q = 96.34 − 8.01q.
√
The buyer will be willing to buy if 96.34 − 8.01q ≥ 9025 = 95, that is, if
q ≤ 0.1965. So the answer is: Yes all the values of q ≤ 0.1673.
(c) No. If q ≤ 0.1673 then both qualities θ = 1 and θ = 2 are traded and if q > 0.1673
then both qualities θ = 1 and θ = 2 are offered for sale, but buyers are not willing
to buy.
15 By q 3
the conditional probability rule, P(θ = 1|{θ = 1, θ = 2}) = 2 = 2q and
q+ 3 −q
2
P(θ = 2|{θ = 1, θ = 2}) = 3 −q = 1 − 23 q.
2
q+ 3 −q
7.5 Solutions to Exercises 251
Solution to Exercise 7.11. First of all, let us convert numbers into proportions:
Customer type: ($6, 1) ($6, 1.5) ($6.50, 1.5) ($6.50, 2.2) ($7, 2.2) ($7, 3.5)
1 1 2 3 2 3
Proportion: 12 12 12 12 12 12
Secondly, note that it cannot be profit maximizing to charge a price different from either
$6 or $6.50 or $7.16
• If you charge $6 then all the 240 people are interested in eating at your restaurant.
You will only be able to accommodate 50 of them. Assuming that the order of arrival
is random, the probability that a served customer is of any given type coincides
with the fraction of that type in the population. Thus letting in any one customer
corresponds to playing the following “consumption”’ lottery:
1 3 5 3
whose expected value is 12 1 + 12 (1.5) + 12 (2.2) + 12 (3.5) = 2.25. Hence the ex-
pected cost of a single customer is $(2 × 2.25) = $4.50 so that the expected profit
from a single customer is $(6 − 4.50) = $1.50. Hence your total expected profit is
$(1.50 × 50) = $75 .
• If you charge $6.50 then only 10 12 of the potential customers are interested in eating
at your restaurant, for a total of 10
12 240 = 200: still more than you can accommodate.
Assuming, again, that the order of arrival is random, the probability that a served
customer is of any given type coincides with the fraction of that type in the population
of interested customers. Applying the conditional probability rule, their types and
(updated) proportions are:
Customer type: (6.50, 1.5) (6.50, 2.2) (7, 2.2) (7, 3.5)
2 3 2 3
Fraction: 10 10 10 10
Thus, letting in any one customer corresponds to playing the following consumption
lottery:
consumption: 1.5 2.2 3.5
2 3 2 5 3
probability 10 10 + 10 = 10 10
whose expected value is 2.45. Hence the expected cost of a single customer is
$(2 × 2.45) = $4.90 so that the expected profit from a single customer is $(6.50 −
4.90) = $1.60. Hence your total expected profit is $(1.60 × 50) = $80 .
16 For example, if you charge $5.50 then your revenue will be $(5.50 × 50) = $275 and if you increase
your price to $6 then it will still be the case that at least 50 customers want to enter your restaurant and your
revenue will increase to $(6 × 50) = $300.
252 Chapter 7. Adverse Selection or Hidden Type
5
• If you charge $7 then only 12 of the potential customers are interested in eating at
5
your restaurant, for a total of 12 240 = 100: still more than you can accommodate.
Applying the conditional probability rule, their types and (updated) proportions are:
Thus, letting in any one customer corresponds to playing the following consumption
lottery:
consumption: 2.2 3.5
2 3
probability: 5 5
whose expected value is 2.98. Hence the expected cost of a single customer is $(2 ×
2.98) = $5.96 so that the expected profit from a single customer is
$(7 − 5.96) = $1.04. Hence your total expected profit is $(1.04 × 50) = $52 .
In Chapter 5 we considered the case of only one type of individual, with a constant
probability of loss. In Chapter 10 we will consider the case where the probability of loss
is affected by the individual’s behavior: a situation referred to as "moral hazard". In this
chapter we will continue to assume that the probability of loss is constant, that is, not
affected by the individual’s behavior, but we will consider the case where there are different
types of individuals, each type with a different probability of loss. In the case of two types,
one type has a higher probability of loss and will be called the high-risk type and the other
will be called the low-risk type. There is uncertainty on the part of the insurance company
due to do the insurance company’s inability to tell high-risk from low-risk individuals
apart (while being fully aware that it faces two different types of potential customers).
254 Chapter 8. Adverse Selection in Insurance
It follows that type-H individuals have steeper indifference curves than type-L individuals,
as shown in Figure 8.1. In fact, fix an arbitrary point C = (W1C ,W2C ) in the wealth plane.
The slope of the indifference curve going through this point is
pH U 0 (W1C )
− for type-H individuals
1 − pH U 0 (W2C )
and
pL U 0 (W1C )
− for type-L individuals.
1 − pL U 0 (W2C )
Since pL < pH ,
pL pH
< .
1 − pL 1 − pH
Wealth in
good state
W2
C
W2C
indifference curve
for L-type
indifference curve
for H-type
Wealth in
W1 bad state
0 W1C
Figure 8.1: The indifference curve through point C for type-H is steeper than the indiffer-
ence curve through point C for type-L.
8.2 Two types of customers 255
Since the H-type indifference curve is steeper than the L-type indifference curve at any
point, this must be true also at the no-insurance point, that is, the reservation indifference
curve for the H-type is steeper than the reservation indifference curve for the L-type, as
shown in Figure 8.2.1
Wealth in
good state
W2
NI 45o line
W0
W0 − hLmax FL
reservation indifference
W0 − hH
max FH curve for L-type
reservation indifference
curve for H-type
Wealth in
W1 bad state
0 W0 − `
Figure 8.2: The reservation indifference curve of an H-type is steeper than the reservation
indifference curve of an L-type.
It follows that the maximum premium that the L-type individuals are willing to pay for
full insurance, denoted by hLmax , is smaller than the maximum premium that the H-type
individuals are willing to pay for full insurance, denoted by hH L H
max : hmax < hmax , as shown
in Figure 8.2. Letting FL be the full-insurance contract that the L-type individuals consider
to be just as good as no insurance and FH the full-insurance contract that the H-type individ-
uals consider to be just as good as no insurance, we have that FL = W0 − hLmax ,W0 − hLmax
and FH = W0 − hH H L H
max ,W0 − hmax , with W0 − hmax > W0 − hmax .
1 Recallthat the reservation indifference curve is defined to be the indifference curve that goes through
the no-insurance point NI.
256 Chapter 8. Adverse Selection in Insurance
Let NH > 0 be the number of H-type individuals in the population and NL > 0 be the
number of L-type individuals and define
NH NL
qH = so that 1 − qH = . (8.1)
NH + NL NH + NL
8.2.1 The contracts offered by a monopolist who can tell individuals apart
As a benchmark, we shall first consider the case of a monopolist who is able to tell whether
an individual who applies for insurance is an H-type or and L-type. For example, a health
insurance company might be legally allowed to require applicants to submit to a DNA test
that reveals whether a defective gene is present, in which case the individual is more likely
to develop a particular disease requiring extensive medical care.
In the perfect-information case, from the point of view of the monopolist there are
effectively two separate insurance markets: one for the H-types and one for the L-types.
Then we can apply the analysis of Chapter 5 (Section 5.2.1) and conclude that the monop-
olist would offer the full-insurance contract FL to type-L individuals and the full-insurance
contract FH to type-H individuals so that its expected profits would be2
hH L
total profits: max − pH ` NH + hmax − pL ` NL (8.2)
hH L
profit per customer: max − pH ` qH + hmax − pL ` (1 − qH ) . (8.3)
Which type of individual is “better” for the insurance company, that is, which type
yields higher profits? The H-type is better in that she is willing to pay a higher premium
for full insurance, but on the other hand she will submit a claim with higher probability,
that is, the H-type yields higher revenue but also higher cost. Thus, in principle, either type
could be more profitable. The answer depends on the specific values of the parameters.
4p 6p √
1, 600 + 3, 600 = W .
10 10
2 To obtain profit per customer from total profits, divide by (NH + NL ) and use (8.1).
8.3 The monopolist under asymmetric information 257
8p 2p √
1, 600 + 3, 600 = W .
10 10
The solution is W = 1, 936 so that hH
max = 3, 600 − 1, 936 = $1, 664 and thus the
expected profit from an H-type is
8
1, 664 − (2, 000) = $64.
10
Thus in this case insuring an L-type is more profitable than insuring an H-type.
The shaded area in left pane of Figure 8.3 shows the set of insurance contracts that
are attractive to the H-type, in that they yield at least the reservation utility to this type
of individuals, while the shaded area in right pane shows the set of insurance contracts
that are attractive to the L-type, in that they yield at least the reservation utility to this type
of individuals. It is clear from Figure 8.3 that if an insurance contract is attractive to an
L-type then it is also attractive to an H-type.
W2 W2
L-type L-type
H-type H-type
W1 W1
0 0
Figure 8.3: The shaded area in the left pane shows the set of insurance contracts that are
acceptable to the H-type; the shaded area in the right pane shows the set of insurance
contracts that are acceptable to the L-type.
W2
NI 45o line
L-type
H-type FH
W1
0
Figure 8.4: Contracts in the shaded area are acceptable to the H-type but not to the L-type.
8.3 The monopolist under asymmetric information 259
Under Option 1, the monopolist caters only to one type of individuals, namely the
H-type, and thus we can use the analysis of Chapter 5 (Section 5.2.1) and conclude that,
in order to maximize its profits, it will offer the full-insurance contract that leaves the
H-type indifferent between insuring and not insuring (shown as point FH in Figure 8.4).
Let hH
max be the premium of this contract. Then the monopolist’s total profits under Option
1, denoted by π1 , will be
π1 = NH hH
max − pH ` . (8.4)
For example, if the individuals’ initial wealth is W0 = 3, 600, the potential loss is
√ 8
` = 2, 000, the utility-of-money function is U(m) = m and pH = 10 , then, as calculated
at the end of the previous section, hH
max = $1, 664 and the expected profit from a single
8
contract would be 1, 664 − 10 (2, 000) = $64, so that total profits would be $64NH .
We call the number [qH pH + (1 − qH ) pL ] the average probability of loss and denote it
by p̄:
Thus when both types are insured with the same contract, the relevant isoprofit line is a
straight line with slope − 1−p̄ p̄ ; we call isoprofit lines with this slope average isoprofit lines.
It follows from (8.7) that
pL p̄ pH
< < . (8.8)
1 − pL 1 − p̄ 1 − pH
Of course, it is still true that at a point on the 45o line the slope of the L-type indifference
pL
curve is − 1−p L
; however the straight line with this slope is no longer a relevant isoprofit
line: the relevant isoprofit line has a slope of − 1−p̄ p̄ and is thus steeper than the L-type
indifference curve at that point. Figure 8.5 shows this for the reservation indifference curve
of the L-type.
Wealth in
good state
W2
H-type
Wealth in
W1 bad state
0
Figure 8.5: The reservation indifference curve of the L-type is less steep, at a point on the
45o line, than the average isoprofit line, whose slope is − 1−p̄ p̄ .
8.3 The monopolist under asymmetric information 261
Given the relative slope of the L-type reservation indifference curve and the average
isoprofit line at a full-insurance contract, such a contract cannot be profit-maximizing
under Option 2: there will be contracts to the left of it (thus partial-insurance contracts) that
are acceptable to the L-type (and thus to both types) and are below that average isoprofit
line (and thus yield higher expected profits). Of course, this argument applies to any
contract where the average isoprofit line is steeper than the L-type reservation indifference
curve. Hence, the profit-maximizing choice for the monopolist under Option 2 is that
contract on the L-type reservation indifference curve where there is a tangency between
the indifference curve and the average isoprofit line, as shown in Figure 8.6.
Wealth in
good state
W2
NI 45o line
C
L-type
p̄
line with slope − 1− p̄
H-type
Wealth in
W1 bad state
0
Figure 8.6: C is the profit-maximizing contract for the monopolist under Option 2.
The existence of such a contract is guaranteed if and only if the slope of the L-type
reservation indifference curve at the no-insurance point is, in absolute value, greater than
p̄ 3
1− p̄ . On the other hand, if the slope of the L-type reservation indifference curve at the
no-insurance point is, in absolute value, less than 1−p̄ p̄ , then Option 2 cannot yield positive
profits. The reason for this is that the average isoprofit line that goes through the no-
insurance point is the zero-profit line and thus every contract that is acceptable to the
L-types will be above the zero profit line, which means that it would yield negative profits.
3 If p̄
this condition is satisfied, then at NI the slope of the indifference curve is larger than 1− p̄ and at a
p̄
point on the 45o line it is smaller than 1− p̄ ; thus, by the Intermediate Value Theorem, there must be a point
along the curve where it is equal.
262 Chapter 8. Adverse Selection in Insurance
If Option 2 is profitable, the profit-maximizing contract under this option, let us denote
it by C = (W1C ,W2C ), is given by the solution to the following equations:
U 0 (W1C )
pL p̄
= . (8.10)
1 − pL U 0 (W2C ) 1 − p̄
Equation (8.9) states that an L-type individual is indifferent between contract C and no
insurance, that is, contract C lies on the reservation indifference curve for the L-type;
equation (8.10) states that, at point C, the reservation indifference curve of the L-type is
tangent to (has the same slope as) the average isoprofit line.
As an example, let us revisit the case considered at the end of Section 8.2.1 where
√ is W0 = 33, 600, potential
the individuals’ initial wealth
8
loss is ` = 2, 000, the utility-of-
money function is U(m) = m, pL = 10 and pH = 10 ; furthermore, let NH = 2, 400 and
8
NL = 3, 900, so that qH = 21 . Thus the average probability of loss is
8 8 13 3 103
p̄ = + = .
21 10 21 10 210
Since !
3 1
U 0 (1, 600)
pL 10 80 9
= = = 0.6429
1 − pL U 0 (3, 600) 7
10
1
120
14
and
103
p̄ 210
= 107
= 0.9626,
1 − p̄ 210
inequality (8.11) is not satisfied and thus Option 2 is not profitable.
Let us now change the value of NL from 3,900 to 44,000: NL = 44, 000 (while everything
2,400 3
else remains as above). Then qH = 2,400+44,000 = 58 ; thus
3 8 55 3 189 p̄ 189
p̄ = + = so that = = 0.483
58 10 58 10 580 1 − p̄ 391
and thus inequality (8.11) is satisfied and Option 2 is profitable. The profit-maximizing
contract under Option 2, denoted by C = (W1C ,W2C ), is given by the solution to the following
equations (which correspond to equations (8.9) and (8.10)):
8.3 The monopolist under asymmetric information 263
3 7 3p 7p
q q
W1C + W2C = 1, 600 + 3, 600
10 10 10 10
1
3 √ C 189
10 2 W1 580
7 1
= 391
.
10
√ C 580
2 W2
The solution is W1C = 2, 456.53 and W2C = 3, 124.97, that is, C = (2456.53, 3124.97); the
corresponding premium is 3, 600 − 3, 124.97 = $475.03 and the deductible is 3, 124.97 −
2, 456.53 = $668.44 so that the expected profit from a single contract is
189
475.03 − p̄(2, 000 − 668.44) = 475.03 − (1, 331.56) = $41.13
580
and total expected profits are
The first condition, (IRH ), is called the Individual Rationality constraint for type H and
says that the H-types must consider the contract targeted to them to be at least as good as
no insurance. The second condition, (ICH ), is called the Incentive Compatibility constraint
for type H and says that the H-types must consider the contract targeted to them to be at
least as good as the other contract that is offered (namely CL ).
Similarly, in order for contract CL to be chosen by L-type individuals two conditions must
be satisfied:
The first condition, (IRL ), is the Individual Rationality constraint for type L: it says that the
L-types must consider the contract targeted to them to be at least as good as no insurance.
The second condition, (ICL ), is he Incentive Compatibility constraint for type L: it says
that the L-types must consider the contract targeted to them to be at least as good as the
other contract that is offered (namely CH ).
Wealth in
good state
W2
contract CL
NI 45o line
L-type
H-type H-type
contract CH
Wealth in
W1 bad state
0
Figure 8.7 shows a pair of contracts that satisfy all four constraints as strict inequalities:
(1) contract CH is strictly above the H-type reservation indifference curve and thus (IRH )
is satisfied as a strict inequality, (2) contract CL is to the left of the H-type indifference
curve that goes through contract CH and thus (ICH ) is satisfied as a strict inequality,
(3) contract CL is strictly above the L-type reservation indifference curve and thus (IRL ) is
8.3 The monopolist under asymmetric information 265
satisfied as a strict inequality and (4) contract CH is worse than contract CL for the L-type
(indeed, it is even worse than no insurance).
On the other hand, Figure 8.8 shows a pair of contracts where the (ICH ) and (IRL )
constraints are satisfied as equalities while the other two constraints are satisfied as strict
inequalities.
Wealth in
good state
W2
NI 45o line
CL
CH L-type
H-type H-type
Wealth in
W1 bad state
0
Figure 8.8: Another pair of contracts that satisfy the four constraints.
From now on, we will assume that, if indifferent between contract CH and contract
CL the H-types will choose contract CH . Furthermore, we will continue to assume that, if
indifferent between insuring and not insuring, each individual will choose to insure.
If the monopolist offers a menu of two contracts, CH and CL , that satisfy the four
constraints, then the H-types will purchase contract CH = (hH , dH ) and the L-types will
purchase contract CL = (hL , dL ) and thus the monopolist’s expected total profits will be
π3 = NH [hH − pH (` − dH )] + NL [hL − pL (` − dL )] .
Thus the monopolist, under Option 3, faces the following maximization problem:
Max π3 = NH [hH − pH (` − dH )] + NL [hL − pL (` − dL )]
hH ,dH ,hL ,dL
subject to
(IRH ) EH [U(CH )] ≥ EH [U(NI)]
(ICH ) EH [U(CH )] ≥ EH [U(CL )]
(IRL ) EL [U(CL )] ≥ EL [U(NI)]
(ICL ) EL [U(CL )] ≥ EL [U(CH )]
266 Chapter 8. Adverse Selection in Insurance
It follows that the (IRH ) constraint can be derived from the (IRL ) and (ICH ) constraints:
- by (ICH ), EH [U(CH )] ≥ EH [U(CL )] and this, together with (8.13) yields the (IRH )
constraint: EH [U(CH )] ≥ EH [U(NI)].
Thus:
First observation: the (IRH ) constraint is redundant.
subject to:
EH [U(CH )]
z }| {
(ICH ) pH U (W0 − hH − dH ) + (1 − pH )U (W0 − hH )
≥ pH U (W0 − hL − dL ) + (1 − pH )U (W0 − hL )
| {z }
EH [U(CL )]
EL [U(CL )]
z }| { (8.14)
(IRL ) pLU (W0 − hL − dL ) + (1 − pL )U (W0 − hL )
EL [U(CL )]
z }| {
(ICL ) pLU (W0 − hL − dL ) + (1 − pL )U (W0 − hL )
To see this, start with two contracts (CH ) and (CL ) that satisfy the above three constraints
and suppose that EH [U(CH )] > EH [U(CL )]. Modify contract CH by increasing the premium
hH up to the point where (ICH ) is satisfied as an equality, that is, up to the point where
EH [U(CH )] = EH [U(CL )].4 Then profits will increase, since π3 is increasing in hH , and
thus the initial pair {CH ,CL } could not have been a solution to the maximization problem.5
So far we have concluded that the solution to the initial constrained optimization
problem requires that the two contracts CH and CL must be on the same indifference curve
for the H-type.
To see this, suppose that contract CH were above contract CL , as shown in Figure 8.9.
Wealth in
good state
W2
45o line
CH
L-type
CL
H-type
Wealth in
W1 bad state
0
Figure 8.9: Contract CH – which, by the second observation, must be on the same H-type
indifference curve as contract CL – cannot be above contract CL .
Then we can draw the indifference curve of the L-type that goes through contract CH : it
will be less steep than the indifference curve of the H-type and thus contract CL will be
below this L-type indifference curve, implying the the L-type would strictly prefer contract
CH to contact CL , contradicting the incentive compatibility constraint for the L-type, (ICL ).
4 The right-hand side of (ICH ) is independent of hH while the left-hand side is decreasing in hH .
5 Note that an increase in hH does not affect the (IRL ) constraint (both sides of it are independent of
hH ), while it reinforces the (ICL ) constraint, since the left-hand side of (ICL ) is independent of hH , while
the right-hand side is decreasing in hH ; thus if the (ICL ) constraint was satisfied to start with, then it will
continue to be satisfied after the increase in hH .
268 Chapter 8. Adverse Selection in Insurance
Thus, by the second and third observation, the two contracts CH and CL must be on the
same H-type indifference curve, with CL above CH , as shown in Figure 8.10.6
Wealth in
good state
W2
45o line
CL
L-type
CH H-type
To see this, suppose that CH is not a full-insurance contract, that is, suppose that it lies
above the 45o line. Then, as we know from Chapter 5, the H-type indifference curve is
pH
steeper at point CH than the isoprofit line with slope − 1−p H
, as shown in Figure 8.10. This
is indeed a relevant isoprofit line, because – by the (ICL ) constraint – contract CH will be
purchased only by the H-types. Hence there are points below contract CH , on the H-type
indifference curve, that will yield higher profits to the monopolist, since any such contract
would still be bought only by the H-types.7
6 To reach this conclusion one also needs to rule out the possibility that C = C . This is a consequence
H L
of the fourth observation below: starting from CH = CL above the 45o line, the monopolist could increase its
profits by separating CH from CL and moving it, along the H-type indifference curve, towards the 45o line;
on the other hand, if – to start with – CH = CL is already on the 45o line, then, by the (ICL ) constraint it must
be on or above the reservation indifference curve for the L-type and we know from the analysis of Option 2
that this is not a profit-maximizing configuration.
7 Moving contract C towards the 45o line, along the H-type indifference curve, will not alter the (IR )
H L
constraint (which is independent of hH and dH ) and will make contract CH even less attractive than contract
CL for the L-type, that is, the (ICL ) constraint will still hold.
8.3 The monopolist under asymmetric information 269
To see this, consider the situation depicted in Figure 8.11 where contract CL is above
the reservation indifference curve of the L-type (and – in accordance with the previous
observations – CL and CH lie on the same indifference curve of the H-type; furthermore,
CH is a full-insurance contract, that is, it lies on the 45o line). Draw the isoprofit line with
pL
slope − 1−p L
that goes through contract CL . We know from Chapter 5 that the H-type
pH
indifference curve at point CL is steeper than the line with slope − 1−p H
which, in turn, is
pL
steeper than the line with slope − 1−pL . Thus there are points on the H-type indifference
curve (to which both CL and CH belong) that are below this isoprofit line. Modify contract
CL by moving it along the H-type indifference curve up to the point where it intersects the
reservation indifference curve of the L-type, that is, until the (IRL ) constraint is satisfied
as an equality. Then the (ICH ) constraint is not affected (it is still satisfied as an equality)
and the (ICL ) constraint is also not affected, since contract CH is still worse, for the L-type,
than the modified contract CL . Thus the new CL contract is still purchased only by the
L-types and hence yields higher profits to the monopolist than the original CL contract
pL
(since the new contract is below the isoprofit line with slope − 1−p L
that goes through the
original contract).
Wealth in
good state
W2
isoprofit line with
pL
slope − 1−p
NI L 45o line
CL
L-type
H-type
CH
Wealth in
W1 bad state
0
Figure 8.11: Contract CL cannot be above the L-type reservation indifference curve.
270 Chapter 8. Adverse Selection in Insurance
Fix an arbitrary premium hH for the full-insurance contract CH (thus dH = 0), such
that CH lies on the segment of the 45o line between FH and FL , where FH is the point of
intersection of the reservation indifference curve of the H-type and the 45o line and FL is
the point of intersection of the reservation indifference curve of the L-type and the 45o line:
see Figure 8.12. Then the contract CL at the intersection of the reservation indifference
curve of the L-type and the indifference curve of the H-type that goes through contract CH
is uniquely determined; hence we can think of this contract CL as a function of hH :
Wealth in
good state
W2
NI 45o line
CL
L-type
H-type FL
H-type CH
FH
Wealth in
W1 bad state
0
Then the constrained maximization problem (8.14) can be reduced to the following
unconstrained maximization problem:
Max π3 (hH ) = NH (hH − pH `) + NL [hL (hH ) − pL (` − dL (hH ))] (8.16)
hH ∈[hLmax ,hH
max ]
where hLmax is the maximum premium that the L-type is willing to pay for full insurance
(the premium of contract FL in Figure 8.12) and hH max is the maximum premium that the
H-type is willing to pay for full insurance (the premium of contract FH in Figure 8.12).
R Note that one possible choice under Option 3 is to set hH = hHmax , that is, to choose
CH = FH (see Figure 8.12), in which case the corresponding contract for the L-type
is the trivial contract with hL = 0 and dL = `, that is, CL = NI. This amounts to
insuring only the H-types, with the full-insurance contract that was obtained as
the profit-maximizing contract under Option 1. Thus Option 1 is a special case of
Option 3.
Before we discuss the solution to (8.16) in general, let us consider a numerical example.
√ W0 = 1, 600, poten-
Let the following be common to all individuals: initial wealth
tial loss ` = 700 and vNM utility-of-money function U(m) = m. The H-types face a
1
probability of loss pH = 15 while the L-types face a probability of loss pL = 10 .
• To find hH
max solve the equation
1√ 4p p
900 + 1, 600 = 1, 600 − h.
5 5
The solution is hH
max = 156.
Given hH ∈ [79, 156], contract CL is given by the solution to the following pair of equations:
1
√ 9√ 1√ 9√
10 900 + 10 1, 600 = 10 1, 600 − hL − dL + 10 1, 600 − hL
(8.17)
1√ 4√ √
5 1, 600 − hL − dL + 5 1, 600 − hL = 1, 600 − hH .
The first equation states that CL lies on the reservation indifference curve of the L-type and
the second equation states that CL lies on the indifference curve of the H-type that goes
through the full-insurance contract CH with premium hH . The solution to (8.17) is
p
hL (hH ) = hH + 156 1, 600 − hH − 6, 084 (8.18)
p
dL (hH ) = 80 hH + 5, 460 1, 600 − hH − 219, 260. (8.19)
272 Chapter 8. Adverse Selection in Insurance
Let there be a total of N individuals and let qH be the fraction of type-H individuals (thus
NH = qH N), so that (1 − qH ) is the fraction of type-L individuals (thus NL = (1 − qH )N).
Then the monopolist’s objective is to choose that value of hH ∈ [79, 156] that maximizes
the function
π3 (hH ) = N [qH (hH − pH 700) + (1 − qH ) [hL (hH ) − pL (700 − dL (hH ))] (8.20)
where hL (hH ) and dL (hH ) are given by (8.18) and (8.19), respectively. The solution of this
maximization problem depends on the value of qH , that is, on how many H-types there are
relative to the L-types.
1
For example, if qH = 20 then the graph of the profit function (8.20) is shown in Figure
8.13. The profit-maximizing value of hH is an interior point of the interval [79, 156], namely
96.64. Replacing this value in (8.18) and (8.19) we obtain hL = 61.26 and dL = 172.80.
Thus the monopolist offers a full-insurance contract CH = (hH = 96.64, dH = 0) and
a partial-insurance contract CL = (hL = 61.26, dL = 172.80) and the H-types purchase
contract CH while the L-types purchase contract CL . We call such a situation a separating
equilibrium, since the monopolist – through the menu of contracts it offers – is able to
induce a separation of the types: all the individuals of one type purchase one contract and
all the individuals of the other type purchase the other contract.
π3 (hH )
5.94N
5.5N
0.8N
hH
79 96.64 156
1
Figure 8.13: The graph of the profit function (8.20) when qH = 20 .
8.3 The monopolist under asymmetric information 273
Now consider the case where qH = 15 . For this case the graph of the profit function
(8.20) is shown in Figure 8.14. It is clear from the graph that in this case the solution of the
profit-maximizing problem is a corner solution at hH = hH
max = 156 and only the H-types
insure (the contract targeted to the L-types is the trivial contract with hL = 0 and dL = 700:
see the remark on page 271).
π3 (hH )
3.2N
hH
79 156
−5N
~ When qH is “sufficiently close to 1” (that is, when the number of H-types is large
relative to the entire population of potential customers), the monopolist will choose
to insure only the H-types by offering the full-insurance contract with premium hHmax
(that is, the solution to the profit maximization problem (8.16) given on page 271
is the corner solution, as illustrated in Figure 8.14). As a consequence, the (IRH )
constraint is satisfied as an equality.
We want to show that π2∗ < π3∗ , where π3∗ is the maximum profit that the monopolist
can make under Option 3.
Figure 8.15 shows the contract, denoted by B, that maximizes profits under Option 2.
If the monopolist offers only this contract, then both types of individuals will purchase it.
Suppose that the monopolist switches from a one-contract menu containing only contract
B to a two-contract menu {B, F} obtained by adding to contract B also the full-insurance
contract F given by the intersection of the H-type indifference curve that goes through B
and the 45o line, as shown in Figure 8.15. Then the L-types will continue to buy contract B
(since it is strictly better for them that the newly added contract F), while the H-types will
switch to contract F.9 Thus profits from the L-types will not change, but profits from the
H-types will increase.10 The pair of contracts {B, F} satisfies all the constraints considered
under Option 3 (namely, IRH , ICH , IRL , ICL ) and yields profits that are larger than π2∗ ;
thus π3∗ (which may be even larger) is greater than π2∗ .
Wealth in
good state
W2
NI 45o line
L-type
H-type
p̄
line with slope − 1− p̄
F
Wealth in
W1 bad state
0
9 As usual, we assume that, if indifferent, the H-types will choose the contract targeted to them. Otherwise
the newly offered contract would have to be slightly above contract F on the 45o line.
10 Because of the usual argument that at a point above the 45o line, such as point B in Figure 8.15, the
pH
H-type indifference curve is steeper than the isoprofit line with slope − 1−p H
, which is the relevant isoprofit
line if we consider contracts on the portion of the H-type indifference curve between point B and point F,
since those contracts would be purchased only by the H-types.
276 Chapter 8. Adverse Selection in Insurance
2. there is no unexploited profit opportunity in the industry, that is, there is no currently
not offered contract that would yield positive profits to a (existing or new) firm that
offered that contract.
In the one-type case we saw that at the free-entry competitive equilibrium every firm
offers the full-insurance contract that lies at the intersection of the zero-profit line and the
45o line. Recall that the no-insurance point NI can be thought of as a trivial contract with
zero premium and deductible equal to the full loss `: such a “contract” obviously yields
zero profits; thus the zero-profit line goes through point NI. In the two-type context the
situation is complicated by the fact that there are three zero-profit lines:
pL
• a “low-risk” line (through NI) with slope − 1−p L
, which is a relevant isoprofit line
if and only if the contracts on this line are sold only to the low-risk individuals,
p̄ = qH pH + (1 − qH )pL ,
which is a relevant isoprofit line if and only if the contracts on this line are sold to
both types of individuals,
pH
• a “high- risk” line (through NI) with slope − 1−p H
, which is a relevant isoprofit line
if and only if the contracts on this line are sold only to the hight-risk individuals.
Since 0 < pL < pH < 1 and 0 < qH < 1, pL < p̄ < pH and thus
pL p̄ pH
< < . (8.21)
1 − pL 1 − p̄ 1 − pH
Hence the low-risk zero-profit line is less steep that the average risk zero-profit line, which,
in turn, is less steep than the high-risk zero-profit line.
8.4 A perfectly competitive insurance industry 277
W2
NI 45o line
pL
line with slope − 1−p L
pH
line with slope − 1−p H
W1
0
We also note that, at any insurance contract, the slope of the H-type indifference curve
pH
is – in absolute value – greater than or equal to 1−p H
and thus, by (8.21), it is greater than
the absolute value of the slope of the average isoprofit line through that point, as shown in
Figure 8.17.
W2
p̄
line with slope − 1− p̄
W1
0
Figure 8.17: At any (full- or partial-) insurance contract the H-type indifference curve is
steeper than the average isoprofit line.
278 Chapter 8. Adverse Selection in Insurance
In what follows we shall assume that the reservation indifference curve of the L-type is
steeper, at the no-insurance point, than the average zero-profit line, that is,
U 0 (W0 − `)
pL p̄
> . (8.22)
1 − pL U 0 (W0 ) 1 − p̄
Inequality (8.22) ensures that it is possible to make positive profits by insuring both types
of individuals with the same contract.11
• a pooling equilibrium where all the firms in the industry offer the same contract,
which is bought by both types of individuals, or
• a separating equilibrium where two contracts are offered in the industry: one
contract, denoted by CH , which is purchased only by the H-types and the other
contract, denoted by CL , which is purchased only by the L-types.
hB − p̄ (` − dB ) = 0. (8.23)
Consider the function πL (h, d) = h − pL (` − d) that gives, for every insurance contract
(h, d) the expected profit from that contract if it is bought only by the low-risk individuals,
that is, only by the L-types. Since pL < p̄, pL (` − dB ) < p̄(` − dB ) and thus it follows
from (8.23) that
Since the function πL (h, d) is a continuous function, it follows from (8.24) that, for every
insurance contract A = (hA , dA ),
Thus, if we can find a contract, close to B, that would be considered better than B by the
L-types, but worse than B by the H-types, then any firm that introduced contract A would
attract only the L-types and thus, by (8.25), it would make positive profits, contradicting
the second requirement of the definition of a free-entry competitive equilibrium. Does
such a contract exist?
11 By (8.22) there are contracts that are above the reservation indifference curve of the L-type (and thus
attractive to both types) and below the average zero-profit line (and thus yielding positive profits if purchased
by both types).
8.4 A perfectly competitive insurance industry 279
The answer is affirmative, as shown in Figure 8.18: draw the indifference curves of the
two types that go through contract B. The L-type indifference curve is less steep than the
H-type indifference curve and thus there are contracts, such as contract A in Figure 8.18,
which are below the H-type indifference curve through B and above the L-type indifference
curve through B. Thus if such a contract A were to be introduced, the L-types would
switch from the original contract B to the new contract A while the H-types would stay
with contract B. Hence, by (8.25), the firm that introduced contract A would make positive
profits, contradicting the second requirement of the definition of free-entry competitive
equilibrium.
Wealth in
good state
W2
45o line
A
B
L-type
H-type
Wealth in
W1 bad state
0
Figure 8.18: If, initially, both types purchase contract B and then contract A is added as an
option, then the H-types will not switch to A while the L-types will.
Wealth in
good state
W2
NI A
CB
low-risk zero-profit line
high-risk
zero-profit CH
line
Wealth in
W1 bad state
0
Figure 8.19: Where on the low-risk zero-profit line could contract CL be?
• Suppose that contract CL is on the low-risk zero-profit line to the right of point C,
such as point B in Figure 8.19. Then such a contract B would be preferred to CH by
both types,12 giving rise to a situation where all types purchase the same contract,
which – as we saw above – cannot be a free-entry competitive equilibrium.
12 B is preferred to CH by type H because B is to the right of the H-type indifference curve through CH and
B is preferred to CH by type L because CH is below the L-type indifference curve through B: this indifference
curve is not shown in Figure 8.19, but it is less steep than the indifference curve of the H-type through B
(also not shown); hence, since CH is below the latter, it is also below the former.
8.4 A perfectly competitive insurance industry 281
• Suppose that contract CL is on the low-risk zero-profit line to the left of point C, such
as point A in Figure 8.19. Then – by the usual argument based on the observation that
at such a point the L-type indifference curve is steeper than the low-risk zero-profit
line – there would be contracts below that zero profit line and above the indifference
curve of the L-type that goes through A that would attract the L-types, and only the
L-types, and yield positive profits, contradicting the first requirement of a free-entry
competitive equilibrium.
Thus we conclude that contract CL must be at the intersection of the low-risk zero-profit
line and the indifference curve of the H-type that goes through contract CH , that is, it must
coincide with point C in Figure 8.19.
We have determined that necessary conditions for a pair of contracts {CH ,CL } to be a
free-entry competitive equilibrium are:
(1) CH is at the intersection of the high-risk zero-profit line and the 45o line, and
(2) CL is at the intersection of the low-risk zero-profit line and the H-type indifference
curve through CH .
While necessary, the above two conditions are not sufficient for a free-entry competitive
equilibrium. To see this, consider the situation depicted in Figure 8.20.
Wealth in
good state
W2
NI CL
low-risk zero-profit line
D average zero-profit line
high-risk L-type
zero-profit
line
CH
Wealth in
W1 bad state
0
Figure 8.20: A case where the pair of contracts {CH ,CL } is not a free-entry competitive
equilibrium because of the existence of contracts like D.
282 Chapter 8. Adverse Selection in Insurance
Suppose that the contracts currently offered in the industry are CH and CL and one of the
existing firms, or a new firm, introduces contract D. The H-types will switch to contract D,
since it is better than the contract that they are currently purchasing, namely CH (D is to
the right of the H-type indifference curve through CH ) and the L-types will also switch to
D, since it is better than the contract that they are currently purchasing, namely CL (D is to
the right of the L-type indifference curve through CL ). Then, for the firm that introduced
contract D, the relevant isoprofit lines are the average isoprofit lines; since contract D is
below the average zero-profit line, it yields positive profits to the firm, contradicting the
second requirement of a free-entry competitive equilibrium.
Thus, in order for the two-contract configuration {CH ,CL } described above to be a
free-entry competitive equilibrium, it is also necessary that there be no contracts such as
contract D described above, that is, it must not be the case that the average zero-profit line
crosses the L-type indifference curve that goes through contract CL . In other words, the
average zero-profit line must be entirely below the L-type indifference curve through CL ,
as shown in Figure 8.21.
Since the higher the value of qH (that is, the larger the number of H-types in the
population relative to the number of L-types), the closer the average zero-profit line will be
to the high-risk zero-profit line, this additional requirement for the existence of a free-entry
competitive equilibrium can be understood in terms of qH being “sufficiently large”.
W2
NI CL
low-risk zero-profit line
high-risk L-type
zero-profit average zero-profit line
line
CH
0 W1
Figure 8.21: The pair of contracts {CH ,CL } is a free-entry competitive equilibrium.
8.5 Exercises
The solutions to the following exercises are given in Section 8.6 at the end of this chapter.
Exercise 8.1 Sara and Mary have the same initial wealth of $400 and face the same
potential loss of $280. They also have the same vNM utility-of-money function
U(m) = ln(m). They differ, however, in the probability of loss, which is pS = 12
for Sara and pM = 15 for Mary.
(a) Calculate the slope of Sara’s reservation indifference curve at the no-insurance
point.
(b) Calculate the slope of Mary’s reservation indifference curve at the no-insurance
point.
(c) Find the maximum premium that Sara would be willing to pay for full insurance
and calculate the insurance company’s expected profit from selling that full-
insurance contract to Sara.
(d) Find the maximum premium that Mary would be willing to pay for full insurance
and calculate the insurance company’s expected profit from selling that full-
insurance contract to Mary.
Exercise 8.2 Diana and Fran have the same initial wealth of $4,096 and face the same
potential√loss of $2,800. They also have the same vNM utility-of-money function
U(m) = m. They differ, however, in the probability of loss, which is pD = 14 for Diana
1
and pF = 16 for Fran.
Consider a monopolist seller of insurance who knows all of the above information about
Diana and Fran.
(a) What insurance contract would the monopolist offer to Diana?
(b) What insurance contract would the monopolist offer to Fran?
(c) Assume that, when indifferent between not insuring and insuring, both Diana
and Fran will choose to insure. Calculate the monopolist’s expected profit from
selling insurance to Diana and Fran.
284 Chapter 8. Adverse Selection in Insurance
8.5.2 Exercises for Section 8.3: The monopolist under asymmetric information
Exercise 8.3 There are 16,000 individuals, all identical in terms of initial wealth
W0 = $4, 000, potential
loss ` = $2, 500 and vNM utility-of-money function
m
U(m) = 10 ln 1,000 . Of these individuals, 12,000 have a high probability of loss
pH = 51 and 4,000 have a low probability of loss pL = 15 1
. A monopolist seller of in-
surance is considering using Option 1 (Section 8.3.1). What will its expected total
profits be? [As usual, assume that – if indifferent between insuring and not insuring –
individuals will choose to insure.]
Exercise 8.4 Consider again the case described in Exercise 8.3: there are 16,000 individ-
uals, all identical in terms of initial wealth W0 = $4, 000, potential loss
m
` = $2, 500 and vNM utility-of-money function U(m) = 10 ln 1,000 ; of these individ-
1
uals, 12,000 have a high probability of loss pH = 5 and 4,000 have a low probability of
1
loss pL = 15 .
(a) Show that, for a monopolist seller of insurance, Option 2 (Section 8.3.2) is not
profitable.
(b) Calculate what the monopolist’s total expected profits would be if it offered a
full-insurance contract that makes the L-type indifferent between insuring and
not insuring. [As usual, assume that – if indifferent between insuring and not
insuring – individuals will choose to insure.]
Exercise 8.5 Consider again the case described in Exercise 8.3, but reduce the number
of high-risk individuals from 12,000 to 6,000; thus there are 10,000 individuals, all
identical in terms of initial wealth W0 = potential loss ` = $2, 500 and vNM
$4, 000,
m
utility-of-money function U(m) = 10 ln 1,000 ; of these individuals, 6,000 have a high
1 1
probability of loss pH = 5 and 4,000 have a low probability of loss pL = 15 .
(a) Show that, for a monopolist seller of insurance, Option 2 (Section 8.3.2) is
profitable, by calculating the slopes of the relevant curves at the no-insurance
point.
(b) Write two equations, whose solution gives the contract that maximizes the mo-
nopolist’s profits under Option 2. If you are able to, compute the solution and
determine the monopolist’s expected total profits if it offers that contract. [As
usual, assume that – if indifferent between insuring and not insuring – individuals
will choose to insure.]
8.5 Exercises 285
Exercise 8.6 There are 6,000 individuals, all with the same initial wealth W0 = 16, 000,
potential loss ` = 7, 000 and with the same vNM utility-of-money
facing the same √
function U(m) = m. Of these 6,000 individuals, 1,000 are high-risk with a probability
2
of loss pH = 10 while the remaining 5,000 are low-risk with a probability of loss
1
pL = 10 .
(a) Calculate the monopolist’s total expected profits if it decides to pursue Option 1.
(b) Calculate the average probability of loss p̄.
(c) Write two equations whose solution gives the profit-maximizing contract under
Option 2.
(d) Suppose that, under Option 3, the monopolist decides to offer a full-insurance
contract with premium $1,400. Write a pair of equations whose solution gives
the other contract that the monopolist will offer.
(e) The solution of the pair of equations in Part (d) is the following contract, expressed
in term of wealth levels: C = (10169.41, 15832.48). Calculate the monopolist’s
profits if it offers the two contracts of Part (d).
Exercise 8.7 There are 8,000 individuals, all with the same initial wealth W0 = 10, 000,
facing the same potential loss ` = 6, 000 and with the same vNM utility-of-money
function U(m) = ln(m). Of these 8,000 individuals, 1,500 are high-risk with a proba-
bility of loss pH = 41 while the remaining 6,500 are low-risk with a probability of loss
1
pL = 16 .
(a) Calculate the monopolist’s total expected profits if it decides to pursue Option 1.
(b) Calculate the average probability of loss p̄.
(c) Write two equations whose solution gives the profit-maximizing contract under
Option 2.
(d) Suppose that, under Option 3, the monopolist decides to offer a full-insurance
contract with premium $2,000. Write a pair of equations whose solution gives
the other contract that the monopolist will offer.
(e) The solution of the pair of equations in Part (d) is the following contract, expressed
in term of wealth levels: C = (4120.36, 9980.26). Calculate the monopolist’s
profits if it offers the two contracts of Part (d).
286 Chapter 8. Adverse Selection in Insurance
Exercise 8.8 Consider again the information given in Exercise 8.6: there are 6,000
individuals, all with the same initial wealth W0 = 16, 000, facing the same potential
√
loss ` = 7, 000 and with the same vNM utility-of-money function U(m) = m; of
2
these 6,000 individuals, 1,000 are high-risk with probability of loss pH = 10 while the
1
remaining 5,000 are low-risk with probability of loss pL = 10 .
(a) Find the pair of contracts that is the only candidate for a free-entry perfectly
competitive equilibrium.
(b) Calculate the average probability of loss p̄.
(c) Write a pair of equations such that if it does not have a solution then the pair of
contracts of Part (a) is a free-entry perfectly competitive equilibrium.
(d) Suppose that the pair of equations in Part (c) does have a solution. Write a third
equation such that (1) if it is satisfied (at the solution of the other two equations),
then the pair of contracts of Part (a) is a free-entry perfectly competitive equi-
librium and (2) if it is not satisfied (at the solution of the other two equations),
then the pair of contracts of Part (a) is not a free-entry perfectly competitive
equilibrium.
Exercise 8.9 Consider again the information given in Exercise 8.7: there are 8,000
individuals, all with the same initial wealth W0 = 10, 000, facing the same potential
loss ` = 6, 000 and with the same vNM utility-of-money function U(m) = ln(m); of
these 8,000 individuals, 1,500 are high-risk with probability of loss pH = 14 while the
1
remaining 6,500 are low-risk with probability of loss pL = 16 .
(a) Find the pair of contracts that is the only candidate for a free-entry perfectly
competitive equilibrium.
(b) Calculate the average probability of loss p̄.
(c) Write a pair of equations such that: (1) if it has a solution then the pair of contracts
of Part (a) is not a free-entry perfectly competitive equilibrium and (2) if it does
not have a solution then the pair of contracts of Part (a) is a free-entry perfectly
competitive equilibrium.
8.6 Solutions to Exercises 287
(c) To find the full-insurance contract that makes Sara indifferent between insuring and
not insuring we need to solve the equation ln(W ) = 12 ln(120) + 21 ln(400). The
solution is W = 219.09, so that the maximum premium that Sara is willing to pay
for full insurance is 400 − 219.09 = $180.91. If Sara purchases this full-insurance
contract then the insurance company’s expected profit is 180.91 − 12 (280) = $40.91.
(d) To find the full-insurance contract that makes Mary indifferent between insuring
and not insuring we need to solve the equation ln(W ) = 15 ln(120) + 45 ln(400). The
solution is W = 314.40, so that the maximum premium that Mary is willing to pay
for full insurance is 400 − 314.40 = $85.60. If Mary purchases this full-insurance
contract then the insurance company’s expected profit is 85.60 − 15 (280) = $29.60.
(a) The monopolist will offer Diana the full-insurance contract that makes her indif-
ferent
√ between
√ insuring
√ and not insuring, which is determined by the solution to
W = 14 1, 296 + 34 4, 096. The solution is W = 3, 249, so that the premium of
the full-insurance contract is 4, 096 − 3, 249 = $847. If Diana purchases this full-
insurance contract then the insurance company’s expected profit from this contract
is 847 − 14 (2, 800) = $147.
(b) The monopolist will offer Fran the full-insurance contract that makes her indif-
ferent
√ between insuring√and not insuring, which is determined by the solution to
1√
W = 16 1, 296 + 15
16 4, 096. The solution is W = 3, 875.06, so that the pre-
mium of the full-insurance contract is 4, 096 − 3, 875.06 = $220.94. If Fran pur-
chases this full-insurance contract then the insurance company’s expected profit is
1
220.94 − 16 (2, 800) = $45.94.
(c) The monopolist’s expected profit from insuring Diana and Fran is 147 + 45.94 =
$192.94.
288 Chapter 8. Adverse Selection in Insurance
(a) We need to show that the L-type reservation indifference curve is less steep at the
no-insurance point, than the average isoprofit line, that is,
0
pL U (W0 − `) p̄
0
≤ (8.26)
1 − pL U (W0 ) 1 − p̄
12,000
First we compute the average probability of loss p̄. Since qH = 16,000 = 43 ,
3 1 1 1 1
p̄ = + = .
4 5 4 15 6
Thus
p̄ 1
= = 0.2.
1 − p̄ 5
On the other hand,
10
U 0 (W0 − `)
pL 1 1,500 4
= = = 0.1905.
1 − pL U 0 (W0 ) 14 10
4,000
21
(a) We need to show that the L-type reservation indifference curve is steeper, at the
no-insurance point, than the average isoprofit line, that is,
0
pL U (W0 − `) p̄
> (8.27)
1 − pL U 0 (W0 ) 1 − p̄
6,000
First we compute the average probability of loss p̄. Since qH = 10,000 = 53 ,
3 1 2 1 11
p̄ = + = .
5 5 5 15 75
Thus
p̄ 11
= = 0.1719.
1 − p̄ 64
On the other hand,
10
U 0 (W0 − `)
pL 1 1,500 4
= = = 0.1905.
1 − pL U 0 (W0 ) 14 10
4,000
21
U 0 (W1 )
pL p̄
= .
1 − pL U 0 (W2 ) 1 − p̄
that is,
1 W1 14 W1 1 14
10 ln + 10 ln = 10 ln(1.5) + 10 ln(4)
15 1, 000 15 1, 000 15 15
10
!
1 11
15 W1 75
14 10
= 64
.
15 W2 75
The solution is W1 = 1, 650.99 and W2 = 3, 972.69. Thus the premium of the offered
contract is 4, 000 − 3, 972.69 = $27.31 and the deductible is 3, 972.69 − 1, 650.99 =
$2, 321.70. Both types purchase this contract. Thus the monopolist’s expected total
profits are:
11
10, 000 27.31 − (2, 500 − 2, 321.70) = $11, 593.33.
75
290 Chapter 8. Adverse Selection in Insurance
(a) First calculate the maximum premium that the H-types are willing to pay for full
insurance by solving
2p 8p p
9, 000 + 16, 000 = 16, 000 − h.
10 10
The solution is hHmax = $1, 560. Thus under Option 1 the monopolist would only
offer full insurance at a premium of $1,560, attracting only the H-types; its total
expected profits would be
2
1, 000 1, 560 − 7, 000 = $160, 000.
10
(b) The average probability of loss is
1 2 5 1 7
p̄ = qH pH + (1 − qH ) pL = + = .
6 10 6 10 60
(c) The equations whose solution gives the profit-maximizing contract under Option 2
are:
1p 9p 1p 9p
9, 000 + 16, 000 = 16, 000 − h − d + 16, 000 − h
10 10 10 10
1 √1 !
7
10 2 16,000−h−d 60
9
= .
√ 1 53
10 2 16,000−h 60
(d) The assumption is that CH = (hH = 1400, dH = 0). To find contract CL = (hL , dL )
solve the following equations:
2p 8p p
16, 000 − h − d + 16, 000 − h = 16, 000 − 1, 400
10 10
1 p 9p 1p 9p
16, 000 − h − d + 16, 000 − h = 9, 000 + 16, 000
10 10 10 10
(e) The two contracts are: CH = (hH = 1400, dH = 0), which will be bought by the
H-types, and CL = (hL = 167.52, dL = 5663.07), which will be bought by the L-
types.13 Thus the monopolist’s expected total profits will be
2 1
1, 000 1, 400 − 7, 000 +5, 000 167.52 − (7, 000 − 5, 663.07) = $169, 135.
10 10
13 The
contract was given in terms of wealth levels as (10169.41, 15832.48), from which we obtain the
premium as 16, 000 − 15, 832.48 = 167.52 and the deductible as 15, 832.48 − 10, 169.41 = 5, 663.07.
8.6 Solutions to Exercises 291
(a) First calculate the maximum premium that the H-types are willing to pay for full
insurance by solving
1 3
ln(4, 000) + ln(10, 000) = ln(10, 000 − h).
4 4
The solution is hHmax = $2, 047.29. Thus under Option 1 the monopolist would only
offer full insurance at a premium of $2,047.29, attracting only the H-types; its total
expected profits would be
1
1, 500 2, 047.29 − (6, 000) = $820, 935.
4
(d) The assumption is that CH = (hH = 2000, dH = 0). To find contract CL = (hL , dL )
solve the following equations:
1 3
ln(10, 000 − h − d) + ln(10, 000 − h) = ln(10, 000 − 2, 000)
4 4
1 15 1 15
ln(10, 000 − h − d) + ln(10, 000 − h) = ln(4, 000) + ln(10, 000)
16 16 16 16
(e) The two contracts are: CH = (hH = 2000, dH = 0), which will be bought by the H-
types, and CL = (hL = 19.74, dL = 5859.9), which will be bought by the L-types.14
Thus the monopolist’s expected total profits will be
1 1
1, 500 2, 000 − 6, 000 +6, 500 19.74 − (6, 000 − 5, 859.9) = $821, 394.38.
4 16
14 The
contract was given in terms of wealth levels as (4120.36, 9980.26), from which we obtain the
premium as 10, 000 − 9, 980.26 = 19.74 and the deductible as 9, 980.26 − 4, 120.36 = 5, 859.9.
292 Chapter 8. Adverse Selection in Insurance
(a) Contract CH is the full-insurance contract that yields zero profits if bought only by
the H-types. Thus its premium is given by the solution to hH − pH ` = 0, that is,
2
hH = 10 (7, 000) = $1, 400. Contract CL is given by the intersection of the H-type
indifference curve through CH and the low-risk zero-profit line; thus it is given by
the solution to
p 2p 8p
16, 000 − 1, 400 = 16, 000 − h − d + 16, 000 − h
10 10
1
h− (7, 000 − d) = 0
10
(c) We need to express the fact that there is a contract at which the average zero-profit
line intersects the L-type indifference curve through contract
CL = (hL = 109.32, dL = 5906.81):
7
h− (7, 000 − d) = 0
60
1p 9p
16, 000 − 109.32 − 5, 906.81 + 16, 000 − 109.32
10 10
1p 9p
= 16, 000 − h − d + 16, 000 − h.
10 10
(d) Let E = (hE , dE ) be a contract that is a solution to the two equations of Part (c),
that is, E lies both on the average zero-profit line and on the L-type indifference
curve through contract CL = (hL = 109.32, dL = 5906.81). The existence of such
a contract E is compatible with the fact that the pair of contracts of Part (a) is a
free-entry competitive equilibrium if and only if there is a tangency at contract E
between the indifference curve and the average zero-profit line, that is, if the two
slopes are the same: a condition that is expressed by the following equation (the
left-hand side is 1−p̄ p̄ and the right-hand side is the absolute value of the slope of the
indifference curve at contract E):
√
7 1 16, 000 − hE
= √ .
53 9 16, 000 − hE − dE
8.6 Solutions to Exercises 293
(a) Contract CH is the full-insurance contract that yields zero profits if bought only by
the H-types. Thus its premium is given by the solution to hH − pH ` = 0, that is,
hH = 14 (6, 000) = $1, 500. Contract CL is given by the intersection of the H-type
indifference curve through CH and the low-risk zero-profit line; thus it is given by
the solution to
1 3
ln(10, 000 − 1, 500) = ln(10, 000 − h − d) + ln(10, 000 − h)
4 4
1
h− (6, 000 − d) = 0
16
which is CL = (hL = 90.98, dL = 4544.31).
(b) The average probability of loss is
3 1 13 1 25
p̄ = qH pH + (1 − qH ) pL = + = .
16 4 16 16 256
(c) We need to express the fact that there is a contract at which the average zero-profit
line intersects the L-type indifference curve through contract
CL = (hL = 90.98, dL = 4544.31):
25
h− (6, 000 − d) = 0
256
1 15
ln(10, 000 − 90.98 − 4, 544.31) + ln(10, 000 − 90.98)
16 16
1 15
= ln(10, 000 − h − d) + ln(10, 000 − h).
16 16
IV
Asymmetric Information:
Signaling
9 Signaling . . . . . . . . . . . . . . . . . . . . . . . . . 297
9.1 Earnings and education
9.2 Signaling in the job market
9.3 Indices versus signals
9.4 More than two types
9.5 A more general analysis
9.6 Signaling in other markets
9.7 Exercises
9.8 Solutions to Exercises
9. Signaling
Thus, for example, in 2018 a worker with a Bachelor’s degree had an annual income
1 Thedata in Table 9.1 refers to persons of age 25 and over, who are full-time wage and salary workers.
Source: Current Population Survey, U.S. Department of Labor, U.S. Bureau of Labor Statistics, https:
//www.bls.gov/emp/tables/unemployment-earnings-education.htm . We have converted the data
from weekly to annual figures.
298 Chapter 9. Signaling
that was 64% higher than the income of a worker with only a High-school diploma, and
a Master’s degree translated into a further 20% increase in income. Given the positive
correlation between level of education and earnings, it is natural to ask the following
questions:
1. Why do some people choose lower levels of education than others?
2. Why do employers reward higher levels of education with higher salaries?
3. Does the availability of educational institutions make society better off?
The answer to the first question is that, while education offers clear benefits in terms of
earnings, it also involves direct and opportunity costs. The opportunity cost of pursuing
a college degree, for example, is the income that one foregoes during the four years of
college. The direct cost of a college degree is the amount of money that one spends during
those four years in tuition, books, etc., as well as the subsequent interest cost on loans
taken during the college years. For some individuals those costs are lower than for others:
for example, children of more affluent parents do not need to borrow in order to pay tuition
costs or are able to obtain zero-interest loans from family members. Furthermore, there are
also differential psychological costs associated with college life: some find studying more
difficult and more burdensome than others. Thus, for some, a cost-benefit calculation leads
to the decision to acquire a lower level of education, while others pursue higher levels of
education.
The answer to the second question seems obvious too: employers pay more educated
workers more, because those workers have become more productive and more valuable
to the employer as a consequence of the additional education. In other words, employers
recognize that there is an objective, causal relationship between education and productivity:
the more education you acquire, the more productive you become. But is there truly
such a causal relationship between education and productivity? Is it at all possible that
employers are objectively wrong in postulating such a relationship between education and
productivity? What if a person’s productivity was determined at birth by his/her genetic
makeup and education had no effect at all on productivity? Could this be possible?
Before the 1974 seminal contribution of Michael Spence,2 an economist would have
denied such a possibility by arguing as follows. It is certainly possible that employers
might start with some incorrect hypothesis about the relationship between education and
productivity, but then their experience would reveal that, sometimes, highly educated
employees turn out to be less productive than, or as productive as, less educated employees.
The employers would then adjust their beliefs, so that, with sufficient time and experience,
they would discover that their initial hypothesis was wrong and would therefore no longer
automatically offer higher salaries to more educated job applicants. Spence’s contribution
was to show the possibility self-confirming beliefs: the employers’ (objectively wrong)
beliefs induce them to make education-dependent wage offers, which - in turn - induce
prospective employees to make choices of education levels which then confirm the em-
ployers’ beliefs, thus generating an equilibrium where nobody has any reason to change
his/her actions and beliefs. Such a situation is called a signaling equilibrium and is the
object of the next section.
2 MichaelSpence, Market signaling: Informational transfer in hiring and related screening processes,
Harvard University Press, 1974.
9.2 Signaling in the job market 299
Years Corresponding
of schooling education certificate
y=6 Elementary school
y = 12 High school diploma
y = 16 Bachelor’s degree
y = 18 Master’s degree
y = 21 Ph.D. degree
Suppose that employers believe that education increases productivity, so that the more
educated a candidate is, the more valuable he/she is to the employer. These beliefs are
manifested through a wage schedule that associates higher salaries with higher levels of
education, as shown in the following table:
Thus, for example, a job candidate who only has a High-school diploma will be offered
a salary of $20,000, while a job candidate with a Bachelor’s degree will be offered a higher
salary, namely $25,000.
Now the reader is asked to imagine a world where education has no effect on a person’s
productivity. In particular, in this hypothetical world there are only two types of individuals:
Type H whose productivity is worth $30,000 to potential employers (and is unaffected by
education), and type L whose productivity is worth $20,000 to potential employers (and is
also unaffected by education). The proportion of individuals of type H in the population
is pH , with 0 < pH < 1, so that the proportion of type L is (1 − pH ). We assume that
each individual knows her own type, while a potential employer is unable to determine
the type of a job applicant. Thus we are in a situation of asymmetric information. One
300 Chapter 9. Signaling
last assumption is that the two types have different monetary costs of acquiring education,
which are as follows:
(
0 if y ≤ 6
For type H : CH (y) =
1, 000(y − 6) if y > 6
(
0 if y ≤ 6
For type L : CL (y) =
2, 000(y − 6) if y > 6
Thus every year of schooling beyond 6th grade costs $1,000 to an H-type and $2,000 to an
L-type.
Each type will choose that level of education that yields the largest net income. The
cost-benefit analysis for a type-H person is shown in Table 9.4.
y w CH w −CH
years in school gross wage cost net wage
y=6 $6,000 $0 $6,000
y = 12 $20,000 $6,000 $14,000
y = 16 $25,000 $10,000 $15,000
y = 18 $30,000 $12,000 $18, 000
y = 21 $32,000 $15,000 $17,000
Thus an H-type will maximize her net income by pursuing a Master’s degree (corre-
sponding to 18 years of schooling).
On the other hand, a type-L person faces the cost-benefit calculations shown in Table
9.5.
y w CL w −CL
years in school gross wage cost net wage
y=6 $6,000 $0 $6,000
y = 12 $20,000 $12,000 $8, 000
y = 16 $25,000 $20,000 $5,000
y = 18 $30,000 $24,000 $6,000
y = 21 $32,000 $30,000 $2,000
9.2 Signaling in the job market 301
Hence a type-L person will maximize his net income by obtaining a High-school
diploma (corresponding to 12 years of schooling).
In this hypothetical world type-L individuals, whose productivity is $20,000, will
present themselves at a job interview with a High-school diploma and, according to the
employers’ wage schedule (Table 9.3), they will be offered a wage of $20,000 which
matches their true productivity. On the other hand, type-H individuals, whose productivity
is $30,000, will present themselves at a job interview with a Master’s degree and, according
to the employers’ wage schedule, they will be offered a wage of $30,000 which matches
their true productivity. After hiring candidates according to the wage schedule shown in
Table 9.3 - which reflects their initial beliefs that more education causes higher productivity
- employers will observe that, indeed, those with a Master’s degree turn out to be more
productive than those who only have a High-school diploma, thus confirming their initial
(wrong) beliefs; furthermore, every employee is paid an amount that matches her/his true
productivity. Hence employers have no reason to change their beliefs and their hiring
practices. Since the wage schedule remains the same, both types have no reason to change
their education choices. Thus we have a situation of equilibrium. Following Spence, we
call such a situation a signaling equilibrium.
income is [pH 30, 000 + (1 − pH )20, 000] − 6, 000 (where $6,000 is her cost of obtaining a
High-school diploma). Thus, a type-H individual is made better off by the government
intervention if and only if
[pH 30, 000 + (1 − pH )20, 000] − 6, 000 > 18, 000,
that is, if and only if pH > 52 . That is, if type-H individuals constitute at least 40% of the
population, then eliminating all the institutions of higher education would lead to a Pareto
improvement!4
no effect on productivity, while if sH > sL then more education translates into higher
productivity (for both types if sL > 0 or only for the H type if sL = 0).5
As in Section 9.2.1, the additional crucial assumption is that education is more costly
for individuals of type L than for individuals of type H. Let the monetary cost of education
be:
For type H : cH y (9.3)
The term ‘separating’ means that the two types make difference choices of education.
1.
a + sH y∗ − cH y∗ ≥ b (9.5)
(this inequality says that a type H prefers to choose y = y∗ and be classified correctly
as type H, rather than choosing y = 0 and be classified incorrectly as type L, where
‘prefers’ means ‘obtains a higher net income’),
2.
b ≥ a + sH y∗ − cL y∗ (9.6)
(this inequality says that a type L prefers to choose y = 0 and be classified correctly
as type L, rather than choosing y = y∗ and be classified incorrectly as a type H, in
which case he would be paid a wage equal to a + sH y∗ ).
It should be noted that there may be a multiplicity of separating signaling equilibria (indeed,
even a continuum). For example, let us consider the case where a = 2, b = 1, sH = sL = 0,
cH = 12 and cH = 1 (thus, in this case, productivity is not affected by education). Then
inequalities (9.5) and (9.6) become:
2 − 12 y∗ ≥ 1
1 ≥ 2 − y∗
which are equivalent to 1 ≤ y∗ ≤ 2. Thus, for every y∗ ∈ [1, 2], the triple (y∗ , yH = y∗ , yL =
0) is a separating signaling equilibrium. These equilibria are Pareto ranked: the lower the
value of y∗ the better off individuals of type H are (since their net income is 2 − 12 y∗ ); on
the other hand, the L types are indifferent among all the equilibria, since their net income is
always 1 and the employers are also indifferent because their total wage bill is the same at
each of those equilibria, namely 2pH N + (1 − pH )N (where, as before, pH is the proportion
of type H in the population and N is the number of people employed), which, in turn is
equal to total productivity.
To reiterate the point made in Section 9.2.2, we show that also in this example it
is possible for a signaling equilibrium to be Pareto inefficient. Consider the alternative
situation where
(1) the possibility of signaling through education is removed, so that every individual is
forced to choose y = 0 and
(2) everybody is paid a wage equal to the average productivity of the population, which is
6 Whether we use weak or strict inequalities depends on what we assume in case an individual is indifferent.
Since we use weak inequalities, our implicit assumption is that, when indifferent between y = 0 and y = y∗ ,
a type H would choose y = y∗ while a type L would choose y = 0.
9.3 Indices versus signals 305
2pH + (1 − pH ) = 1 + pH .
Then employers are as well off in the new situation and type L individuals are better off
(their wage is now 1 + pH whereas in the previous situation it was 1). Type H individuals
are better off in the new situation if and only if 1 + pH > 2 − 12 y∗ , which is possible: for
example if y∗ = 1.5 and pH > 14 .
Thus employers’ beliefs embody a wrong view of the effect of education on productivity
(recall that productivity is assumed to be constant and thus independent of the amount of
education) as well as a prejudice against women (women are thought to be innately less
able to convert education into productivity).
We continue to assume, as in Section 9.2.1, that the two types of individuals have
different monetary costs of acquiring education:
0 if y ≤ 6
For type H (man or woman): CH (y) =
1, 000(y − 6) if y > 6
0 if y ≤ 6
For type L (man or woman): CL (y) =
2, 000(y − 6) if y > 6
Thus every year of schooling beyond 6th grade costs $1,000 to an H-type, whether it is a
man or a woman, and $2,000 to an L-type, whether it is a man or a woman.
9.3 Indices versus signals 307
The cost-benefit analysis for a man of type L is the same as in Table 9.5 in Section
9.2.1, with the conclusion that a man of type L would choose y = 12.
A woman of type-L faces the cost-benefit calculations shown in Table 9.7.
y w CL w −CL
years in school gross wage cost net wage
y=6 $6,000 $0 $6,000
y = 12 $20,000 $12,000 $8, 000
y = 16 $23,000 $20,000 $3,000
y = 18 $26,000 $24,000 $2,000
y = 21 $30,000 $30,000 $0
y w CL w −CL
years in school gross wage cost net wage
y=6 $6,000 $0 $6,000
y = 12 $20,000 $6,000 $14,000
y = 16 $23,000 $10,000 $13,000
y = 18 $26,000 $12,000 $14,000
y = 21 $30,000 $15,000 $15, 000
Hence we have a separating equilibrium where all type-L individuals (men and women)
choose to stop at a High-school diploma and are hired at a salary of $20,000, which
corresponds to their true productivity; type-H men choose to obtain a Master’s de-
gree and are hired at a salary of $30,000, which corresponds to their true productivity;
type-H women choose to obtain a Ph.D. degree and are hired at a salary of $30,000,
which corresponds to their true productivity. At this equilibrium employers never observe
308 Chapter 9. Signaling
men and women with the same graduate degree: if they did, they would realize that their
beliefs concerning women’s abilities were wrong and would revise those beliefs, thereby
abandoning their prejudices. What they do observe is that those whose productivity is
$30,000 are either men with 18 years of schooling or women with 21 years of schooling
and would thus find confirmation of their beliefs that “women need to spend more time in
school in order to achieve the higher level of productivity”. In other words, the employers’
doubly wrong beliefs are self-confirming at a separating equilibrium.
To sum up, the question that the presence of indices gives rise to is the following: can the
informational structure of the market bring about persistent and consistent discrimination
between objectively identical individuals? We considered the case where the unchangeable
attribute (index) is gender. We supposed that within each gender there were both low-
productivity and high-productivity individuals. Signaling costs (in our example, costs of
acquiring education) were different for low-productivity and high-productivity people, but
individuals with the same level of productivity had the same signaling costs, no matter
whether they were men or women. We also assumed that people (both men and women)
with the same productivity had the same preferences and the same objective: to maximize
their income net of education costs. Therefore the index (gender) should be absolutely
irrelevant: in principle, people with the same opportunities and the same preferences ought
to make similar decisions and end up in similar situations. The informational structure
of the market, however, can destroy this principle. If employers believe that gender
(besides education) is correlated with productivity, they might offer a wage schedule that
is differentiated on the basis of education and gender, thus presenting otherwise identical
individuals with different opportunity sets. As a consequence, the employer’s beliefs may
force high-productivity women to invest in education more than their male counterpart
(that is, more than high-productivity men). The reason why this situation can persist is
that employers will interpret the incoming data separately for the two groups of men and
women. If different levels of education were associated with the same observed level of
productivity within the same group, employers would be forced to revise their beliefs.
For example, if within the group of men different levels of education were accompanied
by the same observed productivity, then employers would conclude that, at least above
a certain level, education no longer increases productivity (at least for men). But since
men and women are judged separately and independently (data on men is not used to
classify women and vice versa), employers can consistently think that women need to
acquire more education than their male counterpart in order to compensate for a “genetic”
handicap. The employers’ beliefs induce women to invest more in education than men,
thereby confirming the prejudice that women require more education in order to achieve
the same level of productivity as men (despite the fact that those beliefs have no objective
grounds: they constitute, indeed, a prejudice). As a consequence, women end up being
over-qualified for their jobs as compared to men.
Suppose that employers set two threshold levels of education, y∗1 and y∗2 , and announce that
wage offers will depend on the level of education y of the applicant, as follows:
That is, applicants with y < y∗1 are presumed to be of type 1, applicants with y∗1 ≤ y < y∗2
are presumed to be of type 2 and applicants with y ≥ y∗2 are presumed to be of type 3.
For what values of y∗1 and y∗2 do we have a separating signaling equilibrium?
Since - except for the threshold values - increasing y increases the cost of education
but leaves the wage constant, each individual will only consider choosing either y = 0 or
y = y∗1 or y = y∗2 . At a separating equilibrium, type-1 individuals must choose y = 0 in
order to be paid a wage equal to their true productivity, type-2 individuals must choose
y = y∗1 and type-3 individuals must choose y = y∗2 ; furthermore, it must be in their interest
to do so. Thus we need two inequalities to be satisfied for each type.
Let us begin with Type 1. The following table shows the net wage for a type-1 individual
for each of the three choices of education.
Type 1
choice wage cost net wage
y=0 4 0 4
y = y∗1 5 3y∗1 5 − 3y∗1
y = y∗2 6 3y∗2 6 − 3y∗2
310 Chapter 9. Signaling
The following table shows the net wage for Type 2 for each of the three choices of
education.
Type 2
choice wage cost net wage
y=0 4 0 4
y = y∗1 5 2y∗1 5 − 2y∗1
y = y∗2 6 2y∗2 6 − 2y∗2
The following table shows the net wage for Type 3 for each of the three choices of
education.
Type 3
choice wage cost net wage
y=0 4 0 4
y = y∗1 5 y∗1 5 − y∗1
y = y∗2 6 y∗2 6 − y∗2
It follows from (9.15) and (9.16) that indifference curves in the (y, m) plane must be
increasing: starting from a point (ŷ, m̂) if another point (y, m) with y > ŷ is to give the same
utility as (ŷ, m̂) (that is, is to be on the indifference curve that goes through (ŷ, m̂)) it must
be that m > m̂. We shall consider the case where indifference curves are convex, as shown
in Figure9.1.8
income
m
direction of
increasing
utility indifference
curve
y
education
0
Indifference curves can be used to obtain the monetary equivalent of the psychological
cost (or “disutility”) associated with an increase in the amount of education. Starting from
an education-income pair (ŷ, m̂) (which lies on the indifference curve corresponding to
some level of utility ū), if education is increased by one unit and income is kept constant,
then utility goes down, that is, the individual is worse off. One can thus ask: How much
extra income should the individual be given to compensate for the reduction in utility
due to the increase in y? In other words, what is the amount by which income has to be
increased in order to keep the individual on the same indifference curve?
8 For √
example, if U(y, m) = m − y then the equation of the indifference curve corresponding to utility
2
level ū is m = (ū + y) , which is increasing and convex in y.
9.5 A more general analysis 313
This is shown in Figure 9.2: starting from the education-income pair (ŷ, m̂), if education
is increased by one unit to ŷ +1 then the individual requires an additional amount of income
equal to (m1 − m̂) in order to remain on the same indifference curve; if education is then
increased by one more unit, from ŷ + 1 to ŷ + 2, then the individual requires an additional
income compensation equal to (m2 − m1 ). Note that, since the indifference curve is
convex, (m2 − m1 ) > (m1 − m̂), that is, every additional unit of education requires a larger
compensation in terms of income.
income
m
indifference
curve
m2
m1
m̂
y
0 ŷ ŷ + 1 ŷ + 2 education
Recall that the crucial assumption in the analysis of the previous sections was that
higher-productivity people had a lower monetary cost of education. We can now express
this difference in education costs in terms of preferences, as follows. Let us focus on
the simple case where there are only two types of individuals: Type H and Type L; the
latter are the less productive ones. We want to capture, using indifference curves, the fact
that education is more “psychologically costly” for type-L individuals than for type-H
individuals: for any given increase in education, the income compensation required by an
L-type is larger than the income compensation required by an H-type. This will be the case
if and only if the indifference of an L-type that goes through a given education-income
pair (ŷ, m̂) is steeper than the indifference curve of an H-type through (ŷ, m̂).
314 Chapter 9. Signaling
This is shown in Figure 9.3: at income level m̂, increasing education from ŷ to ŷ + 1
makes every type worse off; in order to restore a type-L individual to the original utility
level (that is, in order to keep him on the L-type indifference curve through (ŷ, m̂)) an
increase in income equal to (mL − m̂) is required, while for a type-H individual the required
increase in income is (mH − m̂). As shown in Figure 9.3, (mL − m̂) > (mH − m̂).
income
m
Type L
Type H
mL
mH
m̂
y
0 ŷ ŷ + 1 education
Figure 9.3: The monetary compensation for an extra unit of education required by an
L-type is larger than that required by an H-type.
Let us assume that education does increase productivity, but at different rates for the
two types. Specifically, we assume that productivity, denoted by Π, is as follows, with
0 < kL < kH :
Type L : ΠL (y) = kL y
Type H : ΠH (y) = kH y.
9.5 A more general analysis 315
The productivity functions for the two types are shown in Figure 9.4.
productivity
Π
Type H
Type L
y
0 education
What education levels would the two types choose in a perfect-information world in
which employers were able to determine the type of each job applicant? The answer is
illustrated in Figure 9.5.
productivity
Π Type H
Type H
Type L
Type L
y
0 yL yH education
Figure 9.5: The education choices of the two types under perfect information.
316 Chapter 9. Signaling
In the perfect-information case (where employers can determine the type of a job
applicant and offer a wage equal to his/her productivity), the optimal choice of education
for a type is given by the horizontal coordinate of the point at which there is a tangency
between the productivity line of that type and an indifference curve of that type. Indeed, if
an individual chooses a level of education corresponding to a point at which the indifference
curve crosses the productivity line, such as point A in Figure 9.6 for type L, then there are
points to the left of that point (corresponding to lower levels of education) that lie on a
higher indifference curve (and thus are better for the individual).
productivity
Π
A Type L
y
0 yL ŷ education
Figure 9.6: An education level, such as ŷ, at which the indifference curve crosses the
wage/productivity line is not optimal (this is shown for a type-L individual).
We denote the perfect-information choice of education for type L by yL and the perfect-
information choice of education for type H by yH .
Let us now return to the case where employers are not able to determine the type of a
job applicant. We assume, as in the previous sections, that employers set a threshold level
of education, denoted by y∗ , and announce that anybody with a level of education less than
y∗ will be classified as a type-L individual and will be offered a wage equal to kL y (where y
is the level of education he/she has) and anybody with a level of education greater than or
equal y∗ will be classified as a type-H individual and will be offered a wage equal to kH y .
9.5 A more general analysis 317
wage
offered
(Type H)
(Type L)
kH y∗
y
0 y∗ education
Figure 9.7: The wage offered by employers as a function of education, with a threshold
value y∗ .
Then the indifference curve of an H-type that goes through the point (y∗ , kH y∗ ) crosses the
H-type productivity line at that point, so that points to the right of it are worse choices for
the H-type. Hence a type-H individual will only consider either education level y∗ (that
will have her classified correctly as type H) or a lower education level (that would have her
classified incorrectly as type L). Two cases are possible:
1. the indifference curve of the H-type that goes through the point (y∗ , kH y∗ ) lies
entirely above the segment of the wage schedule to the left of that point,
2. the indifference curve of the H type that goes through the point (y∗ , kH y∗ ) crosses
the segment of the wage schedule to the left of that point.
In Case 1, an H-type would choose level of education y∗ , while in Case 2 an H-type would
choose a level of education less than y∗ and thus be classified incorrectly as type-L, with
the consequence that there would be no separating signaling equilibrium.
318 Chapter 9. Signaling
productivity
Π
(Type H)
kH y∗
y
0 yH y∗ education
productivity
Π
(Type H)
kH y∗
y
0 yH y∗ education
Figure 9.9: Case 2: the optimal choice of education for an H-type is less than y∗ .
9.5 A more general analysis 319
Thus, in order for a separating signaling equilibrium to exist, it is necessary that we are
in Case 1 above where the indifference curve of the H-type that goes through the point
(y∗ , kH y∗ ) lies entirely above the segment of the wage schedule to the left of that point.
This guarantees that the H-type individuals will choose level of education y∗ and will thus
be correctly classified as type-H and remunerated according to their true productivity.
The second requirement for the existence of a separating signaling equilibrium is that
the L-types choose a level of education less than y∗ (so that they will be correctly classified
as type-L and remunerated according to their true productivity). The relevant indifference
curve for the L-type is the one that goes through the point corresponding to the L-type
choice of education under perfect information, namely point (yL , kL yL ).9
One possibility, illustrated in Figure 9.10, is that this indifference curve of the L-type
cuts the second segment of the wage schedule. In this case the L-type would choose a
level of education greater than, or equal to, y∗ and thus there is no separating signaling
equilibrium.
productivity
Π
(Type L)
kL yL
y
0 yL y∗ education
Figure 9.10: The case where the optimal choice of education for an L-type is greater than,
or equal to, y∗ .
The other possibility is that the indifference curve of the L-type that goes through point
(yL , kL yL ) lies entirely above the second segment of the wage schedule, that is, the segment
corresponding to levels of education greater than or equal to y∗ . In this case the optimal
choice of education for the L-type is yL , with the consequence that type-L individuals are
correctly classified as type L and are paid according to their true productivity.
9 Note that, since we are looking at the case where yH < y∗ , and yL < yH , the point (yL , kL yL ) lies on the
first segment of the wage schedule, that is, the segment corresponding to choices of education less than y∗ .
320 Chapter 9. Signaling
productivity
Π
(Type L)
y
0 yL y∗ education
Figure 9.11: The case where the optimal choice of education for an L-type is yL .
Putting together the cases illustrated in Figures 9.8 and 9.11 we obtain the conditions
that yield the existence of a separating signaling equilibrium:
1. the indifference curve of the H-type that goes through the point (y∗ , kH y∗ ) lies
entirely above the segment of the wage schedule to the left of that point (that is, the
segment from (0, 0) to (y∗ , kL y∗ ),
2. the indifference curve of the L-type that goes through point (yL , kL yL ) lies entirely to
the left of the second segment of the wage schedule (that is, the segment that starts
at the point (y∗ , kH y∗ )
9.5 A more general analysis 321
Figure 9.12 illustrates the case where a separating signaling equilibrium exists: all type-
H individuals choose level of education y∗ and are paid kH y∗ , while all type-L individuals
choose level of education yL and are paid kL yL .
productivity
Π
(Type H
(Type L)
y
0 yL y∗ education
Note that, since y∗ > yH , at the separating signaling equilibrium illustrated in Figure
9.12 there is over-investment in education: the H-types invest in education more than
they would in a perfect-information world. Thus the asymmetry of information between
employers and job applicants involves a cost for society that would not be incurred if there
were perfect information.
We considered the case where the threshold value y∗ is greater than the level of
education that would be chosen by an H-type under perfect information, that is, the case
where y∗ > yH . The remaining two cases, namely y∗ < yL and yL < y∗ < yH are examined
in Exercises 9.10 and 9.11.
We conclude with a brief discussion of other markets in which signaling can take place.
We saw in Chapter 7 that, due to adverse selection, it is possible that in the market for
second-hand cars some qualities (in the extreme case, all but the lowest quality) cannot
be traded, giving rise to an inefficiency if potential buyers value those cars more than the
owners. Thus the owner of a good-quality car gets “stuck” with her car, unable to convince
a potential buyer that her car is of good quality and thus justifies a higher price than the
low market price. Merely claiming that her car is of good quality is “cheap talk” which
can be imitated by the owners of cars of inferior quality. Credible communication with
potential buyers would require a costly signal which other sellers would not find profitable
to imitate. What could constitute such a signal? The answer is: offering a warranty to
cover future repairs.
We shall not develop a formal analysis of this, since - as explained below - this is more
of a theoretical possibility than a practical one. However, a formal example is developed in
detail in Exercise 9.12, which the reader is urged to attempt.
Presumably, in the case of cars ‘high quality’ means ‘unlikely to break down and
require an expensive repair’ and ‘low quality’ means ‘likely to require an expensive repair’.
Indeed, the buyer of a used car will typically worry that she will end up with a “lemon” into
which she will have to sink a lot of money for maintenance. The owner of a good-quality
car knows that her car is unlikely to require major repairs and would in principle be willing
to offer a warranty against such future repairs, that is, she will be willing to contractually
commit herself to paying for the cost of any such repairs (within a specified period of
time). This commitment will end up involving a low expected cost for her, because of the
low probability of a major breakdown (since the car is, after all, a good-quality car). On
the other hand, the owner of a low-quality car would not be willing to undertake such a
commitment, because she recognizes that the probability of her having to pay for a major
repair in the future is, in fact, quite high. Thus offering to sell with warranty can indeed
work as a credible signal of high quality and induce buyers to correspondingly pay a higher
price for cars sold with warranty.
However, from a practical point of view there are several factors that work against this
form of signaling. First of all, while a warranty offered by an established dealership is
perhaps sufficiently credible,10 a warranty offered by a private individual is less credible:
the seller might “disappear” (e.g. move to a different state) and become untraceable when
the warranty becomes relevant (due to a needed repair) or the seller might declare personal
bankruptcy or become insolvent, etc. There is also an issue of moral hazard involving the
buyer, which is a source of concern for the seller: if the car is insured against future repairs,
the new owner might abuse the car and cause a breakdown that could be avoided with due
care. The topic of moral hazard will be discussed in the next chapter.
10 However, even dealerships can go bankrupt, leaving the buyer with a worthless warranty.
9.6 Signaling in other markets 323
Advertising is pervasive in our daily lives: we are exposed to it as we drive on the freeway,
as we watch television, while we browse the internet, etc. Why do firms advertise?
What is the purpose of advertising? What information, if any, do consumers obtain from
advertisements?
It is clear that there cannot be a single answer to the above questions, because advertis-
ing falls into several different categories. In terms of content, for example, advertising can
be informative or uninformative. The former makes consumers aware of something they
did not know, such as the existence of a product, or its properties, or its price, or where
it can be obtained. Most advertising, however, does not seem to fall into such category:
for example, an ad showing a pop-star or an athlete drinking Coca Cola does not seem
to convey to consumers any information that they did not already have (are there many
people who do not know what Coca Cola is?); an ad claiming that a particular model of
car is “the best” seems to be just “cheap talk”, etc.
In this section we focus on “cheap talk” advertising, that is, claims made about a
product that are easily imitated by other producers. Does this type of advertising make
sense? Should consumers pay attention to it?
Nelson11 provided an explanation for this type of advertising in terms of what we now
call signaling. He observed that the characteristics that differentiate products in a given
market can be classified into search characteristics and experience characteristics.
Search characteristics are those that can be ascertained before purchasing the good
(e.g. the style of a dress). Since consumers can verify by themselves what the properties
of the good are - before buying it - in this case there is no scope for false or misleading
advertising: a false or exaggerated claim by the seller will not induce any purchases and
will only damage the seller’s reputation.
Experience characteristics, on the other hand, can only be learned through use and
therefore after the purchase (e.g. the quality of a stereo system or the taste of a brand of
canned tuna fish). In this case there is scope for false or misleading advertising: some
consumers might be induced to buy the product by claims that will turn out to be false.
However, Nelson argues that high-quality brands will obtain more repeat purchases, ceteris
paribus, than low-quality brands, because consumers will be satisfied with the product
and will buy it again in the future and/or recommend it to their friends. A dollar spent on
advertising for a high-quality good generates future revenue that justifies the advertising
expense. Producers of low-quality goods, on the other hand, might be able to fool a
consumer into buying the first time, but will experience no repeat purchases. In other
words, advertising is an expensive signal, which is more costly for the producer of a low-
quality product than for the producer of a high-quality product and can thus be used by the
latter to separate herself from the competitors who sell inferior goods. Hence advertising
is informative, no matter what its content: the information conveyed by advertising is that
the advertiser can afford the cost of advertising because its product is successful due to its
high quality: advertising is a signal that the advertised product is of high quality.
11 Phillip Nelson, Advertising as information, Journal of Political Economy, 1974, Vol. 82, No. 4, pp.
729-754.
324 Chapter 9. Signaling
12 Michael Spence, Market signaling: Informational transfer in hiring and related screening processes,
Harvard University Press, 1974.
13 Sudipto Bhattacharya, Imperfect information, dividend policy and the “Bird in the Hand” fallacy, Bell
9.7 Exercises
The solutions to the following exercises are given in Section 9.8 at the end of this chapter.
Exercise 9.1 Suppose that employers offer the wage schedule shown in Table 9.9.
There are two types of individuals in the population: type H with productivity $34,000
(independent of education) and type L with productivity $10,000 (also independent of
education). Education is costly and the monetary cost of acquiring education for the
two types is as follows:
0 if y ≤ 6
For type H : CH (y) =
1, 600(y − 6) if y > 6
0 if y ≤ 6
For type L : CL (y) =
2, 500(y − 6) if y > 6
Exercise 9.2 Suppose that there are two groups of individuals: Group L with produc-
tivity 2 (independent of education) and Group H with productivity 3 (independent of
education). Group H constitutes 13 of the population. Workers of both types are able to
buy education, at a cost. The amount of education y is a continuous variable and is fully
verifiable (e.g. through a certificate).
Type-L individuals face a higher cost of acquiring education than type-H individuals:
CL (y) = y and CH (y) = 2y .
Employers believe that anybody with a level of education less than y∗ has a produc-
tivity of 2 (and thus is offered a wage of 2) while anybody with a level of education
greater than or equal to y∗ has a productivity of 3 (and thus is offered a wage of 3).
(a) What values of y∗ give rise to a separating signaling equilibrium?
(b) If the government intervened and forced everybody to choose y = 0 and employers
to pay the same wage to everybody, equal to the average productivity in the
population, would there be a Pareto improvement?
Exercise 9.3 Consider the following situation. Education does increase productivity.
There are two groups in the population.
People in Group L have a productivity of 1 + 4y (where y is the amount of education)
education is $ 2y
Find all the signaling equilibria, when the employers’ beliefs are as follows: if a job
applicant has y < y∗ , then he/she must be from Group L (and thus will be offered a wage
of 1 + 4y ), while if he/she has y ≥ y∗ , then he/she must be from Group H (and thus will
be offered a wage of 2 + 4y ).
Exercise 9.4 Consider the following situation, where education increases productivity.
There are two groups in the population.
People in Group L have a productivity of (4 + 2y) (where y is the amount of education)
and their cost of acquiring y units of education is $5y.
People in Group H have a productivity of (6 + 3y) and the cost of acquiring y units of
education is $4y
Find all the signaling equilibria, when the employers’ beliefs are as follows: if a job
applicant has y < y∗ , then he/she must be from Group L (and thus will be offered a wage
of 4 + 2y), while if he/she has y ≥ y∗ , then he/she must be from Group H (and thus will
be offered a wage of 6 + 3y).
9.7 Exercises 327
Exercise 9.5 Suppose that employers offer the wage schedule shown in Table 9.10.
There are two types of individuals in the population: type H with productivity πH (inde-
pendent of education) and type L with productivity πL (also independent of education).
Education is costly and the monetary cost of acquiring education for the two types is as
follows:
0 if y ≤ 6
For type H : CH (y) =
900(y − 6) if y > 6
0 if y ≤ 6
For type L : CL (y) =
1, 400(y − 6) if y > 6
Exercise 9.6 There are two types in the population: type H and type L. Productivity
and cost are functions of the amount of education y, as follows:
Employers believe that applicants with y ≥ y∗ > 0 are of type H and are thus offered a
wage equal to a + 4y and everybody else is offered a wage equal to b + 3y.
For what values of a and b is there a separating signaling equilibrium?
328 Chapter 9. Signaling
Note that productivity is innate and is not influenced by either education or gender.
Suppose that employers mistakenly believe, not only that education affects pro-
ductivity, but also that women are somewhat “genetically handicapped” and therefore
need to acquire more education in order to become more productive. Accordingly, the
employers offer the following wage schedules, with y∗w > y∗m .
Are there values of y∗m and y∗w that give rise to a separating signaling equilibrium
where the employers beliefs are confirmed?
9.7 Exercises 329
Exercise 9.8 Let y denote the amount of education. There are three types of potential
workers:
those (Group I) with productivity 36 (a constant, thus independent of education),
those (Group II) with productivity (60 + 3y), and
those (Group III) with productivity (72 + 2y).
Each worker knows whether she belongs to Group I or Group II or Group III, while the
potential employer does not. The cost of acquiring y units of education is
24y for Group I,
12y for Group II, and
6y for Group III.
Let 0 < a < b. The potential employer believes that those applicants with education
less than a belong to Group I, those with education at least a but less than b belong
to Group II and those with education at least b belong to Group III and offers each
applicant a wage equal to the applicant’s estimated productivity, given the applicant’s
level of education (the level of education can be verified by the employer during the job
interview).
(a) Write a list of inequalities (involving the parameters a and b) that are necessary
and sufficient for the existence of a signaling equilibrium.
(b) Explain why a = 2 and b = 2.5 is not a signaling equilibrium.
(c) Is a = 2.5 and b = 4 a signaling equilibrium?
Exercise 9.9 Suppose that there are four types in the population with the following
productivity and cost of education (y is the amount of education), with a1 < a2 < a3 < a4
and b1 > b2 > b3 > b4 > 2.
Employers set three threshold levels of education, y∗1 , y∗2 and y∗3 and offer the following
wages, depending on the level of education y of the job applicant:
Education y wage
y < y∗1 a1
y∗1 ≤ y < y∗2 a2
y∗2 ≤ y < y∗3 a3 + y
y ≥ y∗3 a4 + 2y
Write the inequalities that need to be satisfied in order to have a separating signaling
equilibrium (where any two types make different choices of education).
330 Chapter 9. Signaling
Exercise 9.11 Continuing the analysis of Section 9.5, consider the case where the
threshold value y∗ is greater than the level of education that would be chosen by an
L-type under perfect information, and less than the level of education that would be
chosen by an H-type under perfect information that is, the case where yL < y∗ < yH .
Draw a diagram showing the wage schedule and the relevant indifference curves of the
two types and determine whether there can be a separating signaling equilibrium in this
case. [Hint: you need to distinguish two cases.]
quality of car: A B C D
1 2 2 3
proportion: 8 8 8 8
value to seller: $15,000 $10,000 $8,000 $6,000.
All sellers and buyers are risk neutral. While each seller knows the quality of his car,
each buyer is unable to determine the quality of any car offered for sale.
A car of quality i ∈ {A, B,C, D} has probability qi of requiring a major repair
1
within the next 5 years with qA = 10 , qB = 12 , qC = 54 , qD = 100
99
. A major repair costs
$15,000. The buyer’s valuation bi of a car of quality i reflects this, in the sense that, for
all i ∈ {A, B,C, D}, bi = $(24, 000 − 15, 000qi ).
(a) Compute the value of each quality to the buyers.
(b) Suppose first that there is only one price for used cars, p = $9, 000. What cars
are traded?
(c) Suppose now that each seller can offer his car for sale either with or without
warranty. If a car is sold with warranty, the seller credibly undertakes to pay for
the repair himself, should a repair become necessary within the next 5 years. In
this new situation, there can be two prices in the market: a price pN for cars sold
without warranty and a price pW for cars sold with warranty. A car of quality A
sold with warranty is worth $24,000 to the buyer. Suppose that pW = $17, 000
and pN = $11, 000. What cars are sold with warranty and what cars are sold
without warranty?
9.8 Solutions to Exercises 331
y w CH w −CH
years in school gross wage cost net wage
y=6 $10,000 $0 $10,000
y = 12 $24,000 $9,600 $14,400
y = 16 $29,000 $16,000 $13,000
y = 18 $34,000 $19,200 $14, 800
y = 21 $36,000 $24,000 $12,000
The cost-benefit analysis of a type-L person is shown in Table 9.12. Thus a type-
L individual will choose y = 6 and be paid $10,000 which is equal to his true
productivity.
Hence we do have a separating signaling equilibrium.
y w CL w −CL
years in school gross wage cost net wage
y=6 $10,000 $0 $10, 000
y = 12 $24,000 $15,000 $9, 000
y = 16 $29,000 $25,000 $4,000
y = 18 $34,000 $30,000 $4,000
y = 21 $36,000 $37,500 $-1,500
(b) In the new world everybody chooses y = 6 and faces zero costs of education. The
average productivity is 34, 000pH + 10, 000(1 − pH ) = 10, 000 + 24, 000pH . Type-
L individuals will be better off because 10, 000 + 24, 000pH > 10, 000. Type-H
individuals will now have a net income of 10, 000 + 24, 000pH , which is greater than
or equal to their net income at the signaling equilibrium of Part (a), namely $14,800,
if and only if pH ≥ 15 . Thus we have a Pareto improvement if and only if pH ≥ 15 .
332 Chapter 9. Signaling
Thus a Group-L person will choose y = 0 and be paid a wage equal to his true productivity
Thus a Group-H person will choose y = y∗ and be paid a wage equal to her true productivity
if and only if 2 − 14 y∗ ≥ 1, that is, if and only if y∗ ≤ 4 .
y w CH w −CH
y=6 $10,000 $0 $10,000
y = 12 $15,000 $5,400 $9,600
y = 16 $20,000 $9,000 $11,000
y = 18 $25,000 $10,800 $14,200
y = 21 $30,000 $13,500 $16, 500
The cost-benefit analysis of a type-L person is shown in the following table. Thus a
type-L individual will choose y = 6 and be paid $10,000.
Hence in order to have a signaling equilibrium it must be that πH = 30, 000 and
πL = 10, 000.
y w CL w −CL
y=6 $10,000 $0 $10, 000
y = 12 $15,000 $8,400 $6, 600
y = 16 $20,000 $14,000 $6,000
y = 18 $25,000 $16,800 $8,200
y = 21 $30,000 $21,000 $9,000
(b) In the new world everybody chooses y = 6 (the only available choice). The average
productivity is 30, 000pH + 10, 000(1 − pH ) = 10, 000 + 20, 000pH . Thus type-L
individuals will be better off, since 10, 000 + 20, 000pH > 10, 000. Type-H individu-
als will now have a net income of 10, 000 + 20, 000pH ; this is greater than, or equal
to, their net income at the signaling equilibrium of Part (a), namely $16,500, if and
13 13
only if pH ≥ 40 . Thus we have a Pareto improvement if and only if pH ≥ 40 .
334 Chapter 9. Signaling
Thus any two values of a and b such that y∗ ≤ a − b ≤ 2y∗ will give rise to a separating
signaling equilibrium.
1. For (1) we need 1 ≥ 2 − y∗m (for men), that is y∗m ≥ 1 and 1 ≥ 2 − y∗w (for women),
that is y∗w ≥ 1. Since y∗w > y∗m , both conditions are satisfied if
1 ≤ y∗m . (9.19)
y∗m
2. For (2) we need 2 − 2 ≥ 1, that is,
y∗m ≤ 2. (9.20)
y∗w
3. For (3) we need 2 − 2 ≥ 1, that is,
y∗w ≤ 2. (9.21)
Since y∗w > y∗m , (9.21) implies (9.20). Thus any pair of values (y∗m , y∗w ) such that
1 ≤ y∗m < y∗w ≤ 2 gives rise to a separating equilibrium where employers’ beliefs are
confirmed in every respect. In particular, they obtain no evidence against their belief
that, in order to become more productive, women need to spend more time in school: a
man acquires a productivity of 2 by spending y∗m years in school, while women acquire a
productivity of 2 by spending more time in school, namely y∗w years rather than y∗m .
9.8 Solutions to Exercises 335
wage
offered (Type H)
(Type L)
y
0 y∗ yL yH education
Figure 9.13: Every point on the first segment of the wage schedule is worse for the L-types
than some point on the second segment of the wage schedule.
14 We have drawn the case where the point (yH , kH yH ) lies above the L-type indifference curve that goes
through the point (yL , kL yL ), but this is not a relevant fact: even if this were not the case, the argument would
remain valid.
9.8 Solutions to Exercises 337
wage
offered (Type H)
(Type L)
y
0 yL y∗ yH education
Figure 9.14: The case where the point (y∗ , kH y∗ ) lies above the L-type indifference curve
that goes through the point (yL , kL yL ).
In this case the point (y∗ , kH y∗ ) - which is on the wage schedule - is better than the point
(yL , kL yL ), which is the best point on the first segment of the wage schedule (indeed the
best point on the entire line corresponding to the productivity of the L-type: Π = kL y).
Thus the L-type will not choose a level of education less than y∗ and thus will not be
remunerated according to their true productivity. As remarked in the answer to Exercise
9.10, even if the choice of education of the L-types is different from the choice made by
the H-types (namely yH ) - so that we would have a separating outcome - it would not be
an equilibrium, because employers would discover that they are overpaying some of the
employees and would want to change the wage schedule.
338 Chapter 9. Signaling
Case 2: The point (y∗ , kH y∗ ) lies to the right of the L-type indifference curve that goes
through the point (yL , kL yL ). This case is illustrated in Figure 9.15.
wage
offered (Type H)
(Type L)
y
0 yL y∗ yH education
Figure 9.15: The case where the point (y∗ , kH y∗ ) lies to the right of the L-type indifference
curve that goes through the point (yL , kL yL ).
In this case every point on the second segment of the wage schedule (the segment that
starts at (y∗ , kH y∗ )) is worse, for the L-type, than the point (yL , kL yL ), which - in turn - is
the best point on the first segment of the wage schedule. Thus the L-types will choose
level of education yL and the H-types will choose level of education yH and both types
will be remunerated according to their true productivity. Hence in this case we do have
a separating signaling equilibrium. Note that it coincides (in terms of education choices
and remuneration) with what would happen in the case of perfect information. Thus in
this case the asymmetry of information does not have negative consequences; in particular
there is no over-investment in education by the H-types (unlike the case where y∗ > yH
considered in Section 9.5).
9.8 Solutions to Exercises 339
whose expected value is 15,500, greater than the value of the car. Thus he is willing
to sell with warranty at price pW = 17, 000.
If the owner of a car of quality B sells at price pW = 17, 000 then he faces the lottery
$17, 000 $(17, 000 − 15, 000)
1 1
2 2
whose expected value is 9,500 which is less than the value of the car to him, namely
10,000. Thus he is not willing to sell with warranty.
If the owner of a car of quality C sells at price pW = 17, 000 then he faces the lottery
$17, 000 $(17, 000 − 15, 000)
1 4
5 5
whose expected value is 5,000 which is less than the value of the car to him, namely
8,000. Thus he is not willing to sell with warranty.
If the owner of a car of quality D sells at price pW = 17, 000 then he faces the lottery
$17, 000 $(17, 000 − 15, 000)
1 99
100 100
whose expected value is 2,150 which is less than the value of the car to him, namely
6,000. Thus he is not willing to sell with warranty.
Thus the only cars offered for sale with warranty are the cars of quality A. Since
the buyer values a car of quality A sold with warranty at $24,000, he is willing to
pay $17,000, realizing a gain equal to $7,000. Thus all cars of quality A will be sold
with warranty at price pW = 17, 000.
15 Alternatively one could say that the lottery - after paying the price - is
$3, 000 $150
2 3
5 5
whose expected value is $1,290, hence a positive net benefit from buying.
340 Chapter 9. Signaling
Now let us look at the market for cars sold without warranty. Since pN = 11, 000
the owners of cars of qualities B, C and D are willing to sell (they value their cars
less than 11,000). Thus buying a car without warranty means facing the lottery
$16, 500 $12, 000 $9, 150
2 2 3
7 7 7
which has an expected value of 12,064.29, greater than the price of 11,000. Thus
consumers are willing to buy.
In conclusion, cars of quality A will be sold with warranty at a price of $17,000 and
all the other cars will be sold without warranty at a price of $11,000.
Thus, offering to sell with warranty is a signal used by the owners of the best cars
(those of quality A) to credibly convey information to potential buyers about the
quality of their cars.
V
Moral Hazard
12 Glossary . . . . . . . . . . . . . . . . . . . . . . . . . 409
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 413
10. Moral Hazard in Insurance
If the customer’s behavior can be observed by the insurance company and verified by
a court of law, then the insurance company can specify it in the contract and make any
payments conditional on the customer’s actions. For example, the insurance company could
require the customer to install a security system in her house and make any reimbursements
due to theft conditional on proof that the security system was in fact installed. However,
in most cases it is impossible, or prohibitively expensive, for the insurer to monitor the
behavior of its customers. For example, no insurance company will find it worthwhile to
have an insurance agent follow the customer around to make sure that she always locks her
bicycle, when left unattended!
We will assume that the customer’s behavior cannot be observed by the insurance
company; however, the insurance company can try to figure out what the potential customer
would do under different insurance contract.
It follows from (10.2) and (10.3) that, for any point (W1 ,W2 ) in the wealth space,
- with No-effort, the slope of the indifference curve at point (W1 ,W2 ) is
0
pN U (W1 )
− (10.5)
1 − pN U 0 (W2 )
- with Effort, the slope of the indifference curve at point (W1 ,W2 ) is
0
pE U (W1 )
− . (10.6)
1 − pE U 0 (W2 )
Thus, for any point A = (W1A ,W2A ) in the wealth space, we deduce from (10.4), (10.5) and
(10.6) that
Wealth in
good state
W2
NI 45o line
W0
indifference curves
with Effort
indifference curves
with No-effort
Wealth in
W1 bad state
0 W0 − `
Figure 10.1: Indifference curves corresponding to Effort are less steep than those corre-
sponding to No-effort.
Figure 10.1 shows two sets of indifference curves: one corresponding to Effort (the
less steep ones) and one corresponding to No-effort (the steeper ones).
10.2 Moral hazard and insurance 347
When offered an insurance contract, the individual will have four possible choices:
1. remain uninsured and choose Effort,
2. remain uninsured and choose No-effort,
3. purchase the insurance contract and choose Effort,
4. purchase the insurance contract and choose No-effort,
and she will choose the option that yields the highest expected utility.
Her vNM preferences are represented the following vNM utility-of-money function
(for m > 0):3
10 − 10,000
(
m if she chooses Effort
U(m) =
10.01 − 10,000
m if she chooses No-effort.
Suppose that she is offered the following menu of insurance contracts:
premium deductible
Contract 1: $1,000 $5,000
Contract 2: $500 $8,000
Contract 3: $100 $12,000
Will she insure and, if so, which contract will she choose?
Step 1. Determine the best effort level in case of no insurance NI = (20000, 50000):
1
10 − 21 + 10
9
10 − 15 = 9.77 .
Effort: E[UE (NI)] = 10
No-effort: E[UN (NI)] = 51 10.01 − 12 + 45 10.01 − 15 = 9.75.
Step 2. Determine the best effort level for Contract 1, namely C1 = (44000, 49000):
1 1
9 1
Effort: E[UE (C1 )] = 10 10 − 4.4 + 10 10 − 4.9 = 9.7936.
1
No-effort: E[UN (C1 )] = 15 10.01 − 4.4 + 45 10.01 − 4.9
1
= 9.8013 .
Thus, under Contract 1, the individual will choose No-effort.
Step 3. Determine the best effort level for Contract 2, namely C2 = (41500, 49500):
1 1
9 1
Effort: E[UE (C2 )] = 10 10 − 4.15 + 10 10 − 4.95 = 9.7941.
No-effort: E[UN (C2 )] = 15 10.01 − 4.15
1
+ 45 10.01 − 4.95
1
= 9.8002 .
Thus, under Contract 2, the individual will choose No-effort.
Step 4. Determine the best effort level for Contract 3, namely C3 = (37900, 49900):
1 1
9 1
Effort: E[UE (C3 )] = 10 10 − 3.79 + 10 10 − 4.99 = 9.7933.
No-effort: E[UN (C3 )] = 15 10.01 − 3.79
1
+ 45 10.01 − 4.99
1
= 9.7969 .
Thus, under Contract 3, the individual will choose No-effort.
Of all these options, the one that gives the highest expected utility is Contract 2 with No-
effort. Thus the individual would purchase Contract 2 and choose No-effort. The insurer’s
expected profit from Contract 2 is thus h − pN (` − d) = 500 − 15 (30, 000 − 8, 000) =
$ − 3, 900: a loss! Hence the insurance company would not want to offer Contract 2 (or
any of the other two contracts, since they all involve a loss). Before we address the issue
of what contract(s) would be offered by an insurer, we need to re-examine the notion of
“reservation indifference curve” in the context of moral hazard.
The interesting case is where under no insurance the individual will choose Effort; it is
interesting because insurance might provide an incentive for the individual to switch to
10.2 Moral hazard and insurance 349
No-effort. Thus in this section and the next we will assume that E[UE (NI)] > E[UN (NI)]
so that, by (10.7),
The individual will reject any contract which, with the best choice of effort, will yield a
utility which is less than E[UE (NI)], that is, E[UE (NI)] provides the reservation level of
utility. Does this mean that the indifference curve corresponding to Effort that goes through
NI can be taken to be the reservation indifference curve, that is, the set of contracts that
give an expected utility equal to E[UE (NI)]? The answer is negative, because the fact that
the individual prefers to choose Effort if uninsured does not imply that she will continue
to choose Effort when insured. For example, if fully insured then she will be better off by
choosing No-effort.
Figure 10.2 shows the two indifference curves that go through the NI point: the less
steep one corresponds to Effort and a utility level of ū = E[UE (NI)] and the steeper one to
No-effort and a utility level of û = E[UN (NI)].
Wealth in
good state
W2
NI 45o line
W0
utility
ū F assume: ū > û
W2F = W1F
utility indifference curve
û with Effort; utility: ū
indifference curve
with No-effort; utility: û
Wealth in
W1 bad state
0 W0 − ` W1F
Let us use the expression reservation utility locus4 to denote the set of points (contracts) in
the wealth plane that give an expected utility equal to ū, when the individual makes the
best choice of effort; then, while NI belongs to it, point F does not. Thus the reservation
utility locus does not coincide with the indifference curve corresponding to Effort that goes
through NI.
How do we determine the reservation utility locus?
Continue to denote by ū the individual’s expected utility when uninsured and choosing
Effort. We need to consider the expected utility from choosing No-effort. By hypothesis,
Thus, by continuity, there must be a point, call it A = W1A ,W2A , on the indifference curve
corresponding to Effort that goes through NI, such that (see Figure 10.3)
Wealth in
good state
W2
NI 45o line
W0
utility: ū
A assume: ū > û
F
utility: indifference curve
û with Effort; utility: ū
indifference curves
with No-effort
Wealth in
W1 bad state
0 W0 − `
Figure 10.3: Point A is such that E[UN (A)] = E[UE (A)] = ū.
Thus, since the reservation utility level (that is, the maximum utility that the individual
can achieve if not insured) is ū (= E[UE (NI)]), the reservation utility locus is a kinked
4 Instead of ‘reservation indifference curve’, since it does not coincide with an indifference curve.
10.2 Moral hazard and insurance 351
curve consisting of the initial segment from NI to point A of the indifference curve
through NI corresponding to Effort and the segment from point A to the 45o line of
the indifference curve through A corresponding to No-effort, where point A is such that
E[UN (A)] = E[UE (A)] = E[UE (NI)].
The reservation utility locus is shown as a thick continuous kinked curve in Figure
10.4.
Wealth in
good state
W2
NI 45o line
W0
indifference curve
with Effort; utility: ū
indifference curve
with No-effort; utility: ū
Wealth in
W1 bad state
0 W0 − `
Figure 10.4: The reservation utility locus is the union of the thick continuous curves.
The individual’s initial wealth is W0 and she faces a potential loss of `. The probability
of her incurring a loss is pE if she chooses Effort and pN if she chooses No-effort, with
0 < pE < pN < 1.
352 Chapter 10. Moral Hazard in Insurance
Let us first determine for what values of c she will choose Effort when not insured.
and her expected utility if she has no insurance and chooses Effort is
p p
E[UE (NI)] = pE W0 − ` − c + (1 − pE ) W0 − c
p p
= pE W0 − ` + (1 − pE ) W0 − c.
that is,
p p
c < (pN − pE ) W0 − W0 − ` (10.9)
For the rest of this section let us fix the following values of the parameters, which
satisfy (10.9):5
1 1 15
W0 = 2, 500 ` = 1, 600 pE = pN = c=
20 10 16
Since (10.9) is satisfied, the individual – if uninsured – will choose Effort and thus her
reservation utility is:
1√ 19 p 15 769
E[UE (NI)] = 900 + 2, 500 − = = 48.0625.
20 20 16 16
Let us find the contract, call it A, that would make the individual indifferent between
1. not insuring and choosing Effort,
2. purchasing contract A and choosing Effort,
3. purchasing contract A and choosing No-effort.
5 In fact, with these values, the right-hand side of (10.9) is equal to 1.
10.2 Moral hazard and insurance 353
Let A = W1A ,W2A . Then it must be that E[UE (NI)] = E[UE (A)] and E[UE (A)] = E[UN (A)],
that is,
1 19 15
q q
48.0625 = W1A + W2A − and (10.10)
20 20 16
1 19 15 1 9
q q q q
W1A + W2A − = W1A + W2A . (10.11)
20 20 16 10 10
The solution is W1A = 972.66 and W2A = 2, 493.75; thus A is the contract with premium
2, 500 − 2, 493.75 = $6.25 and deductible 2, 493.75 − 972.66 = $1, 521.09.
Suppose that the individual is offered contract A and she breaks her indifference by
purchasing the contract. What is the insurer’s expected profit from this contract?
With contract A the individual is indifferent between Effort and No-effort. Thus expected
profit will be
1
◦ h − pE (` − d) = 6.25 − 20 (1, 600 − 1, 521.09) = $2.31 if the individual chooses
Effort.
1
◦ h − pN (` − d) = 6.25 − 10 (1, 600 − 1, 521.09) = $ − 1.641 if the individual chooses
No-effort.
Let us find the full-insurance contract, call it F, that makes the individual indifferent
between purchasing the contract and not insuring.
We saw above that, without insurance, the individual can achieve a level of utility of
48.0625 (by opting for Effort). On the other hand, with any full-insurance contract, the
individual will maximize√ her utility by choosing No-effort. Thus we are looking for a level
of wealth W such that W = 48.0625. The solution is W = 2, 310. Thus F = (2310, 2310),
that is, a contract with premium 2, 500 − 2, 310 = $190 and zero deductible.
Suppose that the individual is offered contract F and she breaks her indifference by
purchasing the contract. What is the insurer’s expected profit from this contract?
1
Expected profit will be h − pN ` = 190 − 10 1600 = $30.
In the above example, an insurer would be better off offering contract F than contract
A. We now turn to the issue of what contract(s) would be offered by a monopolist.
Wealth in
good state
W2
NI 45o line
W0
indifference curve
with Effort
indifference curve
with No-effort
Wealth in
W1 bad state
0 W0 − `
Figure 10.5: The reservation utility locus is the union of the two thick, continuous curves.
10.2 Moral hazard and insurance 355
Let us begin by considering the first segment of the reservation utility locus, namely
the portion of the indifference curve through NI corresponding to Effort, from point NI
to point A (where, as before, A is the contract that yields the reservation utility no matter
whether the individual chooses Effort or No-effort). It is shown in Figure 10.6.
Wealth in
good state
W2
isoprofit line
pE
NI slope: − 1−p 45o line
W0 E
A
indifference curve
with Effort
Wealth in
W1 bad state
0 W0 − `
contract B cannot be profit-maximizing for the insurer, since there are points on this portion
of the reservation utility locus that lie below the isoprofit line through B and thus yield
higher profits than B.
Thus we conclude that, of all the points on the portion of the reservation utility locus
considered so far, point A represents the profit-maximizing contract.
Let us now turn to the other portion of the reservation utility locus, namely the segment
of the indifference curve through A corresponding to No-effort from point A to the 45o
line, shown in Figure 10.7, where the point of intersection between the indifference curve
and the 45o line is denoted by F.
Wealth in
good state
W2
NI 45o line
W0
C isoprofit line
pN
F slope: − 1−p
N
indifference curve
with No-effort
Wealth in
W1 bad state
0 W0 − `
Note that if the individual purchases a contract that lies on this portion of the reservation
utility locus then she will choose No-effort. In fact, since contract A lies on the indifference
curve corresponding to Effort and utility level ū = E[UE (NI)], any point to the right of A
on this portion of the reservation utility locus will be lie on an Effort-indifference curve
corresponding to a utility level less than ū (while it lies on the No-effort-indifference curve
corresponding to ū).7
Since, for any of the contracts being considered, the individual will choose No-effort,
the relevant probability of loss is pN and thus the relevant isoprofit lines are those with
pN
slope − 1−p N
. Figure 10.7 shows one such isoprofit line, namely the one that goes through
point C. Recall from Chapter 5 (Section 5.1.4) that at any point above the 45o line the
No-effort-indifference curve through that point is steeper than the pN -isoprofit-line. Thus
contract C cannot be profit-maximizing for the insurer, since there are points on this portion
of the reservation utility locus that lie below the isoprofit line through C and thus yield
7 As for point A we continue to assume that the individual would choose Effort.
10.2 Moral hazard and insurance 357
Let A = W1A ,W2A and F = W F ,W F so that the corresponding premia and de-
ductibles are
premium deductible
Contract A : hA = W0 −W2A dA = W2A −W1A
Contract F : hF = W0 −W F dF = 0
Furthermore, let πA be the expected profit from contract A and πF be the expected profit
from contract F, that is,
πA = hA − pE (` − dA )
πF = hF − pN `.
Then we can conclude from the above analysis that the monopolist will offer
• contract A if πA > πF ,
• contract F if πF > πA
In the numerical example considered at the end of the previous section, the monopolist
would opt for the full-insurance contract F, since πF = 30 and πA = 2.31 (as shown on
page 353).
Now we give an example where the monopolist prefers to offer the partial-insurance
contract A. Let us keep the same data as in the example considered at the end of the
1 1
previous section, but change the value of pE from 20 to 40 . Thus we have that the vNM
utility-of-money function is
( √
UN (m) = m if she chooses No-effort
√ 15
UE (m) = m − 16 if she chooses Effort
1 1
W0 = 2, 500 ` = 1, 600 pE = pN =
40 10
q q
1
48.5625 =
40 W1 + 40 W2A − 15
A 39
16
q q
1 9
W1 + 10 W2A
A
48.5625 =
10
The solution is W1A = 1, 392.22 and W2A = 2, 481.29, that is, A = (1392.22, 2481.29). Thus
hA = 2, 500 − 2, 481.29 = 18.71 and dA = 2, 481.29 − 1, 392.22 = 1, 089.07.
√
Contract F is given by the solution to W F = 48.5625, which is W F = 2, 358.32 so that
F = (2358.32, 2358.32) and hF = 2, 500 − 2, 358.32 = 141.68. Hence8
1
πA = 18.71 − (1, 600 − 1, 089.07) = $5.94
40
1
πF = 141.68 − (1, 600) = $ − 18.32
10
so that the monopolist would offer contract A, thereby inducing the individual to reduce
the probability of loss by choosing Effort.
What would the monopolist’s profit be if it offered the full-insurance contract G which
is given by the intersection of the 45o line and the indifference curve corresponding to
Effort that goes through the NI point? In this example contract G is obtained by solv-
√ 15
ing W F − 16 = 48.5625, that is, G = (2450.25, 2450.25) with corresponding premium
hG = 2, 500 − 2450.25 = 49.75. Thus one might be tempted to answer that the monopo-
1
list’s profit would be 49.75 − 40 (1, 600) = $9.75, but this answer is wrong! The individual
would be very happy to purchase contract G because her best choice would then be
√
No-effort, with an expected utility of 2, 450.25 = 49.5 (instead of 48.5625) and thus, as
1
a matter of fact, the monopolist’s profits would turn out to be πG = 49.75 − 10 (1, 600) =
$ − 110.25!
8 Recall the assumption that, with contract A, the individual would choose Effort: see Footnote 6 on page
355.
10.3 Exercises 359
10.3 Exercises
The solutions to the following exercises are given in Section 10.4 at the end of this chapter.
Exercise 10.1 Emily has an initial wealth of $80,000 and faces a potential loss of
$36,000. The probability of loss depends on the amount of effort she puts into trying to
avoid the loss. If she puts a high level of effort, then the probability is 5%, while if she
exerts low effort the probability is 15%. Her vNM utility-of-money function is
√
m if low effort
U(m) = √
m − 1 if high effort.
(a) If Emily remains uninsured, what level of effort will she choose?
(b) If Emily is offered a full insurance contract with premium $2,250 and she accepts
it, what level of effort will she choose?
(c) If Emily is offered a full insurance contract with premium $2,250 will she accept
it?
(d) What is the insurance company’s expected profit from a full insurance contract
with premium $2,250?
Exercise 10.2 Susan has an initial wealth of $10,000 and faces a potential loss of
$1,900. The probability of loss depends on the amount of effort she puts into trying to
1
avoid the loss. If she puts a high level of effort, then the probability is 10 , while if she
4
exerts low effort the probability is 10 . Her vNM utility-of-money function is
√
m if low effort
U(m) = √
m − 2 if high effort
(a) If Susan remains uninsured, what level of effort will she choose?
(b) If Susan is offered a partial insurance contract with premium $800 and deductible
$200 and she accepts it, what level of effort will she choose?
(c) If Susan is offered the contract of Part (b) will she accept it?
360 Chapter 10. Moral Hazard in Insurance
Exercise 10.3 Bob owns a house near Lake Tahoe. The house is worth $949,000.
He also has $1,000 in his bank account, so that his entire wealth is $950,000. The
1
probability that there will be a forest fire next year is 10 . If a forest fire occurs then the
house will incur damages equal to $400,000. However, by spending $x on protective
1
measures Bob can reduce the probability that the fire will reach the house from 10 to
1 x
10 − 15,000 . Thus the more he spends, the lower the probability. The most he can spend
is $1,000. Bob’s vNM utility of money function is U(m) = 10 ln(m).
(a) If Bob is not insured, which of the following four options will he choose (assuming
that these are the only options)?
(1) x = 0,
(2) x = $400,
(3) x = $750,
(4) x = $1, 000.
(b) If Bob is offered a full-insurance contract with premium h, what value of x will
he choose?
(c) Suppose that Bob is offered a full insurance contract with premium h = $40, 000.
Will he purchase it?
Exercise 10.4 In this exercise we consider the case where the individual’s effort has an
effect not on the probability of loss but on the size of the loss.
Mike’s initial wealth is $6,400 and he faces a potential loss with probability 14 . The
size of the loss depends on his choice of effort: it he chooses Effort then the loss is
`E = $471, while if he chooses No-effort then the loss is `N = $1, 216. Mike’s vNM
utility-of-money function is
√
UN (m) = m if he chooses No-effort
UE (m) = √m − 1 if he chooses Effort
Exercise 10.5 Consider again the information given in Exercise 10.2: Susan has an
initial wealth of $10,000 and faces a potential loss of $1,900; the probability of loss
depends on the amount of effort she puts into trying to avoid the loss; if she puts a high
1
level of effort, then the probability is 10 , while if she exerts low effort the probability is
4
10 ; her vNM utility-of-money function is
(√
m if low effort
U(m) = √
m− 2 if high effort
Exercise 10.6 In this exercise we consider a slightly different type of vNM utility-of-
money function, where instead of UE (m) = UN (m) − c we have that UE (m) = αUN (m)
with 0 < α < 1.
Tom has an initial wealth of $4,900 and faces a potential loss of $1,300; the probability
of loss depends on the amount of effort he puts into trying to avoid the loss; if he puts a
high level of effort, then the probability is 18 , while if he exerts low effort the probability
is 38 ; his vNM utility-of-money function is
(√
m if low effort
U(m) = √ with 0 < α < 1.
α m if high effort
(a) For what values of α would Tom choose high effort if uninsured?
(b) Let α = 0.9. What is Tom’s reservation level of utility?
(c) Let α = 0.98. What is Tom’s reservation level of utility?
(d) Assume that α = 0.98. Write two equations whose solution gives that insurance
contract, call it A, such that if Tom purchases this contract he gets his reservation
level of utility, no matter whether he chooses low effort or high effort.
(e) Continue to assume that α = 0.98. Find a full-insurance contract, call if F, such
that Tom is indifferent between not insuring and purchasing contract F.
(f) Continue to assume that α = 0.98. Describe in words how you would draw the
reservation utility locus for Tom.
362 Chapter 10. Moral Hazard in Insurance
Exercise 10.7 Carol has an initial wealth of $10,000 and faces a potential loss of
$4,000; the probability of loss depends on the amount of effort she puts into trying to
1
avoid the loss; if she puts a high level of effort, then the probability is 10 , while if she
1
exerts low effort the probability is 2 ; her vNM utility-of-money function is
m
10 ln( 1,000 ) if low effort
U(m) = with c > 0.
m
10 ln( 1,000 )− c if high effort
(a) For what values of c would Carol choose high effort if uninsured?
(b) Assume that c = 1.8. What insurance contract would a monopolist offer to Carol?
(c) Assume that c = 1.5. What insurance contract would a monopolist offer to Carol?
10.4 Solutions to Exercises 363
(a) If Susan is uninsured and chooses high effort, her expected utility is
p p
0.1 10, 000 − 1, 900 − 2 + 0.9 10, 000 − 2 = 97
(a) If Bob chooses to spend $x on preventive measures his expected utility – when not
insured – is
1 x
NI(x) = − 10 ln (950, 000 − 400, 000 − x)
10 15, 000
1 x
+ 1− − 10 ln (950, 000 − x).
10 15, 000
Since NI(0) = 137.096, NI(400) = 137.237, NI(750) = 137.361 and NI(1, 000) =
137.449, of the four options he will choose x = $1, 000.
(b) If he is fully insured at premium h then his utility, if he does not spend any money
on preventive measures, is 10 ln(950, 000 − h), while if he spends $x (with x > 0)
then his utility is less, namely 10 ln(950, 000 − h − x). Thus he will choose x = 0.
(c) As determined in Part (b), if he buys the full insurance contract with premium
$40, 000 then he will choose x = 0, so that his utility will be 10 ln(950, 000 −
40, 000) = 10 ln(910, 000) = 137.21. This is less than his expected utility if he
remains uninsured and spends $1, 000 on preventive measures, which is 137.449 as
calculated in Part (a). Thus he will not accept the contract.
(a) If Mike is uninsured and chooses No-effort then his expected utility is
1p 3p
E[UN (NI)] = 6, 400 − 1, 216 + 6, 400 = 78
4 4
while if he chooses Effort his expected utility is
1p 3p
E[UE (NI)] = 6, 400 − 471 + 6, 400 − 1 = 78.25.
4 4
Thus he will choose Effort, if not insured.
(b) With a full-insurance contract with premium h his utility if he chooses Effort is
√ √
6, 400 − h−1 while his utility if he chooses No-effort is higher, namely 6, 400 − h.
Thus he would choose No-effort (he does not care about the size of the loss, since he
gets fully reimbursed by the insurance company if the loss occurs).
(c) When the deductible is $471 (= `E , loss with Effort) it would not make sense for
Mike to purchase the contract and choose Effort, because he would be better off by
not insuring and choosing Effort (his wealth would be larger by an amount equal
to the premium). Thus he would compare expected utility from no insurance with
Effort (which was calculated to be 78.25 in Part (a)) with expected utility with the
contract and No-effort which is (recall that the premium is $80)
1p 3p
6, 400 − 80 − 471 + 6, 400 − 80 = 78.74.
4 4
Thus Mike would purchase the contract and choose No-effort.
10.4 Solutions to Exercises 365
(a) We saw in Part (a) of Exercise 10.2 that if Susan is uninsured and chooses low effort,
her expected utility is 96 and if she chooses high effort her expected utility is 97.
Thus her reservation level of utility is 97: she would not accept a contract that did
not give her an expected utility of at least 97.
(b) Let A = W1A ,W2A . The first equation says that A should lie on the indifference
The second equation says that contract A gives the same expected utility whether
Susan chooses high effort or low effort:
q q q q
A
0.4 W1 + 0.6 W2 = 0.1A A
W1 − 2 + 0.9 A
W2 − 2 .
The solution of these two equations is: W1A = 8, 649 and W2A = 9, 933.44. Thus A is
the contract with premium 10, 000 − 9, 933.44 = $66.56 and deductible
9, 933.44 − 8, 649 = $1, 284.44.
(c) We are looking for a level of wealth W such that
√
W = 97.
The solution is W = 9, 409. Thus F = (9409, 9409), that is, a contract with premium
10, 000 − 9, 409 = $591 and zero deductible.
(d) The reservation utility locus for Susan is the union of the following two curves:
(1) the portion of the indifference curve through NI corresponding to high effort
from NI to point A, followed by (2) the portion of the indifference curve through A
corresponding to low effort from point A to point F.
(c) The assumption is that α = 0.98. Since in this case α > 0.9636, Tom – when
uninsured – will choose high effort. Thus his reservation level of utility is 68.75(0.98)
= 67.375, as computed in Part (a).
1 7 3 5
q q q q
(0.98) W1A + (0.98) W2A = W1A + W2A
8 8 8 8
1 7
q q
(0.98) W1A + (0.98) W2A = 67.375.
8 8
The solution is W1A = 4, 088 and W2A = 4, 821.57.
1 W1 W2
2 10 ln 1,000 + 21 10 ln 1,000 = 20.715
10.4 Solutions to Exercises 367
1 W1 1 W2
2 10 ln 1,000 + 2 10 ln 1,000 = 21.015
Although, typically, the Agent cannot fully control the outcome, she can have some
influence on it by exerting sufficient effort. For example, while a lawyer cannot guarantee
that her client will win the lawsuit, she can make a victory more likely if she devotes a lot
time and effort to preparing the case. The lawyer’s behavior cannot be observed by the
client, but the outcome of the lawsuit is observed and thus the client might want to make
his payment to the lawyer dependent on the outcome, in order to provide the lawyer with
the incentive to work hard. Examples of payments that are contingent on the observed
outcome are: contingency fees for lawyers (if the client loses he owes nothing to the lawyer,
if he wins he owes a percentage of what he has been awarded), stock options for managers
(part of their salaries is in the form of a percentage of the profits of the firm), deductibles
for policy-holders, etc.
e Agent’s effort
The Agent’s reservation level of utility u is that level of utility that the Agent can obtain
with an “outside option”, that is, by not signing a contract with the Principal. Thus any
contract that is signed must give the Agent an expected utility of at least u.
11.2 Risk sharing under moral hazard 371
We shall assume that the Agent’s effort influences the outcome, in the sense that greater
effort increases the probability of better outcomes. Consider first the case where there are
only two outcomes: X1 and X2 (with X1 < X2 ), and two possible levels of effort: low-effort
eL and high effort eH . Let pL1 be the probability of the lower outcome (outcome X1 ) when
the Agent exerts low effort (so that the probability of the higher outcome is (1 − pL1 )) and
pH1 be the probability of the lower outcome (outcome X1 ) when the Agent exerts high effort
(so that the probability of the higher outcome is (1 − pH 1 )). Then the assumption is that
pL1 > pH
1.
UP (m) = m,
while the Agent is risk averse, for any level of effort, and dislikes effort:
√
UA (m, e) = m − e.
There are two possible levels of effort for the Agent: eL = 10 and eH = 12. The Agent is
currently employed elsewhere at a job that involves low effort eL and pays a salary of $169,
so that her reservation level of utility is
√
u = 169 − eL = 13 − 10 = 3. (11.1)
Let
X1 = $1, 500 and X2 = $9, 000
and suppose that, if a contract is signed and the Agent exerts low effort, then the probability
of the lower outcome is 80%, while if the Agent exerts high effort then the probability of
the lower outcome is 50%:
4 1
pL1 = and pH
1 = .
5 2
We know from Chapter 6 that when one party is risk neutral and the other is risk averse,
Pareto efficient risk sharing requires that all the risk be borne by the risk-neutral party;
in other words, the risk-averse party (in this case the Agent) ought to be given a fixed
income: w1 = w2 . However, this principle clashes with the need to provide the Agent
with an incentive
√ to exert high effort: if √
the Agent is given a sure payment of $w then her
utility is w − 10 is she choose eL and w − 12 is she choose eH and therefore she will
choose eL . The Principal realizes this and figures out that the best (from his point of view)
fixed-salary contract that he can offer the Agent is one that gives the Agent exactly her
reservation utility, that is, contract B = (w1 = 169, w2 = 169), in which case the Principal
will face the money lottery
$(1, 500 − 169) $(9, 000 − 169)
4 1
5 5
372 Chapter 11. Moral Hazard in Principal-Agent
E[UP (B)] = 45 (1, 500 − 169) + 15 (9, 000 − 169) = 2, 831. (11.2)
Consider now the following alternative contract, which exposes the Agent to risk:
C = (w1 = 100, w2 = 484). Will the Agent accept this contract and, if she does, what
level of effort will she exert? If the Agent exerts low effort, then she faces the money
lottery
$100 $484
4 1
5 5
which exceeds her reservation utility. Hence the Agent will accept contract C and exert
high effort.
Realizing this, the Principal knows that if he signs contract C then he will face the money
lottery
$(1, 500 − 484) $(9, 000 − 100)
1 1
2 2
E[UP (C)] = 12 (9, 000 − 484) + 21 (1, 500 − 100) = 4, 958. (11.4)
By (11.3) the Agent prefers contract C to contract B and by (11.2) and (11.4) also the
Principal prefers C to B, so that B is not Pareto efficient, despite the fact that it allocates
risk optimally. Hence, when moral hazard is present, the principles of optimal risk sharing
established in Chapter 6 no longer hold.
Let us now consider an example with three possible outcomes and three possible levels
of effort. As before, the Principal is risk neutral and the Agent has the following utility
11.2 Risk sharing under moral hazard 373
√
function: UA (m, e) = m − e with e ∈ {10, 12, 24}. The Agent’s reservation utility is 3.
The outcomes and corresponding probabilities are as follows:
1 1 1
probability if e = 10 : 2 4 4
1 1 1
probability if e = 12 : 4 4 2
1 1 5
probability if e = 24 : 8 4 8
• under contract D: 18 (196 − 0) + 41 (1, 000 − 200) + 58 (5, 000 − 1, 680) = 2, 299.5.
Thus both Principal and Agent prefer contract D to contract C so that the latter is not
Pareto efficient, despite the fact that it allocates risk optimally, according to the principles
established in Chapter 6.
What are the Pareto efficient contracts in a situation where there is moral hazard? We
will approach this question in several steps.
11.3 The case with two outcomes and two levels of effort
In this section we assume that there are only two outcomes: X1 and X2 (with X1 < X2 ), and
two possible levels of effort: low-effort eL and high effort eH . Let pL1 be the probability of
X1 when the Agent exerts low effort and pH 1 the probability of X1 when the Agent exerts
high effort. We assume that
pL1 > pH
1. (11.5)
We shall also assume that the Principal is risk neutral, so that his vNM utility-of-money
function can be taken to be the identity function:
UP (m) = m,
while the Agent is risk-averse and dislikes effort; in particular, we take her utility function
to be1
(
U(m) if e = eL
UA (m, e) = c > 0, U 0 (m) > 0, U 00 (m) < 0. (11.6)
U(m) − c if e = eH
The reader might have noticed the similarity between the situation under consideration
and the insurance context analysed in Chapter 10 (compare (10.2) and (10.3) to (11.6)).
Also in this case we can view a possible contract as a point in the positive quadrant of
the (w1 , w2 ) Cartesian plane and we can draw indifference curves for the Agent and the
Principal.
Let us begin with the Agent. As in the context studied in Chapter 10, there are two
indifference curves for the Agent that go through a point (ŵ1 , ŵ2 ):
one - corresponding to high effort eH - that joins all the contracts (w1 , w2 ) that yield
the same utility to the Agent as contract (ŵ1 , ŵ2 ) if the Agent chooses eH , and
one - corresponding to low effort eL - that joins all the contracts (w1 , w2 ) that yield
the same utility to the Agent as contract (ŵ1 , ŵ2 ) if the Agent chooses eL .
We shall call the indifference curve corresponding to eH the “H-indifference curve” and
the indifference curve corresponding to eL the “L-indifference curve”. As explained in
Chapter 10, at any point (w1 , w2 ),
pH U 0 (w1 )
− 1 H (11.7)
1 − p1 U 0 (w2 )
pL U 0 (w1 )
− 1 L (11.8)
1 − p1 U 0 (w2 )
1 One can start with UA (m, eL ) = V (m) − eL and UA (m, eH ) = V (m) − eH and then define U(m) = V (m) +
eL (recall from Chapter 3 that adding a constant is an “allowed transformation" of a vNM utility function) so
that UA (m, eL ) = U(m) and UA (m, eH ) = V (m) − (eH − eL ) and define c = eH − eL and thus obtain (11.6).
11.3 The case with two outcomes and two levels of effort 375
pL1 pH
1
L
> (11.9)
1 − p1 1 − pH
1
that is,
w2
(payment if X2 )
45o line
eH
eL
w1
(payment if X1 )
0
Figure 11.1: At point A, the indifference curve corresponding to high effort eH is less steep
than the indifference curve corresponding to low effort eL .
The 45o line represents the fixed-wage contracts, that is, contracts (w1 , w2 ) with
w1 = w2 .
Note that, in the case of insurance (Chapter 10), we restricted attention to contracts
above the 45o line, because - in that context - a contract below the 45o line could not be
interpreted as an insurance contract. On the other hand, in the Principal-Agent case, a
contract below the 45o line is a meaningful contract and should not be ruled out in principle.
However, by the following proposition, such contracts will not be observed in practice (if
the parties are rational) since they are Pareto inefficient.
Proposition 11.3.1 Any contract (w1 , w2 ) such that w1 > w2 is Pareto inefficient.
376 Chapter 11. Moral Hazard in Principal-Agent
= pL1 − pH L H
1 U(w1 ) − p1 − p1 U(w2 ) + c
= pL1 − pH
1 [U(w1 ) −U(w2 )] + c. (11.11)
By (11.5), pL1 − pH
1 > 0; by assumption, c > 0; by (11.10) and the fact that U(m) is
increasing in m, [U(w1 ) −U(w2 )] > 0. Thus (11.11) is positive, that is, E[UA (C, eL )] >
E[UA (C, eH )].
Graphically, consider contract C in Figure 11.2.
w2
(payment if X2 )
45o line
D
mL
B
mH C
wC2 eH
eL
w1
mH mL (payment if X1 )
0 wC1
Figure 11.2: E[UA (C, eH )] = E[UA (B, eH )] = U(mH ) − c; E[UA (C, eL )] = E[UA (B, eL )] =
U(mL ).
w2
(payment if X2 )
45o line
D
mL
C
wC2
eL
w1
0 mL wC1
(payment if X1 )
prefers E[C] for sure to lottery C, that is, UA (E[C], eL ) = U(E[C]) > E[UA (C, eL )] and,
since D is on the same indifference curve as C, E[UA (C, eL )] = E[UA (D, eL )] = U(mL ).
Thus U(E[C]) > U(mL ), which implies, since U(m) is increasing in m, that
On the other hand,! for the Principal contract D corresponds to the lottery
X1 − mL X2 − mL
whose expected value (and expected utility) is
pL1 1 − pL1
pL1 X1 + 1 − pL1 X2 − mL .
(11.14)
It follows from (11.12), (11.13) and (11.14) that the Principal strictly prefers contract D to
contract C. Thus contract C is Pareto inefficient.
w2
(payment if X2 )
eH eL 45o line
wC2 C
mH E
mL D
w1
mL mH (payment if X1 )
0 wC1
Figure 11.4: What level of effort will the Agent choose under contract C?
What level of effort will the Agent choose under contract C? The expected utility from
contract C under low effort eL is the same as with contract D under low effort and the latter
is U(mL ). The expected utility from contract C under high effort eH is the same as with
contract E under high effort and the latter is U(mH ) − c. Thus
11.3 The case with two outcomes and two levels of effort 379
w2
(payment if X2 )
eH eL 45o line
eH
eH
wC2 C
E
m̂ D̂
w1
(payment if X1 )
0 wC1 m̂
Figure 11.5: The two indifference curves through point D̂ and two more H-indifference
curves.
The H-indifference curve that goes through point D̂ (the continuous less steep curve)
corresponds to a level of utility less than û (namely U(m̂) − c = û − c < û). Consider
now the dashed H-indifference curve: it is above the one that goes through D̂ and thus
corresponds to higher utility. As we consider higher and higher H-indifference curves the
level utility keeps increasing. At some stage we must reach an H-indifference curve that
corresponds to utility û. Suppose it is the dotted curve in Figure 11.5. Let C = (wC1 , wC2 ) be
the point of intersection between this H-indifference curve and the L-indifference curve
that goes through D̂. Then it must be that E[UA (C, eH )] = E[UA (C, eL )] = û, that is,
pH C H C L C L C
1 U(w1 ) + 1 − p1 U(w2 ) − c = p1 U(w1 ) + 1 − p1 U(w2 ) = û.
380 Chapter 11. Moral Hazard in Principal-Agent
Once we reach point C, in order to keep utility constant at the value û we need to stop
traveling along the L-indifference curve and switch to the dotted H-indifference curve.
Thus the û-utility-locus if given by the union of two segments of two indifference curves,
just like in the case of insurance.
Figure 11.6 shows the û-utility-locus, where û = UA (m̂, eL ) = U(m̂), as the union of
the two thick curves.
wage if X2
w2
45o line
m̂ indifference curve
D̂ with high effort eH
indifference curve
with low effort eL
w1 wage if X1
0 m̂
Figure 11.6: The û-utility-locus (where û = UA (m̂, eL ) = U(m̂)) is the union of the two
thick curves.
What contracts on the û-utility-locus are Pareto efficient? We will show that the only
candidates are contracts C and D̂ in Figure 11.6.
So far we have focused on the indifference curves of the Agent. Now we turn to the
Principal, whom we have assumed to be risk neutral. Recall from Chapter 5 (Section 5.1.1)
that the indifference curves of a risk-neutral individual are downward-sloping straight lines
p
with slope − 1−p , where p is the probability of the outcome measured on the horizontal
axis. In the present context, there are two possible values of p: pL , if the Agent chooses
low effort eL , and pH , if the Agent chooses high effort eH , with pL > pH . Thus through
any contract C on or above the 45o line we can draw two straight-line indifference curves
for the Principal:
pL
• a steeper one with slope − 1−p L
, connecting all the contracts that give the same utility
to the Principal, conditional on the Agent choosing eL , and
pH
• a less steep one with slope − 1−p H
, connecting all the contracts that give the same
utility to the Principal, conditional on the Agent choosing eH .
11.3 The case with two outcomes and two levels of effort 381
Which indifference curve for the Principal is relevant at contract C depends on the choice
of effort of the Agent under contract C. Figure 11.7 shows the two indifference curves of
the Principal through a point C.
wage if X2
w2
45o line
if eH
if eL
w1 wage if X1
0
Figure 11.7: The two possible indifference curves of the Principal through contract C.
With an argument similar to the one used in Chapter 2 (Section 2.5), one can show that
1. given two contracts A and B, both of which would induce the Agent to choose
high effort eH , if B lies below the Principal’s indifference curve through A then the
Principal gets higher utility from B than from A, that is, he prefers B to A,
2. given two contracts D and E, both of which would induce the Agent to choose
low effort eL , if E lies below the Principal’s indifference curve through D then the
Principal gets higher utility from E than from D, that is, he prefers E to D.
The reader is asked to prove Point 1 in Exercise 11.6 and Point 2 in Exercise 11.7. The
above two claims are illustrated in Figures 11.8 and 11.9.
382 Chapter 11. Moral Hazard in Principal-Agent
wage if X2
w2
45o line
B if eH
w1 wage if X1
0
Figure 11.8: Assuming that the Agent would choose eH under both contract A and contract
B, the Principal prefers contract B to contract A.
wage if X2
w2
45o line
if eL
w1 wage if X1
0
Figure 11.9: Assuming that the Agent would choose eL under both contract D and contract
E, the Principal prefers contract E to contract D.
11.3 The case with two outcomes and two levels of effort 383
Let us focus on an L-indifference curve for the Agent and consider two points: point
F = (wF , wF ) lying on the 45o line and point D = (wD D o
1 , w2 ) lying above the 45 line (so
that wD D
1 < w2 ), and assume that, under contract D, the Agent would choose low effort eL .
By (11.8) (page 374), the slope of the Agent’s indifference curve at point F is
pL1 U 0 (wF ) pL1
− = − ;
1 − pL1 U 0 (wF ) 1 − pL1
thus, at point F, the Principal’s L-indifference curve (which is the relevant one, since we
know that at a point on the 45o line the Agent chooses low effort eL ) is tangent to the
Agent’s L-indifference curve. On the other hand, the slope of the Agent’s indifference at
point D is
pL U 0 (wD )
− 1 L 0 1D .
1 − p1 U (w2 )
Since wD D 0 00
1 < w2 and U (m) is decreasing in m (because U (m) < 0, by risk aversion),
U 0 (wD
1)
U 0 (wD
> 1 and thus
2)
pL1 U 0 (wD 1) pL1
>
1 − pL1 U 0 (wD
2) 1 − pL1
so that the Principal’s indifference curve through point D is less steep than the Agent’s
indifference curve through D (recall the assumption that, under contract D, the Agent
would choose eL ).
Figure 11.10 shows the relative slopes of the indifference curves of Principal and Agent
at a point above the 45o line (point D) and at a point on the 45o line (point F).
wage if X2
w2
eL 45o line
D
w1 wage if X1
0
Figure 11.10: The relative slopes of the indifference curves of Agent and Principal for the
case where the Agent chooses low effort eL .
384 Chapter 11. Moral Hazard in Principal-Agent
It follows from the above that a contract on an L-indifference curve of the Agent which
is above the 45o line (such as point D in Figure 11.10) is not Pareto efficient (as long
it yields higher utility to the Agent under low effort than under high effort): there are
contracts in the region between the two curves - such as contract E in Figure 11.10 -
that are better for both the Agent and the Principal. On the other hand, a contract at the
intersection of an L-indifference of the Agent and the 45o line (such as point F in Figure
11.10) is a candidate for Pareto efficiency.
Next we use a similar argument for contracts that are on an H-indifference curve of the
Agent and above the 45o line. Fix such a contract, call it A, and assume that under contract
A the Agent is better off with high effort eH than with low effort eL . Figure 11.11 shows
the relative slopes of the indifference curves of Principal and Agent at a point above the
45o line (point A) for a contract that induces the Agent to choose high effort eH . Since the
H-indifference curve of the Principal through point A is less steep than the H-indifference
curve of the Agent at point A, there are contracts between the two (such as point B in
Figure 11.11) that are better for both. Thus points such as point A in Figure 11.11 are not
Pareto efficient (given the hypothesis that under contract A the Agent would choose high
effort eH ).
wage if X2
w2
eH 45o line
A
B
w1 wage if X1
0
Figure 11.11: The relative slopes of the indifference curves of Principal and Agent at a
point above the 45o line (point A) for a contract that induces the Agent to choose high
effort eH .
11.3 The case with two outcomes and two levels of effort 385
We can now put together Figures 11.6 (reproduced below as Figure 11.12), 11.10 and
11.11 to complete the argument that, for any level of utility û of the Agent, there are only
two contracts on the û-utility-locus that are candidates for Pareto efficiency.
The top-left shaded area in Figure 11.12 consists of contracts that would induce the
Agent to choose high effort eH (since they are above the H-indifference curve for level
of utility û and to the left of the L-indifference curve for level of utility û), while the
bottom-right shaded area consists of contracts that would induce the Agent to choose low
effort eL (these contracts are above the L-indifference curve for level of utility û and below
the H-indifference curve for level of utility û). Thus the argument illustrated in Figure
11.11 rules out all the contracts on the H-indifference curve (for level of utility û) to the left
of point C (the first segment of the û-utility-locus) and the argument illustrated in Figure
11.10 rules out all the contracts on the L-indifference curve (for level of utility û) to the
right of point C (the second segment of the û-utility-locus) except for point D̂ on the 45o
line.
wage if X2
w2
eH 45o line
eL
m̂
D̂
w1 wage if X1
0 m̂
Figure 11.12: Copy of the û-utility-locus of Figure 11.6. The top-left shaded area consists
of contracts that would induce the Agent to choose high effort eH , while the bottom-right
shaded area consists of contracts that would induce the Agent to choose low effort eL .
R Recall that, at any contract F on the 45o line, the (straight-line) L-indifference curve
of the Principal is tangent to the (convex) L-indifference curve of the Agent (see
Figure 11.10). Hence any contract E that induces the Agent to choose low effort eL
and gives the Agent higher utility than F must lie above the Agent’s L-indifference
curve through F and thus must be worse for the Principal than F.
386 Chapter 11. Moral Hazard in Principal-Agent
From now on we will assume that, given a contract under which the Agent is indifferent
between choosing low effort eL and choosing high effort eH , the Agent will choose high
effort eH .2
Lemma 11.1 Let D̂ = (m̂, m̂) be a contract on the 45o line and let û be the utility of the
C C
D̂ (the Agent will choose eL under this contract and thus û = U(m̂)).
Agent under contract
Let C = w1 , w2 be a contract that gives utility û to the Agent no matter whether she
chooses eL or eH . Then
1. if D̂ P C (that is, the Principal prefers contract D̂ to contract C), then D̂ P E
(the Principal prefers D̂ to E) for every contract E 6= D̂ on the û-utility-locus of the
Agent,
2. if C P D̂ (that is, the Principal prefers contract C to contract D̂), then C P E (the
Principal prefers C to E) for every contract E 6= C on the û-utility-locus of the Agent.
Proof. .
1. Assume that D̂ P C. Pick an arbitrary contract E above the 45o line and on the
û-utility-locus of the Agent. By the Remark on page 385, if E induces the Agent
to choose eL then D̂ P E. Suppose, therefore, that E induces the Agent to choose
high effort eH . Then either E = C, and thus D̂ P E by hypothesis, or E 6= C, in
which case E lies on the first segment of the û-utility-locus of the Agent, to the left
of point C. Then, by the argument illustrated in Figure 11.11, C P E and thus, by
transitivity (since D̂ P C), D̂ P E.
2. Assume that C P D̂. Pick an arbitrary contract E 6= C above the 45o line and on the
û-utility-locus of the Agent. If E induces the Agent to choose high effort eH then
E lies on the first segment of the û-utility-locus of the Agent, to the left of point C;
then, by the argument illustrated in Figure 11.11, C P E. Suppose, therefore, that E
induces the Agent to choose low effort eL . Then E lies on the second segment of the
û-utility-locus of the Agent, to the left of point D̂. Then, by the argument illustrated
in Figure 11.10, D̂ P E and thus, by transitivity (since C P D̂), C P E.
Proposition 11.3.2 Let D̂ = (m̂, m̂), û and C be as in Lemma 11.1. Then
1. if D̂ P C then D̂ is the only Pareto efficient contract on the û-utility-locus of the
Agent,
2. if C P D̂ then C is the only Pareto efficient contract on the û-utility-locus of the
Agent,
3. if C ∼P D̂ (that is, the Principal is indifferent between C and D̂) then C and D̂ are
the only Pareto efficient contracts on the û-utility-locus for the Agent.
Proof. We already proved that, among the contracts on the û-utility-locus of the Agent, C
and D̂ are the only candidates for Pareto efficiency. It is clear that if the Principal prefers
D̂ to C then D̂ Pareto dominates C and thus C is not Pareto efficient, so that D̂ is the only
candidate for Pareto efficiency; similarly, if the Principal prefers C to D̂ then C Pareto
dominates D̂ and thus D̂ is not Pareto efficient, so that C is the only candidate for Pareto
2 Wemade the same assumption in Chapter 10. See the remarks in that section about this assumption: the
same considerations apply here.
11.3 The case with two outcomes and two levels of effort 387
efficiency.
Next we prove that if D̂ P C (the Principal prefers D̂ to C) then D̂ is Pareto efficient.
Suppose, by contradiction, that D̂ is not Pareto efficient. Then there exists a contract F 6= D̂
that is better than D̂ for one of the parties and at least as good as D̂ for the other party. It
cannot be that F ∼A D̂ (the Agent is indifferent between F and D̂) because then it must be
that F lies on the û-utility-locus of the Agent and F P D̂ (the Principal prefers F to D̂),
which contradicts Point 1 of Lemma 11.1. Thus it must be that F A D̂ (the Agent prefers
F to D̂) and
By (11.15) and the Remark on page 385, it must be that F induces the Agent to choose
high effort eH . Let E be the point at the intersection of the H-indifference curve of the
Agent corresponding to utility û and the H-indifference curve of the Principal through
pH
point F (the straight line with slope − 1−p1 H ). Then E ∼P F (the Principal is indifferent
1
between E and F) and, by Point 1 of Lemma 11.1, D̂ P E so that, by transitivity, D̂ P F,
contradicting (11.15).
Next we prove that if C P D̂ then C is Pareto efficient. Suppose, by contradiction, that C
is not Pareto efficient. Then there exists a contract F 6= C that is better than C for one of the
parties and at least as good as C for the other party. It cannot be that F ∼A C because then
it must be that F lies on the û-utility-locus of the Agent and F P C, which contradicts
Point 2 of Lemma 11.1. Thus it must be that F A C and
F %P C. (11.16)
Suppose first that contract F induces the Agent to choose low effort eL . Let E be the
point at the intersection of the L-indifference curve of the Agent corresponding to utility û
and the L-indifference curve of the Principal through point F (the straight line with slope
pL
− 1−p1 L ). Then E ∼P F and, by Point 2 of Lemma 11.1, C P E so that, by transitivity,
1
C P F, contradicting (11.16). Suppose, therefore, that contract F induces the Agent to
choose high effort eH . Let G be the point at the intersection of the H-indifference curve of
the Agent corresponding to utility û and the H-indifference curve of the Principal through
pH
point F (the straight line with slope − 1−p1 H ). Then G ∼P F and, by Point 2 of Lemma
1
11.1, C P G so that, by transitivity, C P F, contradicting (11.16).
The proof of Point 3 is along the same lines and we will omit it.
In Exercises 11.10-11.12 the reader is asked to characterize and draw (1) the set of
contracts that make the Principal indifferent between the Agent exerting low effort and
the Agent exerting high effort and (2) the set of contracts that make the Agent indifferent
between low effort and high effort.
388 Chapter 11. Moral Hazard in Principal-Agent
Secondly, if the Principal has all the bargaining power (because, for example,
there is “perfect competition” among possible Agents) and thus the Principal
is able to offer a contract that keeps the Agent at her reservation level of
utility, call it u, then the Principal will offer to the Agent that contract on
the u-utility-locus of the Agent that maximizes his own expected utility. By
Lemma 11.1, such a contract is either a fixed-wage contract or that incentive
contract that makes the Agent indifferent between choosing low effort and
choosing high effort (see Exercise 11.13).
possible outcomes: X1 , X2 , . . . , Xn (n ≥ 2)
levels of effort: e1 , e2 , . . . , em (m ≥ 2)
probability distribution
over {X1 , . . . , Xn } given ei : Pi
X1 < X2 < · · · < Xn
assumptions: e1 < e2 < · · · < em
Pm >FOC Pm−1 >FOC · · · >FOC P2 >FOC P1
where Pi >FOC Pj means that the probability distribution Pi dominates the probability
distribution Pj in the sense of first-order stochastic dominance (Chapter 4, Definition 4.4.1).
We will not characterize the solution to the general problem, since it is rather complex
and requires more advanced mathematical tools than we have employed so far (in particular,
the notion of constrained optimization).
We conclude this section with an example that shows that, when there are more than
two outcomes, a Pareto efficient incentive contract does not necessarily involve payments
that are an increasing function of the outcomes. Suppose that the Agent can choose
between a low level of effort eL and a high level of effort eH and, as usual, the Principal
cannot monitor the Agent’s effort. The Agent’s reservation utility is 10 and thus she will
not accept any contract that (with the optimal choice of effort) will give her a utility of less
than 10. The Principal is risk neutral while the Agent’s utility function is given by:
√
m if eL
UA (m, e) = √
m − 1 if e
H
There are three possible outcomes with the probabilities listed in the following table:3
Suppose that the Principal has all the bargaining power, so that he will only offer contracts
that give the Agent her reservation utility of 10. Among such contract that, furthermore,
induce the Agent to choose low effort eL the one that gives the Principal the largest expected
utility is the fixed-wage contract F = (w1 = 100, w2 = 100, w3 = 100) in which case the
Principal’s expected utility is 12 (100 −100)+ 21 (120 −100)+0(200 −100) = 10. Consider
3 Note that the probability distribution associated with eH dominates that associated with eL in the sense
of first-order stochastic dominance.
390 Chapter 11. Moral Hazard in Principal-Agent
now the following incentive contract C = (w1 = 81, w2 = 100, w3 = 163.84). Under this
contract the Agent’s expected utility is:
1
√ √
if she chooses eL : 2 81 + 12 100 = 9.5
2
√ 1
√ 1
√
if she chooses eH : 5 81 + 10 100 + 2 163.84 − 1 = 10.
Thus the Agent would accept contract C and exert high effort eH . With contract C the
1
Principal’s expected utility is 25 (100 − 81) + 10 (120 − 100) + 21 (200 − 163.84) = 27.68.
Thus the Agent is indifferent between F and C, while the Principal prefers C to F; hence C
Pareto dominates F.
Next we show that contract C is not the contract that maximizes the Principal’s ex-
pected utility among the contracts that induce the Agent to choose eH and give her an
expected utility of 10. To find such a contract, we need to solve the following constrained
maximization problem:
2 1 1
choose w1 , w2 , w3 to maximize 5 (100 − w1 ) + 10 (120 − w2 ) + 2 (200 − w3 )
= 152 − 52 w1 − 101
w2 − 12 w3
subject to
2√ 1√ 1√
5 w1 + 10 w2 + 2 w3 − 1 ≥ 10 (IR)
2√ 1√ 1√ 1√ 1√
5 w1 + 10 w2 + 2 w3 − 1 ≥ 2 w1 + 2 w2 . (IC)
(IR) is the individual rationality constraint for the Agent: it says that the proposed contract
must give the Agent an expected utility of at least 10 if she chooses high effort eH and
(IC) is the incentive compatibility constraint for the Agent: it says that, under the proposed
contract, the Agent cannot get higher expected utility by choosing low effort eL . We now
show that, at a solution to the above maximization problem, both constraints must be
satisfied as equalities. We will do so without explicitly employing the tools of constrained
maximization (the Kuhn-Tucker conditions).
First we argue that, at a solution of the maximization problem, the (IR) constraint must
be satisfied as an equality. Pick an arbitrary triple (w1 , w2 , w3 ) at which the (IR) constraint
is satisfied as a strict inequality and the (IC) constraint is also satisfied (possibly as an
equality). Now decrease the value of w2 by a small amount, so that the (IR) constraint is
still satisfied. Then, since the maximand, that is the function 152 − 52 w1 − 10 1
w2 − 12 w3 , is
decreasing in w2 , its value will increase. The only concern is that decreasing w2 might
lead to a violation of the (IC) constraint; however, if w2 is decreased, the left-hand-side
(LHS) of the (IC) constraint decreases less than the right-hand-side (RHS) of it and thus
the constraint will be preserved.4 Since we were able to increase the value of the maximand
without violating the constraints, the triple we started with could not have been a solution
to the maximization problem. Thus the (IR) constraint must be satisfied as an equality.
4 The 1 √1
derivative, with respect to w2 , of the LHS of (IC) is 10 2 w2 which is less than the derivative,
with respect to w2 , of the RHS of (IC) which is 12 2√1w2 .
11.4 The case with more than two outcomes 391
2 1 1
22 − 45 y1 − 15 y2 − 1 ≥ 1 1
5 y1 + 10 y2 + 2 2 y1 + 2 y2 (IC)
The next step in the argument is to show that if the (IC) constraint is satisfied as
2
a strict inequality then the function 152 − 25 (y1 )2 − 101
(y2 )2 − 12 22 − 45 y1 − 15 y2 is not
maximized. This part of the proof is developed in Exercise 11.15.
Thus the (IC) constraint must be satisfied as an equality, that is, it must be that
2 1 1 4 1
− 1 = 12 y1 + 12 y2
5 y1 + 10 y2 + 2 22 − 5 y1 − 5 y2
which simplifies to
y2 = 20 − y1 (11.18)
Using (11.18) we can reduce the maximization problem to the following unconstrained
one:
choose y1 to maximize
2
152 − 52 (y1 )2 − 10
1
(20 − y1 )2 − 12 22 − 54 y1 − 15 (20 − y1 )
74 17 2
= 5 y1 − 25 y1 − 50.
Let f (y1 ) = 74 17 2 0 74 34
5 y1 − 25 y1 − 50; then f (y1 ) = 5 − 25 y1 and setting this equal to zero and
solving we obtain y1 = 185 185 155
17 ; from this and (11.18) we get that y2 = 20 − 17 = 17 ; finally,
replacing these two values into y3 = 22 − 45 y1 − 15 y2 we get that y3 = 19517 . Thus, since
√
yi = wi we get the following solution to the original maximization problem:
185 2 155 2 195 2
w1 = 17 = 118.43, w2 = 17 = 83.13, w3 = 17 = 131.57.
Under contract D the Agent is paid the least if the intermediate outcome ($120) obtains.
If the outcome turns out to be $120 then she is paid approximately $83, while if the
outcome turns out to be lower ($100) then she is paid more: approximately $118! The
reason is that from a low outcome of $100 one cannot infer that the Agent did not work
hard: it is almost as likely that she exerted high effort as it is that she exerted low effort. On
the other hand, the intermediate outcome ($120) is much more likely if the Agent chooses
low effort than if she chooses high effort, hence it makes sense to associate a relatively low
payment to that outcome.
11.5 Exercises
The solutions to the following exercises are given in Section 11.6 at the end of this chapter.
11.5.1 Exercises for Section 11.2: Risk sharing under moral hazard
Exercise 11.1 A risk-neutral Principal wants to hire an Agent to run her firm. The
Agent’s utility depends on two things: money and effort. Denote effort by e and assume
that it can take on only two values: eL (for low) and eH (for high). Let the Agent’s
utility function be given as follows (where m denotes money)
√
90 + m − 9 if e = eL
UA (m, e) = √
90 + m − 10 if e = e .
H
Assume that if they don’t sign a contract they both get a utility of zero. The Agent’s
effort cannot be observed by the Principal and thus cannot be made part of the contract.
There are only three possible outcomes (profit levels): $ − 100, $100, $500. If the Agent
works hard, better outcomes are more likely than if the Agent does not work hard:
Exercise 11.2 Mr. O owns a firm. He can either run the firm himself or hire Ms. M
to run it for him. If he runs the firm himself he gets a utility of 0. If he hires Ms. M,
he will not be able to check whether she works hard or not. Under the management of
Ms. M, the firm’s profit levels and corresponding probabilities would be as follows (e
denotes effort, eL is low effort and eH is high effort):
Mr. O is risk neutral. Ms. M is currently unemployed and her current utility is 0; her
utility function is as follows (m denotes money and e denotes effort):
(
m − 8 if e = eL
UA (m, e) =
m − 10 if e = eH
11.5.2 Exercises for Section 11.3: Two outcomes and two levels of effort
Exercise 11.3 Suppose that there are only two outcomes, X1 = $2, 000 and
X2 = $5, 000, and two levels of effort: low effort eL and high effort eH . The Prin-
cipal is risk neutral, while the Agent’s utility function is given by
(√
m if e = eL
UA (m, e) = √
m − 4 if e = eH .
w2
(payment if X2 )
45o line
600 C
E
D
w1
(payment if X1 )
0 210
Exercise 11.4 Suppose that there are only two outcomes, X1 and X2 (with X1 < X2 ),
and two levels of effort: low effort eL and high effort eH . The Principal is risk neutral,
while the Agent’s utility function is given by
(
ln(m) if e = eL
UA (m, e) =
ln(m) − 1 if e = eH .
Exercise 11.5 Suppose that there are only two outcomes, X1 = $1, 000 and X2 =
$1, 500, and two levels of effort: low effort eL and high effort eH . The Principal is risk
neutral, while the Agent’s utility function is given by (with a > 0)
(√
m if e = eL
UA (m, e) = √
m − a if e = eH .
Exercise 11.6 Let A and B be two contracts above the 45o line such that the Agent
would choose high effort eH under either contract. Prove that, if B lies below the
Principal’s indifference curve through A, then the Principal gets higher utility from B
than from A, that is, he prefers B to A.
Exercise 11.7 Let D and E be two contracts above the 45o line such that the Agent
would choose low effort eL under either contract. Prove that, if E lies below the
Principal’s indifference curve through D, then the Principal gets higher utility from E
than from D, that is, he prefers E to D.
Exercise 11.8 Suppose that there are only two outcomes, X1 = $1, 000 and
X2 = $1, 500, and two levels of effort: low effort eL and high effort eH . The Prin-
cipal is risk neutral, while the Agent’s utility function is given by
(√
m if e = eL
UA (m, e) = √
m − 1 if e = eH .
The probability of X1 is 12 if the Agent chooses eL and 25 if the Agent chooses eH . Assume
that, if a contract makes the Agent indifferent between choosing eL and choosing eH ,
then the Agent chooses eH .
(a) Find the fixed-wage contract, call it D, that gives the Agent a utility equal to 24.
(b) Find the contract, call it C, that (1) makes the Agent is indifferent between
choosing eL and choosing eH and (2) gives the Agent an expected utility of 24.
(c) Is contract D Pareto efficient?
(d) Is contract C Pareto efficient?
Exercise 11.9 Repeat Exercise 11.8 with the same data with one exception: the proba-
1
bility of X1 is 3 if the Agent chooses eH .
Exercise 11.10 .
(a) What properties must contract C = (w1 , w2 ) satisfy in order for the Principal to
be indifferent between the Agent exerting low effort eL and the Agent exerting
high effort eH ?
(b) For every w1 ≥ 0, let w2 = f (w1 ) be such that the contract (w1 , f (w1 )) makes the
the Principal indifferent between the Agent exerting low effort eL and the Agent
exerting high effort eH . Find the expression for the function f (w1 ).
(c) Draw the graph of the function w2 = f (w1 ) in the (w1 , w2 ) plane and identify two
regions: (1) the region where the Principal prefers eL , (2) the region where the
Principal prefers eH .
11.5 Exercises 397
Exercise 11.11 Suppose that there are only two outcomes: X1 and X2 , with X1 < X2 ,
and two levels of effort: low effort eL and high effort eH . The Agent’s utility function is
given by (where m denotes money, e denotes effort and c ≥ 1 is a constant)
(
ln(m) if e = eL
UA (m, e) =
ln(m) − c if e = eH .
(b) Draw the graph of the function w2 = g(w1 ) and identify two regions:
(1) the region where the Agent prefers eL ,
(2) the region where the Agent prefers eH .
Exercise 11.12 Suppose that there are only two outcomes: X1 and X2 , with X1 < X2 ,
and two levels of effort: low effort eL and high effort eH . The Agent’s utility function is
given by (where m denotes money, e denotes effort, c is a positive constant, U 0 (m) > 0,
U 00 (m) < 0)
(
U(m) if e = eL
UA (m, e) =
U(m) − c if e = eH
Exercise 11.13 Ann’s current fixed salary is $16,900. She is bored with her job and
she will accept any new employment that does not make her worse off relative to
her current situation. Peter is thinking of starting a business and hiring an agent to
run it for him. Peter will not be able to monitor the agent, but knows that the agent
can either be lazy or work hard. There are two possible levels of income from the
business: $22,000 and $36,000. If the agent is lazy the probability of the income being
$22,000 is 50% while if the agent works hard it is 40%. Peter likes money (prefers to
have more rather than less) and is risk neutral. Ann also likes money, but she is risk
averse; furthermore, she dislikes effort. Her vNM utility function is as follows (where
m denotes money and(√e effort: eL means that she is lazy and eH means that she works
m if e = eL
hard): UA (m, e) = √ . What contract will Peter offer to Ann?
m − 3 if e = eH
11.5.3 Exercises for Section 11.4: The case with more than two outcomes
Exercise 11.14 A Principal is considering hiring an Agent to run his firm. The Principal
will not be able to monitor the Agent’s effort, which can be either low eL or high eH .
There are three possible outcomes with probabilities that are affected by the Agent’s
level of effort, as shown in the following table:
and her reservation utility is 20. Consider the following contract (where wi is the
payment to the Agent if the outcome is Xi ) C = (w1 = $484, w2 = $900, w3 = $961).
(a) If offered contract C, will the Agent accept it?
(b) What level of effort will the Agent choose if offered contract C?
(c) Show that contract C is not Pareto efficient, by constructing an alternative contract
D that Pareto dominates C. [Hint: outcome X2 is evidence that the Agent chose
low effort, thus the value of w2 can be used to induce the Agent to choose high
effort.]
11.5 Exercises 399
Exercise 11.15 Consider the maximization problem given on page 391, namely
choose y1 , y2 to maximize
2
f (y1 , y2 ) = 152 − 25 (y1 )2 − 10
1
(y2 )2 − 12 22 − 45 y1 − 15 y2
subject to
2 1 1
22 − 54 y1 − 15 y2 − 1 ≥ 1 1
5 y1 + 10 y2 + 2 2 y1 + 2 y2 (IC)
The objective of this exercise is to show that, if the (IC) constraint is satisfied as a strict
inequality, then the function f (y1 , y2 ) is not maximized. This can be done by showing
that at every point (y1 , y2 ) such that y1 ≥ 0, y2 ≥ 0 and y1 + y2 < 20 (the shaded area in
Figure 11.14) either ∂∂y f (y1 , y2 ) > 0 or ∂∂y f (y1 , y2 ) > 0 (or both), so that the value
1 2
of the function can be increased by increasing either y1 or y2 , until the (IC) constraint
becomes an equality.
∂
(a) Calculate the partial derivative ∂ y1 f (y1 , y2 ) and show in the (y1 , y2 ) plane the
∂
region where ∂ y1 f (y1 , y2 ) > 0.
∂
(b) Calculate the partial derivative ∂ y2 f (y1 , y2 ) and show in the (y1 , y2 ) plane the
∂
region where ∂ y2 f (y1 , y2 ) > 0.
(c) Verify that the shaded area shown in Figure 11.14 is entirely contained in the
union of the two regions identified in Parts (a) and (b).
y2
20
y1
0 20
Figure 11.14: The shaded area is the set of points (y1 , y2 ) such that y1 ≥ 0, y2 ≥ 0 and
y1 + y2 < 20.
400 Chapter 11. Moral Hazard in Principal-Agent
choose eH , that is, she will work hard. The Principal can figure this out and will realize
that his expected utility from this contract is: 14 (−100) + 41 (100) + 12 (500 − 100) = 200.
Thus contract C is Pareto superior to contract B, despite the fact that it does not allocate
risk optimally.
Thus the Agent will choose eL and her expected utility is 47.5.
(d) E[UP (C)] = 14 (2, 000 − 1, 600) + 43 (5, 000 − 2, 500) = 1, 975.
For the Principal, the expected utility of A is the same as the expected utility of A0
(since they belong to the same indifference curve) and the latter is equal to EH [X] − mA0 ,
where EH [X] = pH H
1 X1 + 1 − p1 X2 ; similarly, the expected utility of B is the same as
0
the expected utility of B (since they belong to the same indifference curve) and the latter
is equal to EH [X] − mB0 . Thus, by (11.19), the expected utility of B0 is greater than the
expected utility of A0 ; therefore, the expected utility of B is greater than the expected utility
of A.
402 Chapter 11. Moral Hazard in Principal-Agent
For the Principal, the expected utility of D is the same as the expected utility of D0 (since
they belong to the same indifference curve) and the latter is equal to EL [X] − mD0 , where
EL [X] = pL1 X1 + 1 − pL1 X2 ; similarly, the expected utility of E is the same as the expected
utility of E 0 (since they belong to the same indifference curve) and the latter is equal to
EL [X] − mE 0 . Thus, by (11.20), the expected utility of E 0 is greater than the expected utility
of D0 ; therefore, the expected utility of E is greater than the expected utility of D.
2 √ 3 √
5 ( w1 − 1) + 5 ( w2 − 1) = 24
Thus D P C. Hence, by Proposition 11.3.2, D is the only Pareto efficient contract on the
utility locus of the Agent corresponding to a utility of 24. Thus
(c) Yes, contract D is Pareto efficient.
(d) No, contract C is not Pareto efficient.
1 √ 2 √
3 ( w1 − 1) + 3 ( w2 − 1) = 24
In order to answer Parts (c) and (d) let us see how the Principal ranks the two contracts C
and D. As calculated in the previous exercise, E[UP (D)] = 674, while
Thus C P D. Hence, by Proposition 11.3.2, C is the only Pareto efficient contract on the
utility locus of the Agent corresponding to a utility of 24. Thus
(c) No, contract D is not Pareto efficient.
(d) Yes, contract C is Pareto efficient.
X1 − w1 = X2 − w2 . (11.21)
w2 = X2 − X1 + w1
w2
(payment if X2 )
eL ∼P eH
45o line
eL P eH
eH P eL
X2 − X1
w1
(payment if X1 )
0
Figure 11.15: The straight line is the set of contracts that make the Principal indifferent
between the Agent choosing low effort eL and the Agent choosing high effort eH . For
contracts above the line the Principal is better off if the Agent chooses low effort and for
contracts below the line the Principal is better off if the Agent chooses high effort.
404 Chapter 11. Moral Hazard in Principal-Agent
(a) A contract (w1 , w2 ) makes the Agent indifferent between eL and eH if and only if
c
ln(w2 ) − ln(w1 ) = .
pL1 − pH
1
c
Note that, since c ≥ 1 and 0 < pL1 − pH 1 < 1, pL −pH > 1. From the above equation
1 1
we get that (e denotes the irrational number 2.71828...)
c c
ln(w2 )−ln(w1 ) pL H w2 pL −pH
e =e 1 −p1 that is =e 1 1 .
w1
c
L H
Thus g(w1 ) = e p1 −p1 w1 .
w2 eL ∼A eH
(payment if X2 )
45o line
eH A eL
eL A eH
c
pL H
1 −p1
e w1
(payment if X1 )
0
Figure 11.16: The straight line is the set of contracts that make the Agent indifferent
between choosing low effort eL and choosing high effort eH . For contracts above the line
the Agent is better off choosing high effort and for contracts below the line the Agent is
better off choosing low effort.
11.6 Solutions to Exercises 405
that is,
c
U(wA2 ) −U(wA1 ) = . (11.22)
pL1 − pH
1
Let B = wB1 , wB2 be a contract close to A that also makes the Agent indifferent between
c
U(wB2 ) −U(wB1 ) = . (11.23)
pL1 − pH
1
We want to calculate the slope of the segment that joins A and B. Since B is close to A,
U(wB1 ) ' U(wA1 ) +U 0 (wA1 ) wB1 − wA1 , (11.24)
U(wB2 ) ' U(wA2 ) +U 0 (wA2 ) wB2 − wA2 . (11.25)
that is,
rise
z }| {
wB2 − wA2 U 0 (wA1 )
= 0 A ,
wB1 − wA1 U (w2 )
| {z }
run
U 0 (wA )
that is, the slope of the curve w2 = g(w1 ) at point A is U 0 (w1A ) . Since point A is above the 45o
2
line, wA1 < wA2 and thus (since U 0 (m) is decreasing because U 00 (m) < 0), U 0 (wA1 ) > U 0 (wA2 )
U 0 (wA )
so that U 0 (w1A ) > 1. Hence the curve is increasing with a slope greater than 1 everywhere.
2
406 Chapter 11. Moral Hazard in Principal-Agent
2 √ 3 √
5 ( w1 − 3) + 5 ( w2 − 3) = 130.
The solution is w1 = 13, 225 and w2 = 21, 025. Thus C = (13225, 21025). It only remains
to check which of C and D gives higher utility to Peter.
Peter’s expected utility from D is: 12 22000 + 12 36000 − 16900 = 12, 100.
Peter’s expected utility from C is: 25 (22000 − 13225) + 35 (36000 − 21025) = 12, 495.
Thus Peter will offer Ann the contingent contract C = (13225, 21025).
(a) Since the Agent can achieve a utility of at least 25.5 and her reservation utility is 20,
she will accept contract C.
(b) Under contract C the Agent will choose low effort, since it gives her higher utility
than high effort.
(c) Since the middle outcome X2 can only occur if the Agent chooses low effort, the
Principal can provide an incentive to the Agent to choose high effort by setting
w2 = 0. Let us keep the value w1 = 484 and find a contract with w2 = 0 that
gives
√ the Agent a utility of 26.25 if she chooses high effort, by solving the equation
1 1√
2 484 + 2 w3 − 1 = 26.25. The solution is w3 = 1, 056.25. Now consider contract
D = (w1 = 484, w2 = 0, w3 = 1056.25). The Agent’s expected utility under contract
D is:
√ p
if low effort eL : 12 484 + 14 1, 056.25 = 19.125
√ p
if high effort eH : 12 484 + 12 1, 056.25 − 1 = 26.25.
Thus the Agent chooses high effort eH . The Principal’s expected utility is:
1 1 1
with C : 2 (1000 − 484) + 4 (2000 − 900) + 4 (3000 − 961) = 1, 042.75
1 1
with D : 2 (1000 − 484) + 2 (3000 − 1056.25) = 1, 229.875.
∂ 1
∂ y1
f (y1 , y2 ) = 25 (440 − 36y1 − 4y2 ) .
∂
Thus ∂ y1 f (y1 , y2 ) > 0 if and only if y2 < 110 − 9y1 . The set of points (y1 , y2 ) such
∂
that ∂ y1 f (y1 , y2 ) > 0 is shown as the shaded area in Figure 11.17.
y2
110
y1
0 110
9
20
Figure 11.17: The shaded area is the set of points (y1 , y2 ) such that y2 < 110 − 9y1
(equivalently, such that ∂∂y f (y1 , y2 ) > 0). (The drawing is not accurate in terms of scale,
1
because we wanted the coordinates to be spaced sufficiently apart to be readable.)
∂ 1
∂ y2
f (y1 , y2 ) = 25 (110 − 4y1 − 6y2 ) .
55
Thus ∂
∂ y2 f (y1 , y2 ) > 0 if and only if y2 < 3 − 23 y1 . The set of points (y1 , y2 ) such
∂
that ∂ y2 f (y1 , y2 ) > 0 is shown as the shaded area in Figure 11.18.
408 Chapter 11. Moral Hazard in Principal-Agent
y2
20
55
3
15
y1
0 5 20 27.5
Figure 11.18: The shaded area is the set of points (y1 , y2 ) such that y2 < 55 2
3 − 3 y1 (equiva-
lently, such that ∂∂y f (y1 , y2 ) > 0). (The drawing is not accurate in terms of scale, because
2
we wanted the coordinates to be spaced sufficiently apart to be readable.)
(c) The shaded area in Figure 11.19 shows the union of the two shaded areas of Figures
11.17 and 11.18 which contains the set of points below the line of equation y2 =
20 − y1 . Algebraically, this can be seen as follows. The point of intersection
between the line of equation y2 = 110 − 9y1 and the line of equation y2 = 20 − y1 has
coordinates (y1 = 11.25, y2 = 8.75); from y1 = 0 to y1 = 11.25 the area under the
line of equation y2 = 20 − y1 is entirely contained within the area under the line of
equation y2 = 110 − 9y1 (that is, for y1 ≤ 11.25, if y2 < 20 − y1 then y2 < 110 − 9y1 ).
The point of intersection between the line of equation y2 = 55 2
3 − 3 y1 and the line
of equation y2 = 20 − y1 has coordinates (y1 = 5, y2 = 15); from y1 = 5 to y1 = 20
the area under the line of equation y2 = 20 − y1 is entirely contained within the area
under the line of equation y2 = 55 2
3 − 3 y1 (that is, for y1 ≥ 5, if y2 < 20 − y1 then
y2 < 55 2
3 − 3 y1 ).
y2
110
20
55
3
y1
0 110
9
20 55
2
Figure 11.19: The shaded area shows the union of the two shaded areas of Figures 11.17
and 11.18 which contains the set of points below the line of equation y2 = 20 − y1 .
12. Glossary
Asymmetric information
A situation where one of the two parties to a potential transaction has valuable
information that is not available to the other party.
Deductible
The amount, specified in an insurance contract, for which the insured is liable on
each loss (so that, in case of loss, the insurance provider will reimburse only an
amount equal to the difference between the loss and the deductible).
Full insurance
An insurance contract with zero deductible.
Incentive contract
A contractual arrangement according to which payments to one of the parties
are based, at least in part, on the observed outcome (taken as an indicator of
performance).
410 Chapter 12. Glossary
Index
An observable characteristic (such as race or gender) that cannot be changed.
Indifference curve
A graphical representation of the set of alternatives (e.g. contracts) that an individ-
ual considers to be just as good as some specified alternative (thus the individual
is indifferent among all of them).
Lottery
A list of possible outcomes with corresponding probabilities.
Money lottery
A lottery where the possible outcomes are sums of money.
Monopoly
A industry consisting of only one firm.
Pareto efficiency
A situation X is Pareto efficient if there is no other feasible situation Y such that
everybody considers Y to be at least as good as X and at least one person strictly
prefers Y to X.
Partial insurance
An insurance contract with positive deductible.
Premium
The amount paid by the holder of an insurance policy for coverage under the
contract.
Principal-Agent relationship
A contractual relationship where one party (the Principal) hires another party (the
Agent) to perform a task whose outcome is typically affected by external factors
that are not under the control of either party.
Risk aversion
One of the possible attitudes to risk, where the individual considers a money
lottery to be worse than the expected value of the lottery for sure.
Risk neutrality
One of the possible attitudes to risk, where the individual considers a money
lottery to be just as good as the expected value of the lottery for sure.
411
Risk love
One of the possible attitudes to risk, where the individual considers a money
lottery to be better than the expected value of the lottery for sure.
Risk sharing
The distribution of risk between the parties to a contract (typically a Principal-
Agent relationship) by means of a payment scheme.
Separating equilibrium
A situation where individuals are presented with a menu of choices and different
types of individuals make different choices, thereby separating themselves from
the other types.
Signal
An observable characteristic (such as an educational certificate) which is available
to, and can be chosen by, everybody.
Signaling
The phenomenon by which some types of individuals expend resources on a signal
in order to credibly convey information about themselves, since other types find it
too costly to imitate the signaling activity.
Index
A D
adverse selection . . . . . . . . . . . . . . 227, 322 dominance
affine transformation . . . . . . . . . . . . . . . . . 61 first-order stochastic . . . . . . . . . 94, 389
Allais paradox . . . . . . . . . . . . . . . . . . . . . . 68 second-order stochastic . . . . . . . . . . 95
Arrow-Pratt measure . . . . . . . . . . . . . . . . . 89
asymmetric information . . . . . . . . 227, 299 E
attitude to risk
risk aversion . . . . . . . . . . . . . . . . . . . . 17 Edgeworth box . . . . . . . . . . . . . . . . . . . . . 167
risk love . . . . . . . . . . . . . . . . . . . . . . . . 17 Ellsberg paradox . . . . . . . . . . . . . . . . . . . . 70
risk neutrality . . . . . . . . . . . . . . . . . . . 17 expected utility . . . . . . . . . . . . . . . . . . . . . . 57
expected value . . . . . . . . . . . . . . . . . . . . . . 16
B
F
basic outcome . . . . . . . . . . . . . . . . . . . . . . . 55
fair odds line . . . . . . . . . . . . . . . . . . . . . . . . 28
belief . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 229
revision . . . . . . . . . . . . . . . . . . . . . . . . 231
H
updating . . . . . . . . . . . . . . . . . . . . . . . 229
hidden
C action . . . . . . . . . . . . . . . . . . . . . 343, 344
features. . . . . . . . . . . . . . . . . . . . . . . .236
CARA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93 type . . . . . . . . . . . . . . . . . . . . . . . . . . . 227
certainty equivalent . . . . . . . . . . . . . . . . . . 19
concavity . . . . . . . . . . . . . . . . . . . . . . . . . . . 82 I
conditional probability . . . . . . . . . . . . . . 229
convexity . . . . . . . . . . . . . . . . . . . . . . . . . . . 83 incentive compatibility constraint . . . . 264
credit rationing . . . . . . . . . . . . . . . . . . . . . 242 index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 305
CRRA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93 indifference curve . . . . . . . . . . . . . . . . . . 114
414 INDEX
V
von Neumann-Morgenstern
ranking . . . . . . . . . . . . . . . . . . . . . . . . . 57
utility function . . . . . . . . . . . . . . . . . . 57
W
warranty. . . . . . . . . . . . . . . . . . . . . . . . . . .322
wealth diagram . . . . . . . . . . . . . . . . . . . . . . 22