Lect26 4up
Lect26 4up
1 / 59 2 / 59
3 / 59 4 / 59
The grade game The grade game
5 / 59 6 / 59
7 / 59 8 / 59
The grade game Choose the side
9 / 59 10 / 59
• Which side of the road to drive on? • Which side of the road to drive on?
11 / 59 12 / 59
Choose the side Choose the side
• How do you choose when you don’t know what “everyone else” is
doing.
13 / 59 14 / 59
• Cake or Death?
15 / 59 16 / 59
Preferences over outcomes Preferences over outcomes
17 / 59 18 / 59
19 / 59 20 / 59
Utility functions Utility versus Money
• Note that:
uSimon ppie q ą uSimon pcake q
really means
uSimon ppie & X q ą uSimon pcake & X q
where X captures “everything else”. money
21 / 59 22 / 59
τ: looAc
moon ˆ looAc
moon ÑΩ
agent i’s action agent j’s action
23 / 59 24 / 59
Multiagent Encounters Multiagent Encounters
• Neither agent has any influence in this environment. • This environment is controlled by j.
25 / 59 26 / 59
• Suppose we have the case where both agents can influence the
outcome, and they have utility functions as follows:
C , C ľi C , D ąi D , C ľi D , D
27 / 59 28 / 59
Key question Choose the side
• How do we establish the rational choice for an agent? • Which side to drive on.
Best strategy
29 / 59 30 / 59
31 / 59 32 / 59
Payoff Matrices Solution Concepts
33 / 59 34 / 59
35 / 59 36 / 59
Dominant Strategies Dominant Strategies
37 / 59 38 / 59
• If we are lucky, we can eliminate enough strategies so that the choice • If we are lucky, we can eliminate enough strategies so that the choice
of action is obvious. of action is obvious.
• In general we aren’t that lucky.
39 / 59 40 / 59
Nash Equilibrium Nash Equilibrium
41 / 59 42 / 59
43 / 59 44 / 59
Nash Equilibrium Nash Equilibrium
i • More formally:
Y X A strategy pi ˚ , j ˚ q is a Nash equilibrium solution to the game pA , B q if:
Y 2 1
j 2 4 @i , ai ˚ ,j ˚ ě ai ,j ˚
X 4 3 @j , bi ˚ ,j ˚ ě bi ˚ ,j
1 3
• In a game like this you can find the NE by cycling through the
outcomes, asking if either agent can improve its payoff by switching
its strategy.
• Thus, for example, pX , Y q is not an NE because i can switch its payoff
from 1 to 2 by switching from X to Y .
45 / 59 46 / 59
47 / 59 48 / 59
Nash Equilibrium Mirowski on Nash Equilibrium
49 / 59 50 / 59
51 / 59 52 / 59
Pareto Optimality Pareto Optimality
53 / 59 54 / 59
55 / 59 56 / 59
Social Welfare Social Welfare
i
D C
D 2 1
j 2 1
• A very skewed outcome can maximise social welfare. C 3 9
3 0
57 / 59 58 / 59
Summary
59 / 59