MiA T3 DynamicGames
MiA T3 DynamicGames
Matthew Embrey
University of Sussex
Autumn 2023
2 2
L′ R′ L′ R′
1
L R
2 2
L′ R′ L′ R′
1
L R
2 2
L′ R′ L′ R′
Need to specify a choice for P2 for every node; e.g., (L’, L’) means
choose L’ should P1 choose L and L′ should P1 choose R.
P2
(L’, L’) (L’, R’) (R’, L’) (R’, R’)
P1 L
R
1
L R
y
2 2
L′ R′ L′ R′
(3,1) (1, 2) (2, 1) (0, 0)
1
L R
y
2 2
L′ R′ L′ R′
(3,1) (1, 2) (2, 1) (0, 0)
1
L R
y
2 2
L′ R′ L′ R′
(3, 1) (1,2) (2, 1) (0, 0)
1
L R
y
2 2
L′ R′ L′ R′
(3, 1) (1,2) (2, 1) (0, 0)
1
L R
%
2 2
L′ R′ L′ R′
(3, 1) (1, 2) (2,1) (0, 0)
1
L R
%
2 2
L′ R′ L′ R′
(3, 1) (1, 2) (2, 1) (0,0)
1
L R
%
2 2
L′ R′ L′ R′
(3, 1) (1, 2) (2,1) (0, 0)
1
L R
%
2 2
L′ R′ L′ R′
(3, 1) (1, 2) (2, 1) (0,0)
1
P2
L’ R’ L R
P1 L (3,1) (1,2) 2 2
R (2,1) (0,0) L′ R′ L′ R′
1
L R
2 2
L′ R′ L′ R′
Start at the end of the tree - at any ultimate node, the player will
pick an action that yields the highest payoff to herself.
2 2
L′ R′ L′ R′
(3, 1) (1, 2) (2, 1) (0, 0)
Start at the end of the tree - at any ultimate node, the player will
pick an action that yields the highest payoff to herself.
1
L R
%
2 2
R′ L′
(1, 2) (2, 1)
Start at the end of the tree - at any ultimate node, the player will
pick an action that yields the highest payoff to herself.
These payoffs become the relevant payoffs for choices at the
penultimate nodes and one can work backwards to solve for the
backward induction outcome(s).
1
L R
%
2 2
L′ R′ L′ R′
(3, 1) (1, 2) (2,1) (0, 0)
Start at the end of the tree - at any ultimate node, the player will
pick an action that yields the highest payoff to herself.
These payoffs become the relevant payoffs for choices at the
penultimate nodes and one can work backwards to solve for the
backward induction outcome(s).
Every game of complete and perfect information (information sets are
singleton nodes) with a finite number of nodes can be solved by
backward induction.
Matthew Embrey (University of Sussex) Microeconomic Analysis (806L1) Autumn 2023 6 / 23
Subgame Perfect Nash Equilibrium
Subgames
1
L R
2 2
L′ R′ L′ R′
1
L R
2 2
L′ R′ L′ R′
1
L R
2 2
L′ R′ L′ R′
P2
(L’, L’) (L’, R’) (R’, L’) (R’, R’)
P1 L (3, 1) (3, 1) (1, 2) (1, 2)
R (2, 1) (0, 0) (2, 1) (0, 0)
The pure strategy Nash equilibrium are (R, (R’,L’)) and (L, (R’,R’)).
Recall that the backward induction outcome was (R, L’). The SPNE
is (R, (R’,L’)).
What about (L, (R’,R’))? It is a NE. Here player 2 plans to play R’ if
player 1 chose L, and also if player 1 chose R. But is that credible? If
player 1 did chose R, player 2’s best response is L’.
(1, 0) 2
L R
(0, 1) 1
t b
(4, 0) (2, 2)
Cooperate Defect
Cooperate (1, 1) (-1, 2)
Defect (2, -1) (0, 0)
(Y,Y) and (Z,Z) are the two pure strategy NE. Consider a 2 period
repeated game (no discounting):
Strategy: play (X,X) in period 1 and (Y,Y) in period 2. Punish a
deviation in period 1 by playing (Z,Z) in period 2.
(Y,Y) and (Z,Z) are the two pure strategy NE. Consider a 2 period
repeated game (no discounting):
Strategy: play (X,X) in period 1 and (Y,Y) in period 2. Punish a
deviation in period 1 by playing (Z,Z) in period 2.
If a player sticks to strategy, she gets 3 + 1 = 4.
If she deviates to playing Y in period 1, she knows that the other player
will punish her by playing Z in period 2, and so she will also play Z.
Payoff=4 − 1 = 3 so deviation is unprofitable.
The threat of (Z,Z) in the 2nd period supports cooperation.
Matthew Embrey (University of Sussex) Microeconomic Analysis (806L1) Autumn 2023 13 / 23
Repeated games
Infinitely repeated Prisoner’s Dilemma
Cooperate Defect
Cooperate (1, 1) (-1, 2)
Defect (2, -1) (0, 0)
Is (s ∗ , s ∗ ) SPNE?
Cooperate Defect
Cooperate (1, 1) (-1, 2)
Defect (2, -1) (0, 0)
Is (s ∗ , s ∗ ) SPNE?
First, show that cooperation is best response to cooperation to date.
Cooperate Defect
Cooperate (1, 1) (-1, 2)
Defect (2, -1) (0, 0)
Is (s ∗ , s ∗ ) SPNE?
First, show that cooperation is best response to cooperation to date.
Suppose no one has played D so far. If the other player is playing s ∗
1
Payoff from cooperation: 1 + δ + δ 2 + ... = 1−δ
Payoff from defection: 2 + 0 + 0 + ... = 2
1
Cooperation yields more if 1−δ ≥ 2 ⇒ δ ≥ 0.5
Cooperate Defect
Cooperate (1, 1) (-1, 2)
Defect (2, -1) (0, 0)
Is (s ∗ , s ∗ ) SPNE?
First, show that cooperation is best response to cooperation to date.
Suppose no one has played D so far. If the other player is playing s ∗
1
Payoff from cooperation: 1 + δ + δ 2 + ... = 1−δ
Payoff from defection: 2 + 0 + 0 + ... = 2
1
Cooperation yields more if 1−δ ≥ 2 ⇒ δ ≥ 0.5
Second, defection is best response to any defection in the past.
Cooperate Defect
Cooperate (1, 1) (-1, 2)
Defect (2, -1) (0, 0)
Is (s ∗ , s ∗ ) SPNE?
First, show that cooperation is best response to cooperation to date.
Suppose no one has played D so far. If the other player is playing s ∗
1
Payoff from cooperation: 1 + δ + δ 2 + ... = 1−δ
Payoff from defection: 2 + 0 + 0 + ... = 2
1
Cooperation yields more if 1−δ ≥ 2 ⇒ δ ≥ 0.5
Second, defection is best response to any defection in the past.
According to s ∗ , the other player will play D.
Then D is the best response.
Cooperate Defect
Cooperate (1, 1) (-1, 2)
Defect (2, -1) (0, 0)
Is (s ∗ , s ∗ ) SPNE?
First, show that cooperation is best response to cooperation to date.
Suppose no one has played D so far. If the other player is playing s ∗
1
Payoff from cooperation: 1 + δ + δ 2 + ... = 1−δ
Payoff from defection: 2 + 0 + 0 + ... = 2
1
Cooperation yields more if 1−δ ≥ 2 ⇒ δ ≥ 0.5
Second, defection is best response to any defection in the past.
According to s ∗ , the other player will play D.
Then D is the best response.
Therefore, (s ∗ , s ∗ ) is SPNE for δ ≥ 0.5.
Matthew Embrey (University of Sussex) Microeconomic Analysis (806L1) Autumn 2023 15 / 23
Repeated games
Folk theorem(s)