Subject Name: Operations Research Subject Code: 10CS661 Prepared By: Sindhuja K Department: CSE
Subject Name: Operations Research Subject Code: 10CS661 Prepared By: Sindhuja K Department: CSE
OPERATIONS RESEARCH
Subject Code:
10CS661
Prepared By:
Sindhuja K
Department:
CSE
05/19/20
UNIT VII-
Game Theory, Decision Analysis
05/19/20 2
Objective
05/19/20 3
Game Theory
05/19/20 4
Basic Terms used in game theory
05/19/20 6
Two person zero – sum game
• Terms used
– Pay off matrix: The representation of gains and
losses resulting from different actions of the
competitors is represented in the form of a matrix.
– Value of game: It is the expected outcome of the
player when all the players of the game follow
their optimum strategy.
– Fair game: Value of the game is zero.
05/19/20 7
Formulation of Two person zero – sum
game
B1 B2 ……… Bn
A1 a11 a12 ……… a1n
A2 a21 a22 …........ a2n
. .
. .
Am am1 am2 ………. amn
05/19/20 8
Formulation of Two person zero – sum
game
• A1,A2,…..,Am are the strategies of
player A
• B1,B2,…...,Bn are the strategies of
player B
• aij is the payoff to player A (by B) when
the player A plays strategy Ai and B plays
Bj (aij is –ve means B got |aij| from A)
05/19/20 9
Example
Consider the game of the odds and evens. This game consists of two
players A,B, each player simultaneously showing either of one finger or
two fingers. If the number of fingers matches, so that the total number for
both players is even, then the player taking evens (say A) wins Rs.1 from
B (the player taking odds). Else, if the number does not match, A pays
Rs.1 to B. Thus the payoff matrix to player A is the following table:
05/19/20 10
Optimum Solution
• A game can be solved by using the following three methods,
based on the nature of the problem.
– Saddle point concept/Max-min and Min max principle
• Games without saddle point
– Dominance rule
– Graphical method.
05/19/20 11
Min- Max and Max-Min principle
• Max –Min : A row(winning) payer will select the maximum
out of the minimum gains.
05/19/20 12
B1 B2 B3 B4 Row min
A1 8 6 2 8 2
A2 8 9 4 (SP) 5 4
7 5 3 5 3
A3
05/19/20 14
Solution (cont..)
• Now if B plays strategy 1, then whatever A plays, he will lose
a maximum of 8. Similarly for strategies 2,3,4. (These are the
maximum of the respective columns). Thus to minimize this
maximum loss, B should play strategy 3.
• and 4 = max (row minima)
• = min (column maxima)
• is called the value of the game.
• 4 is called the saddle-point.
05/19/20 15
Dominance Rule
• Definition: A strategy is dominated by a second strategy if
the second strategy is always at least as good (and sometimes
better) regardless of what the opponent does. Such a
dominated strategy can be eliminated from further
consideration.
• The following rules of dominance is used reduce the sixe of
the matrix
– Row dominance
– Column dominance
– Modified row dominance- Average of rows
– Modified column dominance- Average of columns
05/19/20 16
Example
• Thus in our example (below), for player A, strategy A3 is dominated by
the strategy A2 and so can be eliminated.
• Eliminating the strategy A3 , we get the
B1 B2 B3 B4
A1 8 6 2 8
A2 8 9 4 5
7 5 3 5
A3
05/19/20 17
Cont..
• following reduced payoff matrix:
• Now , for player B, strategies B1, B2, and B4 are dominated by the
strategy B3.
• Eliminating the strategies B1 , B2, and B4 we get the reduced payoff
matrix:
B1 B2 B3 B4
A1 8 6 2 8
A2 8 9 4 5
05/19/20 18
Cont..
• following reduced payoff matrix:
• Now , for player A, strategy A1 is dominated by the strategy A2.
• Eliminating the strategy A1 we thus see that A should always play A2 and B
always B3 and the value of the game is 4 as before.
B3
A1 2
A2
4
05/19/20 19
Example
The following game gives A’s payoff. Determine p,q that
will make the entry (2,2) a saddle point.
B1 B2 B3
Row min
A1 1 q 6
min(1,q)
A2 p 5 10
min(p,5)
A3 6 2 3
2
Col max max(p,6) max(q,5) 10
Since (2,2) must be a saddle point,
p5
05/19/20
q5 20
Example
Specify the range for the value of the game in the following case
assuming that the payoff is for player A.
B1 B2 B3 Row min
A1 3 6 1 1
A2 5 2 3 2
A3 4 2 -5 -5
Col max 5 6 3
Thus max( row min) <= min (column max)
The game has no saddle point. Thus the value of the game lies between 2 and 3.
05/19/20 21
Games without saddle point(mixed
strategy)
• No pure strategy or no saddle point exists.
• The optimal mix for each player may be determined by
assigning each strategy a probability of it being chosen.
Thus these mixed strategies are probabilistic combinations
of available better strategies and these games hence called
Probabilistic games.
• The probabilistic mixed strategy games without saddle points
are commonly solved by any of the following methods
– Analytical Method
– Graphical Method
– Simplex Method
05/19/20 22
Analytical Method
• A 2x2 game without saddle point can be solved using following formula.
05/19/20 23
Formula
05/19/20 24
Example
• Solve the following game and determine its value
05/19/20 25
05/19/20 26
Graphical Method : Solution of 2 x n and
m x 2 Games
• 2 x n and m x 2 Games : When the player A, for example, has
only 2 strategies to choose from and the player B has n, the
game shall be of the order 2 x n, whereas in case B has only
two strategies available to him and A has m strategies, the
game shall be a m x 2 game.
05/19/20 27
Example
• Solve the following using graphical method.
05/19/20 28
Algorithm for solving 2 x n matrix games
• Draw two vertical axes 1 unit apart. The two lines are x1 = 0, x1 =
1
• Take the points of the first row in the payoff matrix on the
vertical line x1 = 1 and the points of the second row in the
payoff matrix on the vertical line x1 = 0.
• The point a1j on axis x1 = 1 is then joined to the point a2j
on the axis x1 = 0 to give a straight line. Draw ‘n’ straight
lines for j=1, 2… n and determine the highest point of the lower
envelope obtained. This will be the maximin point.
• The two or more lines passing through the maximin point
determines the required 2 x 2 payoff matrix. This in turn gives
the optimum solution by making use of analytical method.
05/19/20 29
Example
• Solve using graphical method
05/19/20 30
Solution
05/19/20 31
Cont..
• V = 66/13
• SA = (4/13, 9 /13)
• SB = (0, 10/13, 3 /13
05/19/20 32
Algorithm for solving m x 2 matrix games
• Draw two vertical axes 1 unit apart. The two lines are x1 =0, x1 =
1
• Take the points of the first row in the payoff matrix on the
vertical line x1 = 1 and the points of the second row in the
payoff matrix on the vertical line x1 = 0.
• The point a1j on axis x1 = 1 is then joined to the point a2j
on the axis x1 = 0 to give a straight line. Draw ‘n’ straight
lines for j=1, 2… n and determine the lowest point of the upper
envelope obtained. This will be the minimax point.
• The two or more lines passing through the minimax point
determines the required 2 x 2 payoff matrix. This in turn gives
the optimum solution by making use of analytical method.
05/19/20 33
Example
05/19/20 34
Solution
05/19/20 35
Cont..
• V = 3/9 = 1/3
• SA = (0, 5 /9, 4/9, 0)
• SB = (3/9, 6 /9)
05/19/20 36
Decision Analysis
05/19/20 37
Decision Making without Experimentation
• Goferbroke Company owns a tract of land that may contain oil
• Consulting geologist: “1 chance in 4 of oil”
• Offer for purchase from another company: $90k
• Can also hold the land and drill for oil with cost $100k
• If oil, expected revenue $800k, if not, nothing
Payoff
Alternative Oil Dry
Drill for oil
Sell the land
Chance 1 in 4 3 in 4
05/19/20 38
Notation and Terminology
• Actions: {a1, a2, …}
– The set of actions the decision maker must choose from
– Example:
• Prior distribution:
– Distribution representing the relative likelihood of the possible states of nature.
05/19/20 39
Decision Making Criteria
– Maximin payoff
– Minimax regret
– Maximum likelihood
– Bayes’ decision rule (expected value)
05/19/20 40
Maximin Payoff Criterion
• For each action, find minimum payoff over all states of nature
• Then choose the action with the maximum of these minimum payoffs
State of Nature
Min
Action Oil Dry
Payoff
05/19/20 41
Minimax Regret Criterion
• For each action, find maximum regret over all states of nature
• Then choose the action with the minimum of these maximum regrets
State of Nature
Action Oil Dry
Drill for oil 700 -100
Sell the land 90 90
Prior probability 0.25 0.75
05/19/20 43
Bayes’ Decision Rule
(Expected Value Criterion)
• For each action, find expectation of payoff over all states of nature
• Then choose the action with the maximum of these expected payoffs
State of Nature
Expected
Action Oil Dry
Payoff
05/19/20 44
Sensitivity Analysis with
Bayes’ Decision Rule
• What is the minimum probability of oil such that we choose to drill the
land under Bayes’ decision rule?
State of Nature
Expected
Action Oil Dry
Payoff
05/19/20 45
Decision Making with Experimentation
State of Nature
Action Oil Dry
Drill for oil 700 -100
Sell the land 90 90
Prior probability 0.25 0.75
05/19/20 46
Posterior Probabilities
• Do experiments to get better information and improve estimates for the
probabilities of states of nature. These improved estimates are called
posterior probabilities.
• Experimental Outcomes: {x1, x2, …}
Example:
• Cost of experiment:
Example:
• Posterior Distribution: P( = k | X = xj)
05/19/20 47
Goferbroke Example (cont’d)
• Based on past experience:
• If there is oil, then
• the probability that seismic survey findings is USS = 0.4 =
P(USS | oil)
• the probability that seismic survey findings is FSS = 0.6 =
P(FSS | oil)
• If there is no oil, then
• the probability that seismic survey findings is USS = 0.8 =
P(USS | dry)
• the probability that seismic survey findings is FSS = 0.2 =
P(FSS | dry)
05/19/20 48
Bayes’ Theorem
• Calculate posterior probabilities using Bayes’ theorem:
Given P(X = xj | = k), find P( = k | X = xj)
P(X x j | k ) P( k )
P( k | X x j )
P(X x
i
j | i ) P( i )
05/19/20 49
Goferbroke Example (cont’d)
Optimal policies
• If finding is USS: State of Nature
Expected
Action Oil Dry
Payoff
05/19/20 51
Expected Value of Perfect Information
• Suppose we know the true state of nature. Then we will pick
the optimal action given this true state of nature.
State of Nature
Action Oil Dry
Drill for oil 700 -100
Sell the land 90 90
Prior probability 0.25 0.75
05/19/20 52
Expected Value of Experimentation
• We are interested in the value of the experiment. If the value
is greater than the cost, then it is worthwhile to do the
experiment.
• Expected Value of Experimentation:
EVE = E[EI] – E[OI]
where E[EI] is expected value with experimental information.
05/19/20 53
Decision Tree
• Tool to display decision problem and relevant computations
• A decision tree consists of 3 types of nodes:-
1. Decision nodes - commonly represented by squares
2. Chance nodes - represented by circles
3. End nodes - represented by triangles/ellipses
• A decision tree has only burst nodes (splitting paths) but no
sink nodes (converging paths) on a decision tree branch
05/19/20 54
Decision Tree Example
05/19/20 55
Analysis Using Decision Trees
• Start at the right side of tree and move left a column at a time.
For each column, if chance fork, go to (2). If decision fork, go
to (3).
• At each chance fork, calculate its expected value. Record this
value in bold next to the fork. This value is also the expected
value for branch leading into that fork.
• At each decision fork, compare expected value and choose
alternative of branch with best value. Record choice by putting
slash marks through each rejected branch.
• Comments:
• This is a backward induction procedure.
• For any decision tree, such a procedure always leads to an optimal solution.
05/19/20 56
Decision Tree Advantages
• Are simple to understand and interpret
• Have value even with little hard data
• Possible scenarios can be added
• Worst, best and expected values can be determined for
different scenarios
• Use a white box model. If a given result is provided by a
model
• Can be combined with other decision techniques
• e.g. Net Present Value calculations
05/19/20 57