0% found this document useful (0 votes)

39 views48 pages

L07 Adversarial Search

The document discusses adversarial search and algorithms like minimax and alpha-beta pruning that are used for game playing in artificial intelligence. It covers topics like minimax search, the minimax algorithm, alpha-beta pruning, and how the order of exploring nodes in the game tree impacts pruning.

Uploaded by

arabickathu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

39 views48 pages

L07 Adversarial Search

Uploaded by

arabickathu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 48

COL333/671: Introduction to AI

Semester I, 2022-23

Adversarial Search

Rohan Paul

1
Outline
• Last Class
• Constraint Satisfaction
• This Class
• Adversarial Search
• Reference Material
• AIMA Ch. 5 (Sec: 5.1-5.5)

2
Acknowledgement
These slides are intended for teaching purposes only. Some material
has been used/adapted from web sources and from slides by Doina
Precup, Dorsa Sadigh, Percy Liang, Mausam, Dan Klein, Anca
Dragan, Nicholas Roy and others.

3
Game Playing and AI
• Games: challenging decision-making
problems
• Incorporate the state of the other agent in
your decision-making. Leads to a vast
number of possibilities.
• Long duration of play. Win at the end.
• Time limits: Do not have time to compute
optimal solutions.

4
Games: Characteristics
• Axes: • Zero-Sum Games
• Players: one, two or more.
• Adversarial: agents have opposite
• Actions (moves): deterministic or
stochastic utilities (values on outcomes)
• States: fully known or not.

• Core: contingency problem

• The opponent’s move is not known ahead of time. A player must respond
with a move for every possible opponent reply.
• Output
• Calculate a strategy (policy) which recommends a move from each state.
5
Playing Tic-Tac-Toe: Essentially a search problem!

Terminal nodes we get -1, 0 or 1 for loss, tie or

win. Think of this value as a ”utility” of a state.

6
Slide adapted from Dan Klein and from Mausam
Single-Agent Trees

2 0 … 2 6 … 4 6
7
Computing “utility” of states to decide actions
Non-Terminal States:
Value of a state:
The best achievable
outcome (utility)
from that state

2 0 … 2 6 … 4 6
Terminal States:

8
Game Trees: Presence of an Adversary

-20 -8 … -18 -5 … -10 +4 -20 +8

The adversary’s actions are not in our control. Plan as a contingency considering all possible actions taken by the adversary.
Minimax Values
States Under Agent’s Control: States Under Opponent’s Control:

-8 -5 -10 +8

Terminal States:
Adversarial Search (Minimax)
• Consider a deterministic, zero-sum game
• Tic-tac-toe, chess etc.
• One player maximizes result and the other minimizes result.

• Minimax Search
• Search the game tree for best moves.
• Select optimal actions that move to a position with the highest minimax
value.
• What is the minimax value?
• It is the best achievable utility against the optimal (rational) adversary.
• Best achievable payoff against the best play by the adversary.
Minimax Algorithm
• Ply and Move Minimax values:
• Move: when action taken by both players. computed recursively
• Ply: is a half move.
5 max
• Backed-up value
• of a MAX-position: the value of the largest successor
• of a MIN-position: the value of its smallest successor. 2 5 min

• Minimax algorithm
• Search down the tree till the terminal nodes.
• At the bottom level apply the utility function.
8 2 5 6
• Back up the values up to the root along the search
path (compute as per min and max nodes)
Terminal values:
• The root node selects the action. part of the game
Minimax Example

3 2 2

3 12 8 2 4 6 14 5 2
Minimax Implementation

def max-value(state): def min-value(state):

initialize v = -∞ initialize v = +∞
for each successor of state: for each successor of state:
v = max(v, min-value(successor)) v = min(v, max-value(successor))
return v return v
Minimax Implementation
def value(state):
if the state is a terminal state: return the state’s utility
if the next agent is MAX: return max-value(state)
if the next agent is MIN: return min-value(state)

def max-value(state): def min-value(state):

initialize v = -∞ initialize v = +∞
for each successor of state: for each successor of state:
v = max(v, value(successor)) v = min(v, value(successor))
return v return v

Useful, when there are multiple adversaries.

Minimax Properties
• Completeness
• Yes

• Complexity
• Time: O(bm)
• Space: O(bm)

• Requires growing the tree till the

terminal nodes.
• Not feasible in practice for a game
like Chess.
Minimax Properties You: Cricle. Opponent: Cross

• Optimal
• If the adversary is playing optimally (i.e.,
giving us the min value)
• Yes
• If the adversary is not playing optimally MAX
(i.e., not giving us the min value)
• No. Why? It does not exploit the opponent’s
weakness against a suboptimal opponent). MIN

10 10 9 100

If min returns 9? Or 100?

Necessary to examine all values in the tree?

3 <=2 2

3 12 8 2 14 5 2
Alpha-Beta Pruning: General Idea
• General Configuration (MIN version)
• Consider computing the MIN-VALUE at some node n, MAX
examining n’s children
• n’s estimate of the childrens’ min is reducing.
MIN a
• Who can use n’s value to make a choice? MAX
• Let a be the best value that MAX can get at any choice
point along the current path from the root
• If the value at n becomes worse than a, MAX will not pick MAX
this option, so we can stop considering n’s other children
(any further exploration of children will only reduce the MIN n
value further)
Alpha-Beta Pruning: General Idea
• General Configuration (MAX version)
• Consider computing the MAX-VALUE at some node n, MIN
examining n’s children
• n’s estimate of the childrens’ min is increasing.
MAX b
• Who can use n’s value to make a choice? MIN
• Let b be the lowest (best) value that MIN can get at any
choice point along the current path from the root
• If the value at n becomes higher than b, MIN will not pick MIN
this option, so we can stop considering n’s other children
(any further exploration of children will only increase the MAX n
value further)
Pruning: Example
Pruning: Example

8 <=4
Pruning: Example
Pruning: Example

10
<=2

>=100 2
10
Alpha-Beta Implementation
α: MAX’s best option on path to root
β: MIN’s best option on path to root

def max-value(state, α, β): def min-value(state , α, β):

initialize v = -∞ initialize v = +∞
for each successor of state: for each successor of state:
v = max(v, value(successor, α, β)) v = min(v, value(successor, α, β))
if v ≥ β return v if v ≤ α return v
α = max(α, v) β = min(β, v)
return v return v
Alpha-Beta Pruning - Properties
1. Pruning has no effect on the minimax value at the root.
• Pruning does not affect the final action selected at the root.
2. A form of meta-reasoning (computing what to compute)
• Eliminates nodes that are irrelevant for the final decision.

26
Alpha-Beta Pruning – Order of nodes matters

3 <=2 2

3 12 8 2 14 5 2

27
Alpha-Beta Pruning – Order of nodes matters

3 <=2
<=2

3 12 8 2 2 5 14

28
Alpha-Beta Pruning - Properties
1. Pruning has no effect on the minimax value at the root.
• Pruning does not affect the final action selected at the root.
2. A form of meta-reasoning (computing what to compute)
• Eliminates nodes that are irrelevant for the final decision.
3. The alpha-beta search cuts the largest amount off the tree when we
examine the best move first
• However, best moves are typically not known. Need to make estimates.

29
Alpha-Beta Pruning – Order of nodes matters
If the nodes were indeed encountered as “worst
moves first” – then no pruning is possible

If the nodes were encountered as “best moves first”

– then pruning is possible

Note: In reality, we don’t know the ordering.

30
Slide adapted from Prof. Mausam
Alpha-Beta Pruning - Properties
1. Pruning has no effect on the minimax value at the root.
• Pruning does not affect the final action selected at the root.
2. A form of meta-reasoning (computing what to compute)
• Eliminates nodes that are irrelevant for the final decision.
3. The alpha-beta search cuts the largest amount off the tree when we
examine the best move first
• Problem: However, best moves are typically not known.
• Solution: Perform iterative deepening search and evaluate the states.
4. Time Complexity
• Best ordering - O(bm/2). Can double the search depth for the same resources.
• On average – O(b3m/4) if we expect to find the min or max after b/2 expansions.

31
Minimax for Chess Alpha-Beta for Chess

Slide adapted from Prof. Mausam

Cutting-off Search
MAX
4
• Problem (Resource costraint): MIN
-2 4
• Minimax search: full tree till the terminal nodes.
• Alpha-beta prunes the tree but still searches till the -1 -2 4 9 Evaluations
terminal nodes.
• We can’t search till the terminal nodes. Cut off

• Solution:
• Depth-limited Search (H-Minimax)
• Search only to a limited depth (cutoff) in the tree
• Replace the terminal utilities with an evaluation function
for non-terminal positions.

? ? ? ?
Terminal nodes
Evaluation Functions
• Evaluation functions score non-terminals in depth-limited search.
• Estimate the chances of winning.

• Ideal function: returns the actual minimax value of the position

• In practice: typically weighted linear sum of features:

• e.g. fi(s) = (number of pieces of type i), each weight wi etc.

Evaluation Functions and Alpha-Beta

• Evaluation functions are always imperfect.

• Value at a min-node will only keep going down. Once value of min-node lower than
better option for max along path to root, can prune

• Evaluation function as a guidance for pruning

• IF evaluation function provides upper-bound on value at min-node, and upper-bound already
lower than better option for max along path to root THEN can prune
Determining “good” node orderings
• The ordering of nodes helps alpha-beta pruning.
• Worst ordering O(bm). Best ordering O(bm/2).

• How to find good orderings

• Problem: we only know them when we evaluate the nodes.

• One approach – iterative deepening to determine

evaluations for nodes
• What if we can do iterative deepening to a certain depth. Use the
evaluation function at the set depth and then compute the values for the
nodes in the tree that is generated.
• Next time, use the evaluations of the previous search to order the nodes.
Use them for pruning.
• Use evaluations of the previous search for order.
Incorporating Chance: Expectimax Search
• When the result of an action is not known. max

• Incorporate a notion of chance

• Include chance nodes
• Unpredictable opponents: the ghosts move
randomly in Pacman.
• Explicit randomness: rolling dice by a player in a
game.

10 10 9 100
• Expectimax search:
• At chance nodes the outcome is uncertain
• Calculate the expected utilities: weighted average
(expectation) of children

37
Expectimax Search
def value(state):
if the state is a terminal state: return the state’s utility
if the next agent is MAX: return max-value(state)
if the next agent is EXP: return exp-value(state)

def max-value(state): def exp-value(state):

initialize v = -∞ initialize v = 0
for each successor of state: for each successor of state:
v = max(v, value(successor)) p = probability(successor)
return v v += p * value(successor)
return v
Expectimax Search
def exp-value(state):
initialize v = 0
for each successor of state: 1/2 1/6
p = probability(successor) 1/3
v += p * value(successor)
return v 5
8 24
7 -12

v = (1/2) (8) + (1/3) (24) + (1/6) (-12) = 10

Expectimax Search

3 12 9 2 4 6 15 6 0 3 12 9 2

Can we perform pruning?

Depth-Limited Expectimax
• Depth-limit can be applied in
Expectimax search.
• Use heuristics to estimate the
values at the depth limit.

Estimate of true
400 300 … expectimax value

492 362 …
Multiple players and other games
• Other games: non zero-sum, or multiple players

• Generalization of minimax:
• Terminals have utility tuples
• Node values are also utility tuples
• Each player maximizes its own component

1,6,6 7,1,2 6,1,2 7,2,1 5,1,7 1,5,2 7,7,1 5,2,5

“Games are to AI as grand prix is to automobile design”
Games viewed as an indicator of intelligence.

43
Probabilities (Recap)
• A random variable represents an event whose outcome is unknown
• A probability distribution is an assignment of weights to outcomes 0.25
• Example: Traffic on freeway
• Random variable: T = whether there’s traffic
• Outcomes: T in {none, light, heavy}
• Distribution: P(T=none) = 0.25, P(T=light) = 0.50, P(T=heavy) = 0.25
0.50
• Some laws of probability:
• Probabilities are always non-negative
• Probabilities over all possible outcomes sum to one

• As we get more evidence, probabilities may change:

• P(T=heavy) = 0.25, P(T=heavy | Hour=8am) = 0.60
• Methods for reasoning and updating probabilities later.
0.25
Expectations (Recap)
• The expected value of a function of a random variable is the average, weighted by
the probability distribution over outcomes

• Example: How long to get to the airport?

Time: 20 min 30 min 60 min

x + x + x 35 min
Probability: 0.25 0.50 0.25
Probabilities for Expectimax
• In expectimax search, we have a probabilistic model of
how the opponent (or environment) will behave in any
state
• Model could be a simple uniform distribution (roll a die)
• Model could be sophisticated and require a great deal of
computation. The model might say that adversarial actions
are likely.

• For now, assume each chance node magically comes

along with probabilities that specify the distribution
over its outcomes (later formal ways).
Utilities and Decision-making
• Utilities are functions from outcomes
(states of the world) to real numbers
that describe an agent’s preferences

• Providing utilities
Getting ice cream
• In a game, may be simple (+1/-1)
• Utilities summarize the agent’s goals
Get Single Get Double

• We specify the utilities for a task, let the

behaviour emerge from the action. Oops Whew!
Maximum Expected Utility

• Maximum expected utility (MEU) principle:

• Choose the action that maximizes expected
utility
• The agent can be in several states, each with a
probability distribution. Utilities map states to a
value. Compute the expectation.
• We try to build models that maximize the
expected utility.

Artificial Intelligence: Adversarial Search
No ratings yet
Artificial Intelligence: Adversarial Search
62 pages
Lec7 LU Su20
No ratings yet
Lec7 LU Su20
46 pages
Artificial Intelligence: Adversarial Search
No ratings yet
Artificial Intelligence: Adversarial Search
58 pages
L06 (Adversarial Search) Ori
No ratings yet
L06 (Adversarial Search) Ori
46 pages
Lecture 7
No ratings yet
Lecture 7
62 pages
Adversial Search
No ratings yet
Adversial Search
101 pages
AI Unit-III (1) (1)
No ratings yet
AI Unit-III (1) (1)
124 pages
Adveserial Search
No ratings yet
Adveserial Search
29 pages
ai_lect_05
No ratings yet
ai_lect_05
39 pages
Game AI
0% (1)
Game AI
19 pages
Chapter 3 - Searching-Part 3
No ratings yet
Chapter 3 - Searching-Part 3
64 pages
AI-UNIT-2 PPT
No ratings yet
AI-UNIT-2 PPT
135 pages
CSC-411-AI-lec6-Adversarial Search
No ratings yet
CSC-411-AI-lec6-Adversarial Search
38 pages
Adversarial Search PPT
No ratings yet
Adversarial Search PPT
49 pages
Games
No ratings yet
Games
41 pages
Adverserial Search
No ratings yet
Adverserial Search
36 pages
AI UNIT 3 (1)
No ratings yet
AI UNIT 3 (1)
138 pages
Lect3 PDF
No ratings yet
Lect3 PDF
67 pages
Ai 4
No ratings yet
Ai 4
25 pages
CC511 Week 4
No ratings yet
CC511 Week 4
57 pages
2.4 Adversarial Search
No ratings yet
2.4 Adversarial Search
29 pages
Minimax Search Algorithm: 3. Back-Up The Scores at Level D To Assign A Score To Each
No ratings yet
Minimax Search Algorithm: 3. Back-Up The Scores at Level D To Assign A Score To Each
57 pages
L06 Adversarial Search
No ratings yet
L06 Adversarial Search
66 pages
Lec03 Ai Chapter6 Adversarial Search and Game Playing Aima
No ratings yet
Lec03 Ai Chapter6 Adversarial Search and Game Playing Aima
52 pages
Ch 5 Adversarial Search
No ratings yet
Ch 5 Adversarial Search
20 pages
SP14 CS188 Lecture 6 -- Adversarial Search - print
No ratings yet
SP14 CS188 Lecture 6 -- Adversarial Search - print
31 pages
cs188 sp23 Lec09
No ratings yet
cs188 sp23 Lec09
47 pages
ITSC6121 Lecture 4 -- Game Trees I
No ratings yet
ITSC6121 Lecture 4 -- Game Trees I
38 pages
AAI Lecture 7 Sp 25
No ratings yet
AAI Lecture 7 Sp 25
51 pages
ai lecture-4
No ratings yet
ai lecture-4
37 pages
CS2201.7
No ratings yet
CS2201.7
56 pages
UNIT 3 AI Notes
No ratings yet
UNIT 3 AI Notes
21 pages
06 Minimax
No ratings yet
06 Minimax
53 pages
Unit 2 Adversial Search
No ratings yet
Unit 2 Adversial Search
36 pages
Lecture11_AdversarialSearch
No ratings yet
Lecture11_AdversarialSearch
74 pages
AI-unit-3
No ratings yet
AI-unit-3
54 pages
AI Lec03 Adversarial Search
No ratings yet
AI Lec03 Adversarial Search
38 pages
Game Playing
No ratings yet
Game Playing
53 pages
IA-c06-NoAnim
No ratings yet
IA-c06-NoAnim
31 pages
W6-Adverserial Search
No ratings yet
W6-Adverserial Search
39 pages
GamePlaying_Minimax_Unit-2_SPS
No ratings yet
GamePlaying_Minimax_Unit-2_SPS
72 pages
Lecture 6 - minmax alpha beta
No ratings yet
Lecture 6 - minmax alpha beta
41 pages
6 Game
No ratings yet
6 Game
42 pages
6-GAME
No ratings yet
6-GAME
53 pages
Adversarial Search
No ratings yet
Adversarial Search
36 pages
Game-Playing & Adversarial Search
No ratings yet
Game-Playing & Adversarial Search
68 pages
Adversarial Search MinMax Alpha Beta Pruning
No ratings yet
Adversarial Search MinMax Alpha Beta Pruning
43 pages
06. Chapter. 06 - Adversarial Search and Games - No Embedded Videos
No ratings yet
06. Chapter. 06 - Adversarial Search and Games - No Embedded Videos
51 pages
Adversarial Search
No ratings yet
Adversarial Search
42 pages
Chapter3 - Search4
No ratings yet
Chapter3 - Search4
37 pages
AI Lec07 Adversarial Search
No ratings yet
AI Lec07 Adversarial Search
29 pages
ITSC6121 Lecture 4 -- Game Trees I
No ratings yet
ITSC6121 Lecture 4 -- Game Trees I
34 pages
Adversarial Search
No ratings yet
Adversarial Search
20 pages
Game Playing: Adversarial Search
No ratings yet
Game Playing: Adversarial Search
66 pages
UNIT-V
No ratings yet
UNIT-V
19 pages
III_AI-DS_AD3311_AI_Lab Manual
No ratings yet
III_AI-DS_AD3311_AI_Lab Manual
34 pages
Lec11&12-Adversarial Search
No ratings yet
Lec11&12-Adversarial Search
30 pages
Artificial Intelligence: Adversarial Search
No ratings yet
Artificial Intelligence: Adversarial Search
36 pages
AI Unit-2
No ratings yet
AI Unit-2
47 pages
aiml cia 1 QUESTION WITH ANSWER(1)
No ratings yet
aiml cia 1 QUESTION WITH ANSWER(1)
5 pages
Lectures 2
No ratings yet
Lectures 2
194 pages
AI Sheet 3 - Problem Solving As Search (Heuristic Search - Adversarial Search) PDF
No ratings yet
AI Sheet 3 - Problem Solving As Search (Heuristic Search - Adversarial Search) PDF
2 pages
CS607 Quiz-2 by Vu Topper RM
No ratings yet
CS607 Quiz-2 by Vu Topper RM
73 pages
Adversarial Search: in Artificial Intelligence
No ratings yet
Adversarial Search: in Artificial Intelligence
21 pages
Thesis Minimax Algorithm
No ratings yet
Thesis Minimax Algorithm
182 pages
AI unit 3
No ratings yet
AI unit 3
10 pages
279817393aams_vol_216_april_2022_a25_p3303-3313_kavita_sheoran,_et_al.
No ratings yet
279817393aams_vol_216_april_2022_a25_p3303-3313_kavita_sheoran,_et_al.
11 pages
AI - Project -Spring 2025
No ratings yet
AI - Project -Spring 2025
3 pages
6.034 Quiz 1, Spring 2005: 1 Search Algorithms (16 Points)
No ratings yet
6.034 Quiz 1, Spring 2005: 1 Search Algorithms (16 Points)
14 pages
Unit 5 AI
No ratings yet
Unit 5 AI
80 pages
1) What Is A Blockchain?
No ratings yet
1) What Is A Blockchain?
33 pages
AI & ML Unit 1 Notes
No ratings yet
AI & ML Unit 1 Notes
26 pages
A - Mini - Project - Report - Tic - Tac - Toe 12
No ratings yet
A - Mini - Project - Report - Tic - Tac - Toe 12
18 pages
AI MCQ QUESTION 100 MCQ
No ratings yet
AI MCQ QUESTION 100 MCQ
13 pages
Adversarial Search: Section 1 - 4
No ratings yet
Adversarial Search: Section 1 - 4
21 pages
Artificial Intelligence
No ratings yet
Artificial Intelligence
41 pages
Notes Artificial Intelligence Unit 4
No ratings yet
Notes Artificial Intelligence Unit 4
8 pages
MIET ASSIGNMENT 3 (Kapish 044)
No ratings yet
MIET ASSIGNMENT 3 (Kapish 044)
6 pages
03 Handout 1
No ratings yet
03 Handout 1
5 pages
Ai I Complete Notes by Murali Krishna
No ratings yet
Ai I Complete Notes by Murali Krishna
75 pages
AI 2 Mark Questions Unit 2
No ratings yet
AI 2 Mark Questions Unit 2
8 pages
Chess AI Base Paper
No ratings yet
Chess AI Base Paper
7 pages
Tictactoe AI Documentation
No ratings yet
Tictactoe AI Documentation
16 pages
Dot Research
No ratings yet
Dot Research
6 pages
CS 863 / CSE 860 Artificial Intelligence
No ratings yet
CS 863 / CSE 860 Artificial Intelligence
22 pages
AI - Experiment 6 - Alpha Beta Pruning
No ratings yet
AI - Experiment 6 - Alpha Beta Pruning
6 pages
Midterm F06 Solutions
No ratings yet
Midterm F06 Solutions
12 pages
Cs 4511 Mid1 s18
No ratings yet
Cs 4511 Mid1 s18
2 pages
Assassin's Creed Unity Guide (Unofficial)
From Everand
Assassin's Creed Unity Guide (Unofficial)
Fusion Media
No ratings yet
Game Guide for Titanfall (Unofficial)
From Everand
Game Guide for Titanfall (Unofficial)
Fusion Media
No ratings yet

L07 Adversarial Search

Uploaded by

L07 Adversarial Search

Uploaded by

COL333/671: Introduction to AI

• Core: contingency problem

Terminal nodes we get -1, 0 or 1 for loss, tie or

-20 -8 … -18 -5 … -10 +4 -20 +8

def max-value(state): def min-value(state):

def max-value(state): def min-value(state):

Useful, when there are multiple adversaries.

• Requires growing the tree till the

If min returns 9? Or 100?

def max-value(state, α, β): def min-value(state , α, β):

If the nodes were encountered as “best moves first”

Note: In reality, we don’t know the ordering.

Slide adapted from Prof. Mausam

• Ideal function: returns the actual minimax value of the position

• e.g. fi(s) = (number of pieces of type i), each weight wi etc.

• Evaluation functions are always imperfect.

• Evaluation function as a guidance for pruning

• How to find good orderings

• One approach – iterative deepening to determine

• Incorporate a notion of chance

def max-value(state): def exp-value(state):

v = (1/2) (8) + (1/3) (24) + (1/6) (-12) = 10

Can we perform pruning?

1,6,6 7,1,2 6,1,2 7,2,1 5,1,7 1,5,2 7,7,1 5,2,5

• As we get more evidence, probabilities may change:

• Example: How long to get to the airport?

Time: 20 min 30 min 60 min

• For now, assume each chance node magically comes

• We specify the utilities for a task, let the

• Maximum expected utility (MEU) principle:

You might also like