L06 (Adversarial Search)
Course Code: CSC4226 Course Title: Artificial Intelligence and Expert System
Competitive environments, in which the agents’ goals are in conflict, give rise to
adversarial search problems, often known as games.
In AI, the most common games are of a rather specialized kind—what game theorists call
deterministic, turn-taking, two-player, zero-sum games of perfect information (such as
chess).
This means deterministic, fully observable environments in which two agents act alternately
and in which the utility values at the end of the game are always equal and opposite.
Games as a Search Problem
The game ends when either Max or Min reaches a terminal state.
Max must develop a strategy that determines the best possible move in
response to each move Min makes (a minimal sketch follows below).
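One way to see this strategy concretely is the minimax sketch below, assuming a hypothetical Game interface with actions(state), result(state, action), is_terminal(state), and utility(state), where utility is measured from Max's point of view:

def minimax_decision(game, state):
    # Max picks the action whose resulting state has the highest
    # value under Min's best replies.
    return max(game.actions(state),
               key=lambda a: min_value(game, game.result(state, a)))

def max_value(game, state):
    if game.is_terminal(state):
        return game.utility(state)
    return max(min_value(game, game.result(state, a))
               for a in game.actions(state))

def min_value(game, state):
    if game.is_terminal(state):
        return game.utility(state)
    return min(max_value(game, game.result(state, a))
               for a in game.actions(state))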
Tic-Tac-Toe Revisited
Example: Two-Ply Game
Time complexity?
O(b^m)
Space complexity?
O(bm) (depth-first search, generate all actions at once)
O(m) (backtracking search, generate actions one at a time)
Game Tree Size
Tic-Tac-Toe
b ≈ 5 legal actions per state on average, total of 9 plies in game.
“ply” = one action by one player, “move” = two plies.
5^9 = 1,953,125
9! = 362,880
an exact solution is quite reasonable (the quick check below confirms these counts)
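A quick check of these counts in Python:

import math
print(5 ** 9)             # 1953125 -- upper bound using ~5 actions per state
print(math.factorial(9))  # 362880  -- orderings of the 9 squares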
Is There Another Way?
Chess, on average, has:
35 branches (legal moves) per position and
usually at least 100 moves per game,
so the game space is about 35^100.
Instead of searching this exactly, we cut off the search and apply a heuristic
evaluation function, which must not take too long (a trade-off between
accuracy and time cost).
EVAL(s) = w1·f1(s) + w2·f2(s) + … + wn·fn(s)
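A sketch of such a weighted linear evaluation function; the two features below (material balance and mobility) and their weights are illustrative assumptions, not a fixed standard:

def weighted_eval(state, features, weights):
    # EVAL(s) = w1*f1(s) + w2*f2(s) + ... + wn*fn(s)
    return sum(w * f(state) for w, f in zip(weights, features))

features = [
    lambda s: s["material"],   # material balance (pawn = 1, ..., queen = 9)
    lambda s: s["mobility"],   # number of legal moves available
]
weights = [1.0, 0.1]

print(weighted_eval({"material": 3, "mobility": 20}, features, weights))  # 5.0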
The deeper the search, the more information is available to the program,
and the more accurate the evaluation function becomes.
Iterative deepening – when time runs out, the program returns the move from
the deepest completed search (sketched below).
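A minimal sketch of this time-limited iterative deepening, assuming a hypothetical depth-limited search function search_at_depth(game, state, depth) that returns a move:

import time

def iterative_deepening(game, state, search_at_depth, time_limit=1.0):
    deadline = time.monotonic() + time_limit
    best_move, depth = None, 1
    while time.monotonic() < deadline:
        # Each pass completes a full search one ply deeper; a real engine
        # would also check the clock inside the search itself.
        best_move = search_at_depth(game, state, depth)
        depth += 1
    return best_move  # move from the deepest completed search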
Horizon problem:
An unavoidable bad outcome can be pushed just beyond the search depth (the
“horizon”), so moves whose consequences lie deeper in the search tree may be
overlooked by the evaluation function.
Pruning
What is pruning?
The process of eliminating a branch of the search tree
from consideration without examining it.
Why prune?
To avoid searching nodes that cannot affect the final decision
(they would never be reached in rational play).
To speed up the search process.
Alpha-Beta Pruning
A technique for finding the optimal move in a depth-limited search that
uses evaluation functions.
It gets its name from the two variables that are passed along during the
search, which restrict the set of possible solutions.
Alpha-beta: Definitions
Alpha –
the value of the best choice (highest value) found so far along the path for MAX.
Beta –
the value of the best choice (lowest value) found so far along the path for MIN.
Implementation
Set root node alpha to negative infinity and beta to positive infinity.
Search depth first, propagating alpha and beta values down to all
nodes visited until reaching desired depth.
We will only pass the alpha, beta values to the child nodes.
Prune whenever α ≥ β.
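A sketch of this procedure, reusing the hypothetical Game interface from the minimax sketch above:

import math

def alphabeta_decision(game, state):
    value, move = max_value(game, state, -math.inf, math.inf)
    return move

def max_value(game, state, alpha, beta):
    if game.is_terminal(state):
        return game.utility(state), None
    value, move = -math.inf, None
    for a in game.actions(state):
        v, _ = min_value(game, game.result(state, a), alpha, beta)
        if v > value:
            value, move = v, a
            alpha = max(alpha, value)
        if alpha >= beta:  # prune: remaining siblings cannot change the result
            return value, move
    return value, move

def min_value(game, state, alpha, beta):
    if game.is_terminal(state):
        return game.utility(state), None
    value, move = math.inf, None
    for a in game.actions(state):
        v, _ = max_value(game, game.result(state, a), alpha, beta)
        if v < value:
            value, move = v, a
            beta = min(beta, value)
        if alpha >= beta:  # prune
            return value, move
    return value, move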
Alpha-Beta Example
[Figure: a step-by-step alpha-beta trace on a two-ply tree with leaf values
including 3, 5, 1, and 2. Each node starts with α = -∞ and β = +∞; the bounds
tighten as children are evaluated, and the root’s minimax value is 3.]
General alpha-beta pruning:
Consider a node n somewhere in the tree. If a player has a better choice m
at the parent node of n, or at any choice point further up, then n will
never be reached in actual play and can be pruned.
Best-Case
each player’s best move is the left-most child (i.e., evaluated first);
with perfect ordering, alpha-beta examines only O(b^(m/2)) nodes, roughly
doubling the searchable depth
in practice, performance is closer to the best case than the worst case
E.g., sort moves by the move values remembered from the last search (a
sketch follows this list).
E.g., expand captures first, then threats, then forward moves, etc.
E.g., run Iterative Deepening search and sort moves by their values from
the last iteration.
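A sketch of the first ordering heuristic, assuming a hypothetical cache
remembered that maps (state, action) pairs to the values found in the
previous search:

def ordered_actions(game, state, remembered):
    # Try previously good moves first so that cutoffs occur earlier.
    return sorted(game.actions(state),
                  key=lambda a: remembered.get((state, a), 0),
                  reverse=True)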
If there is only one legal move, this algorithm will still generate an
entire search tree.
It overlooks moves that forfeit something early in exchange for a better
position later (e.g., a piece sacrifice in chess).