Chapter 4
Game Search:
Games are a form of multi-agent environment.
What do other agents do, and how do they affect our success?
Multi-agent environments may be cooperative or competitive.
Competitive multi-agent environments give rise to adversarial search problems, often
known as games.
Games – adversary:
The solution is a strategy (a strategy specifies a move for every possible opponent reply).
Time limits force an approximate solution.
Evaluation function: evaluates the "goodness" of a game position.
Example: chess.
Difference between the search space of a game and the search space of a problem:
In the former case it represents the moves of two (or more) players, whereas in the
latter case it represents the "moves" of a single problem-solving agent.
An exemplary game: Tic-tac-toe
There are two players, denoted by X and O.
They alternately write their letter in one of the 9 cells of a 3-by-3 board.
The winner is the one who succeeds in writing three of their letters in a line.
The game ends in a win for one player and a loss for the other, or possibly in a draw.
The root node is the initial state, in which it is the first player's turn to move (the player
X).
The successors of the initial state are the states the player can reach in one move, their
successors are the states resulting from the other player's possible replies, and so on.
Terminal states are those representing a win for X, loss for X, or a
draw.
Each path from the root node to a terminal node gives a different
complete play of the game.
Initial state: It includes the board position and identifies the players to move.
Successor function: It gives a list of (move, state) pairs each indicating a legal move
and resulting state.
Terminal test: This determines when the game is over. States where the game is
ended are called terminal states.
Utility function: It gives a numerical value to terminal states, e.g. win (+1), lose (-1)
and draw (0). Some games have a wider range of possible outcomes, e.g. ranging
from +192 to -192.
The Minimax Algorithm:
Let us assign the following values for the game: 1 for a win by X, 0 for a draw, and -1
for a loss by X.
Given the values of the terminal nodes (win for X (1), loss for X (-1), or draw (0)), the
values of the non-terminal nodes are computed as follows:
•the value of a node where it is the turn of player X to move is the maximum of
the values of its successors (because X tries to maximize its outcome).
•the value of a node where it is the turn of player O to move is the minimum of
the values of its successors (because O tries to minimize the outcome of X).
Figure below shows how the values of the nodes of the search tree are computed from
the values of the leaves of the tree.
The values of the leaves of the tree are given by the rules of the game:
•1 if there are three X in a row, column or diagonal;
•-1 if there are three O in a row, column or diagonal;
•0 otherwise
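These rules can be sketched in code. The following is a minimal illustration (not from the text), representing a game tree as nested lists whose numbers are the terminal values for X:

```python
def minimax(node, x_to_move):
    """Minimax value of a game-tree node for player X.

    A node is either a number (the value of a terminal state:
    1 for a win by X, -1 for a loss by X, 0 for a draw)
    or a list of child nodes.
    """
    if isinstance(node, (int, float)):   # terminal state
        return node
    values = [minimax(child, not x_to_move) for child in node]
    # At X's turn take the maximum of the successors' values,
    # at O's turn the minimum.
    return max(values) if x_to_move else min(values)

# A tiny illustrative tree: X to move at the root, O at the next level.
tree = [[1, 0], [-1, 0, 1], [0, -1]]
print(minimax(tree, True))   # → 0
```

Here each Min node's value is the minimum of its leaves (0, -1 and -1), and the root, where X moves, takes their maximum, 0.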
[Figure: part of the tic-tac-toe game tree. At levels where O is to move (Min), a node's
value is the minimum of its successors' values; at levels where X is to move (Max), it is
the maximum. One terminal leaf shown has the value 1 (a win for X).]
Exercise: Consider the game tree below. Max moves at the root a; b and c are Min
nodes; d, e, f and g are Max nodes; the leaves h through r carry the values shown
(d has leaves h, i; e has j, k, l; f has m, n; g has o, p, r).

Max                 a
Min           b           c
Max        d     e     f     g
Leaves    h i  j k l  m n  o p r
          5 3  3 1 4  7 5  9 2 7

Show what moves should be chosen by the two players, assuming that both
are using the minimax procedure.
Solution:

Max                 a = 7
Min           b = 4       c = 7
Max        d = 5  e = 4  f = 7  g = 9
Leaves     h i    j k l  m n    o p r
           5 3    3 1 4  7 5    9 2 7

Each Max node takes the maximum of its children's values (e.g. d = max(5, 3) = 5),
each Min node the minimum (b = min(5, 4) = 4, c = min(7, 9) = 7), and the root
a = max(4, 7) = 7. So Max should move to c, and Min should then reply by moving to f.
Alpha-Beta Pruning:
The problem with minimax search is that the number of game states it has to examine
is exponential in the number of moves.
Unfortunately, we cannot eliminate the exponent, but we can effectively cut it in half.
The idea is to compute the correct minimax decision without looking at every node in
the game tree, which is the concept behind pruning.
The particular technique for pruning that we will discuss here is “Alpha-Beta
Pruning”.
When this approach is applied to a standard minimax tree, it returns the same move
as minimax would, but prunes away branches that cannot possibly influence the final
decision.
Alpha-beta pruning can be applied to trees of any depth, and it is often possible to
prune entire sub-trees rather than just leaves.
Alpha-beta pruning is a technique for evaluating nodes of a game tree that eliminates
unnecessary evaluations.
Alpha: is the value of the best (i.e. highest value) choice we have found so far at any
choice point along the path for MAX.
Beta: is the value of the best (i.e. lowest-value) choice we have found so far at any
choice point along the path for MIN.
Alpha-beta search updates the values of alpha and beta as it goes along, and prunes
the remaining branches at a node as soon as the value of the current node is known to
be worse than the current alpha (for MAX) or beta (for MIN).
An alpha cutoff:
To apply this technique, one uses a parameter called alpha that represents a lower
bound for the achievement of the Max player at a given node.
Let us consider that the current board situation corresponds to the node A in the
following figure.
[Figure: Max node A with successors B and C; C is a Min node whose Max
successors are D and E, with f(D) = 10. Once B has been evaluated, alpha at A is 15.]
The Max player evaluates its descendants in a depth-first order. It will therefore
estimate first the value of the node B. Let us suppose that this value has been
evaluated to 15, either by using a static evaluation function, or by backing up values
from descendants omitted in the figure.
Therefore 15 is a lower bound for the achievement of the Max player (it may still
be possible to achieve more, depending on the values of the other descendants of
A).
This value is transmitted upward to the node A and will be used for evaluating the
other possible moves from A.
Let us assume that the value of D is 10 (this value has been obtained either by
applying a static evaluation function directly to D, or by backing up values from
descendants omitted in the figure).
Because this value is less than the value of alpha, the best move for Max is to
node B, independent of the value of node E that need not be evaluated.
Indeed, if the value of E is greater than 10, Min will move to D which has the
value 10 for Max.
Otherwise, if the value of E is less than 10, Min will move to E which has a value
less than 10.
So, if Max moves to C, the best it can get is 10, which is less than the value alpha = 15
that it would get by moving to B.
A beta cutoff:
To apply this technique, one uses a parameter called beta that represents an upper
bound for the achievement of the Max player at a given node.
The Min player also evaluates its descendants in a depth-first order. Suppose its first
descendant, the node F, has been evaluated to 15.
From the point of view of Min, this is an upper bound for the achievement of Min
(it may still be possible to make Min achieve less, depending on the values of the
other descendants of B).
Therefore, after evaluating the node F, the value of beta is 15.
This value is transmitted upward to the node B and will be used for evaluating the other
possible moves from B.
Let us assume that the value of H is 25 (this value has been obtained either by applying
a static evaluation function directly to H, or by backing up values from descendants
omitted in the figure).
Because this value is greater than the value of beta, the best move for Min is to node F,
independent of the value of node I, which need not be evaluated.
So, whether the value of I is greater or less than 25, the value obtained by Max at the
node containing H and I is at least 25, which is greater than beta (15, the best value
Max would obtain if Min moves to F).
Therefore, the best move for Min is at F, independent of the value of I.
One should notice that by applying alpha and beta cutoffs, one obtains the same
results as in the case of minimax, but (in general) with less effort.
This means that, in a given amount of time, one could search deeper in the game tree
than in the case of minimax.
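As a rough sketch of the procedure (an illustration, not the text's own code), minimax with alpha-beta cutoffs over a nested-list game tree can be written as follows; a counter of leaf evaluations shows the saving on the ten-leaf exercise tree from the minimax section (leaf grouping as implied by its solution):

```python
def alphabeta(node, maximizing, alpha=float("-inf"), beta=float("inf"),
              counter=None):
    """Minimax value of `node` with alpha-beta pruning.

    A node is either a number (terminal value for Max) or a list of
    child nodes; `counter`, if given, counts leaf evaluations.
    """
    if isinstance(node, (int, float)):       # terminal leaf
        if counter is not None:
            counter[0] += 1
        return node
    if maximizing:
        value = float("-inf")
        for child in node:
            value = max(value, alphabeta(child, False, alpha, beta, counter))
            alpha = max(alpha, value)
            if alpha >= beta:                # beta cutoff
                break
        return value
    else:
        value = float("inf")
        for child in node:
            value = min(value, alphabeta(child, True, alpha, beta, counter))
            beta = min(beta, value)
            if beta <= alpha:                # alpha cutoff
                break
        return value

# The exercise tree: Max at the root a, Min nodes b and c,
# Max nodes d..g, ten leaves.
tree = [[[5, 3], [3, 1, 4]], [[7, 5], [9, 2, 7]]]
count = [0]
print(alphabeta(tree, True, counter=count))   # → 7, same as plain minimax
print(count[0], "of 10 leaves evaluated")     # 8 of 10: two leaves are pruned
```

The pruned leaves sit under g: once its first leaf (9) exceeds beta = 7, Min at c will never move to g, so the remaining leaves of g are skipped.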
Game Theory
• Game theory is the study of how to mathematically determine the best strategy for given
conditions in order to optimize the outcome.
• Game theory assumes that all human interactions can be understood and navigated by
such strategic presumptions.
Importance of game theory
Games of Chance
• In games with uncertainty, players include a random element (rolling dice, flipping a
coin, etc.) to determine what moves to make,
i.e. dice are rolled at the beginning of a player's turn to determine the legal moves.
• Chance games are good for exploring decision making in adversarial problems involving
both skill and luck.
Constraint Satisfaction Problems
A constraint satisfaction problem consists of three components, X, D and C:
X is a set of variables, {X1, …, Xn}.
D is a set of domains, {D1, …, Dn}, one for each variable.
C is a set of constraints that specify allowable combinations of values.
Example problem: Map coloring
We are looking at a map of Australia showing its states and territories.
We are given the task of coloring each region either red, green or blue in such a way
that no neighboring regions have the same color.
Since there are nine places where regions border, there are nine constraints:
C = {SA ≠ WA, SA ≠ NT, SA ≠ Q, SA ≠ NSW, SA ≠ V, WA ≠ NT,
NT ≠ Q, Q ≠ NSW, NSW ≠ V}
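A minimal backtracking sketch for this CSP (illustrative code, not from the text; Tasmania, T, is included as a region with no neighbors):

```python
# Adjacency derived from the nine constraints above (T has no neighbors).
neighbors = {
    "WA": ["NT", "SA"],
    "NT": ["WA", "SA", "Q"],
    "SA": ["WA", "NT", "Q", "NSW", "V"],
    "Q": ["NT", "SA", "NSW"],
    "NSW": ["SA", "Q", "V"],
    "V": ["SA", "NSW"],
    "T": [],
}
COLORS = ["red", "green", "blue"]

def backtrack(assignment):
    """Assign a color to each region so that no two neighbors match."""
    unassigned = [v for v in neighbors if v not in assignment]
    if not unassigned:
        return assignment                      # every region is colored
    var = unassigned[0]
    for color in COLORS:
        # The constraint: var's color must differ from all its neighbors'.
        if all(assignment.get(n) != color for n in neighbors[var]):
            assignment[var] = color
            result = backtrack(assignment)
            if result is not None:
                return result
            del assignment[var]                # undo and try the next color
    return None                                # dead end: backtrack

print(backtrack({}))
```

Whenever a partial assignment violates a constraint, the search undoes the last choice and tries another color, which is exactly the backtracking view of constraint satisfaction.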
Crypt-arithmetic Problem
• Many problems in AI can be considered problems of constraint satisfaction, in
which the goal state satisfies a given set of constraints.
In a crypt-arithmetic puzzle, letters stand for digits:
If the same letter occurs more than once, it must be assigned the same digit
each time, and different letters must be assigned different digits.
The sum of the digits must be arithmetically correct, with the added restriction
that no leading zeroes are allowed.
Example 1
Solve the following crypt arithmetic problem
      T W O
    + T W O
    -------
    F O U R

Solution (T = 9, W = 2, O = 8):

      9 2 8
    + 9 2 8
    -------
    1 8 5 6

F = 1, O = 8, U = 5, R = 6, T = 9, W = 2
Example 2
      F O U R
    + F O U R
    ---------
    E I G H T

Solution:

      9 2 3 5
    + 9 2 3 5
    ---------
    1 8 4 7 0

E = 1, I = 8, G = 4, H = 7, T = 0, F = 9, O = 2, U = 3, R = 5
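Both examples can be checked by brute force. The sketch below (illustrative, not from the text) enumerates digit assignments for Example 1 with `itertools.permutations`; note that TWO + TWO = FOUR has more than one valid solution, of which 928 + 928 = 1856 above is one:

```python
from itertools import permutations

def solve_two_plus_two():
    """Return all (TWO, FOUR) pairs satisfying TWO + TWO = FOUR."""
    solutions = []
    letters = "TWOFUR"                    # six distinct letters
    # permutations guarantees different letters get different digits
    for digits in permutations(range(10), len(letters)):
        a = dict(zip(letters, digits))
        if a["T"] == 0 or a["F"] == 0:    # no leading zeroes
            continue
        two = 100 * a["T"] + 10 * a["W"] + a["O"]
        four = 1000 * a["F"] + 100 * a["O"] + 10 * a["U"] + a["R"]
        if two + two == four:
            solutions.append((two, four))
    return solutions

sols = solve_two_plus_two()
print((928, 1856) in sols)    # → True
print(len(sols), "solutions in total")
```

The same pattern (enumerate assignments, reject those that break a constraint) applies directly to Example 2 with the letters of FOUR + FOUR = EIGHT.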