
Constraint satisfaction problems
Constraint satisfaction
• The goal is to discover some problem state that satisfies a given set of constraints; for example, a
design must be created within fixed limits on time, cost and materials.
• Formulating a problem this way substantially reduces the amount of search required, compared with
methods that build up partial solutions without exploiting the constraints.
• For example: if you want to travel to a place ‘X’ with ‘n’ possible modes of travel, there are many
candidate solutions when no constraints apply. If the travel is bounded by time, money, distance
and/or other factors, then the solution can be arrived at in less time.
• Constraint satisfaction is a search procedure that operates in a space of constraint sets.
– Initial state contains constraints that are originally given in the problem description.
– Goal state is any state that has been constrained “enough”, where “enough” must be defined for each
problem.
Constraint satisfaction
• It is a two-step process:
– Constraints are discovered and propagated as far as possible throughout the system. If the solution is not found, then
search begins.
– A guess about something is made and added as a new constraint. Propagation can then occur with this new constraint,
and so forth.
• Constraint propagation terminates for one of two reasons:
– A contradiction may be detected. If this happens, there is no solution consistent with all the known constraints. If the
contradiction involves only those constraints that were given as part of the problem specification, then no solution
exists.
– The propagation has run out of steam and there are no further changes that can be made on the basis of current knowledge.
If this happens and a solution has not yet been adequately specified, then search is necessary to get the process
moving again.
Algorithm: Constraint satisfaction
• Propagate the available constraints. To do this, first set OPEN to the set of all objects that
must have values assigned to them in a complete solution. Then do until an inconsistency is
detected or until OPEN is empty:
– Select an object OB from OPEN. Strengthen as much as possible the set of constraints that apply to OB.
– If this set is different from the set that was assigned the last time OB was examined, or if this is the first time OB has been examined,
then add to OPEN all objects that share any constraints with OB.
– Remove OB from OPEN.
• If the union of the constraints discovered above defines a solution, then quit and report the solution.
• If the union of the constraints discovered above defines a contradiction, then return failure.
• If neither of the above occurs, then it is necessary to make a guess at something in order to proceed. To do
this, loop until a solution is found or all possible solutions have been eliminated:
– Select an object whose value is not yet determined and select a way of strengthening the constraints on that object.
– Recursively invoke constraint satisfaction with the current set of constraints augmented by the strengthening constraint just selected.
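As a concrete illustration, here is a minimal Python sketch of this propagate-then-guess loop on a toy map-colouring problem (three mutually adjacent regions, three colours, adjacent regions must differ). The problem instance and all names (`propagate`, `solve`, the domain dictionaries) are illustrative assumptions, not part of the algorithm statement above:

```python
# Minimal sketch of constraint propagation followed by guessing, on a toy
# map-colouring problem. Each object's "constraint set" is its remaining
# domain of legal colours; a singleton domain means the value is decided.

def propagate(domains, neighbours):
    """Remove values that conflict with an already-decided neighbour.
    Returns False if some domain becomes empty (a contradiction)."""
    changed = True
    while changed:
        changed = False
        for var, dom in domains.items():
            for nb in neighbours[var]:
                # A neighbour fixed to a single colour rules that colour out.
                if len(domains[nb]) == 1 and domains[nb][0] in dom:
                    dom.remove(domains[nb][0])
                    changed = True
            if not dom:
                return False  # contradiction: no consistent value remains
    return True

def solve(domains, neighbours):
    if not propagate(domains, neighbours):
        return None                        # contradiction detected
    if all(len(d) == 1 for d in domains.values()):
        return {v: d[0] for v, d in domains.items()}  # fully constrained
    # Propagation has run out of steam: guess a value and recurse.
    var = min((v for v in domains if len(domains[v]) > 1),
              key=lambda v: len(domains[v]))
    for value in list(domains[var]):
        trial = {v: list(d) for v, d in domains.items()}
        trial[var] = [value]               # the guess, added as a constraint
        result = solve(trial, neighbours)
        if result:
            return result
    return None                            # all guesses eliminated

# Toy problem: colour three mutually adjacent regions with three colours.
neighbours = {'A': ['B', 'C'], 'B': ['A', 'C'], 'C': ['A', 'B']}
domains = {v: ['red', 'green', 'blue'] for v in neighbours}
print(solve(domains, neighbours))  # e.g. {'A': 'red', 'B': 'green', 'C': 'blue'}
```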
Monte Carlo Tree Search
• It is a heuristic search algorithm used for decision-making in artificial intelligence,
particularly in games and scenarios that involve complex choices.
• It combines tree search methods with Monte Carlo simulations to explore the
consequences of actions and to identify the optimal strategies over time.
• MCTS has been used successfully in board games like Go, chess, and other decision-making
problems, as it handles large state spaces efficiently.
• It consists of four main steps:
– Selection
– Expansion
– Simulation
– Backpropagation
• Selection:
– Start from the root node and select child nodes based on a selection policy, typically a balance between exploitation
(favoring nodes that have yielded good results) and exploration (favoring nodes that are less explored).
– A popular selection strategy is the Upper Confidence Bound for Trees (UCB1), which balances exploration and
exploitation (see the formula at the end of this list).
• Expansion:
– Once a leaf node is reached, if the node is not a terminal node (i.e., the game or decision isn't over), new child nodes are
added.
– These represent possible future actions or moves.
• Simulation (Rollout):
– From the newly added node, a simulation is run by making random moves until a terminal state is reached (win, lose, or
draw, in the case of games). This gives an estimate of the outcome of that sequence of actions.
• Backpropagation:
– The result of the simulation is then backpropagated through the tree, updating the statistics (e.g., win/loss records) for
each node along the path from the selected node back to the root.
– These steps are repeated many times, gradually refining the tree and guiding the search toward better actions.
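For reference, the UCB1 value of a child node $i$ used in the selection step is commonly computed as:

$$\mathrm{UCB1}(i) = \frac{w_i}{n_i} + c\sqrt{\frac{\ln N}{n_i}}$$

where $w_i$ is the number of simulation wins recorded for child $i$, $n_i$ is its visit count, $N$ is the visit count of its parent, and $c$ is an exploration constant (often taken to be $\sqrt{2}$).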
Monte Carlo Tree Search is applied in the following scenarios:
• The state space is very large, making traditional search methods infeasible.
• It's difficult to evaluate the exact value of non-terminal states.
• Randomized simulations can provide reasonable estimates of state value.
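To make the four steps concrete, here is a compact, self-contained Python sketch of MCTS applied to a deliberately tiny game (Nim: players alternately take 1 or 2 stones, and whoever takes the last stone wins). The game and every identifier here are illustrative choices for this sketch, not part of any standard library:

```python
import math, random

class Node:
    def __init__(self, stones, player, parent=None, move=None):
        self.stones, self.player = stones, player   # state: stones left, side to move
        self.parent, self.move = parent, move
        self.children, self.wins, self.visits = [], 0, 0

    def untried_moves(self):
        tried = {c.move for c in self.children}
        return [m for m in (1, 2) if m <= self.stones and m not in tried]

def ucb1(node, c=math.sqrt(2)):
    # Exploitation term (win rate) plus exploration term.
    return node.wins / node.visits + c * math.sqrt(math.log(node.parent.visits) / node.visits)

def mcts(root_stones, iterations=2000):
    root = Node(root_stones, player=+1)
    for _ in range(iterations):
        node = root
        # 1. Selection: descend while fully expanded and non-terminal.
        while node.stones > 0 and not node.untried_moves():
            node = max(node.children, key=ucb1)
        # 2. Expansion: add one untried child, if the node is non-terminal.
        if node.stones > 0:
            m = random.choice(node.untried_moves())
            child = Node(node.stones - m, -node.player, parent=node, move=m)
            node.children.append(child)
            node = child
        # 3. Simulation (rollout): random play to a terminal state.
        stones, player = node.stones, node.player
        while stones > 0:
            stones -= random.choice([t for t in (1, 2) if t <= stones])
            player = -player
        winner = -player   # the player who just took the last stone
        # 4. Backpropagation: update win/visit counts back to the root.
        while node:
            node.visits += 1
            if winner == -node.player:  # win for the player who moved into node
                node.wins += 1
            node = node.parent
    return max(root.children, key=lambda c: c.visits).move

print(mcts(5))  # with 5 stones, taking 2 (leaving 3) is the winning move
```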
Monte Carlo Tree Search - example
Problem Setup: We are designing an AI to play Tic-Tac-Toe using MCTS. Assume the AI plays
as "X," and it is the AI's turn to make a move.
✓ Selection: The algorithm starts at the root node, representing the current board
configuration. The selection process begins, navigating the game tree to find a leaf node to
expand. Initially, all moves are equally likely since no simulations have been run. Example
current board state:

✓ Here, "X" can make a move in one of the four remaining empty spaces. The selection policy
(UCB1) will guide the choice. At the start, since no statistics are available, it randomly
selects one of the available moves.
✓ Expansion: After selecting a node (for instance, placing "X" in the bottom middle), if the
node represents a non-terminal state (i.e., the game is not over), the child nodes are
expanded to explore possible future states. Board after "X" moves to the bottom middle:

✓ Here, "X" can make a move in one of the four remaining empty spaces. The selection policy
(UCB1) will guide the choice. At the start, since no statistics are available, it randomly
selects one of the available moves.
✓ Simulation (Rollout): A simulation is run from the expanded node, randomly selecting
valid moves for both players until a terminal state (win/loss/draw) is reached.
✓ Simulated moves: "O" places its mark in the top right. "X" places its mark in the bottom left and wins.
✓ Simulated final state:
✓ Backpropagation: The result of the simulation (a win for "X") is backpropagated through
the tree. Each node along the path from the expanded node to the root updates its
win/visit counts. For the node representing the bottom middle move, its win count is
incremented, and its visit count is updated.
✓ Repeat: This process (selection, expansion, simulation, backpropagation) repeats thousands
of times. Over time, the algorithm will gather statistics for each possible move at the root
node (current state of the board). The move with the highest win rate is selected as the AI’s
next move.
✓ Decision Making: After sufficient simulations, the MCTS will select the move that maximizes its
winning probability. In this example, the AI might decide to place "X" in the bottom middle
position, as simulations suggest that this leads to victory more frequently.
Minimax search procedure
✓ It is an algorithm used in decision-making and game theory to minimize the possible loss in
a worst-case scenario.
✓ It is typically applied to zero-sum games, where one player’s gain is another player’s loss.
✓ The algorithm assumes that both players play optimally. The key concepts of Minimax are
as follows:
✓ Maximizing Player: Tries to maximize their score.
✓ Minimizing Player: Tries to minimize the maximizing player’s score (i.e., maximize their own score in a two-player
game).
✓ Game Tree: Represents possible moves for both players. Each node is a game state, and branches are possible actions.
✓ Terminal Node: A game state where the game ends, and an outcome is assigned (win, loss, or draw).
✓ Evaluation Function: Assigns a score to each terminal state (e.g., win = +1, loss = -1, draw = 0).
Steps of Minimax search
✓ Generate the Game Tree: List all possible moves for both players from the current state.
✓ Apply Minimax:
✓ Recursively evaluate each move.
✓ For the maximizing player, choose the move with the highest score.
✓ For the minimizing player, choose the move with the lowest score.

✓ Backpropagate Values: The value of each node is the best possible outcome for the player
whose turn it is.
✓ Choose the Optimal Move: The root of the game tree will show the best move based on the
Minimax evaluation.
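A minimal Python sketch of these steps, assuming the game tree has already been generated as nested lists whose leaves are terminal scores; the tree values below are illustrative:

```python
# Minimal minimax sketch: inner nodes are lists of children, leaves are
# terminal scores from the maximizing player's (X's) point of view.

def minimax(node, maximizing):
    if not isinstance(node, list):        # terminal node: return its evaluation
        return node
    scores = [minimax(child, not maximizing) for child in node]
    return max(scores) if maximizing else min(scores)

# Echo of the tic-tac-toe example below: one move line ends in a win (+1),
# the other in a draw (0); Max should therefore pick the first move.
tree = [[+1], [0]]
print(minimax(tree, maximizing=True))     # prints 1
```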
Example: tic-tac-toe - game tree construction
1. Maximizing Player (X’s Turn): X has two possible moves:
1. Move 1: X plays at (2,1)
2. Move 2: X plays at (3,1)

2. After X’s move, O plays (Minimizing Player): O then places their mark, trying to minimize X's chances of
winning.
1. For each move, generate possible responses by O, and continue until the game reaches terminal states (win, loss, or draw).
3. Evaluating Moves Using Minimax:
1. If X plays at (2,1), O may play at (3,1). This leads to one terminal state:
– X wins in this case, so the score is +1 for X.
2. If X plays at (3,1), O can block X by playing at (2,1). This leads to a draw, so the score is 0.

4. Backpropagation:
• If X plays at (2,1), the outcome is a win for X (+1).
• If X plays at (3,1), the outcome is a draw (0).
5. Final Decision:
– Since X (the maximizing player) wants to maximize their score, they will choose to play at (2,1), as it gives them the highest score (+1,
a guaranteed win).
Minimax: advantages and limitations

• Advantages of Minimax:
• Guarantees an optimal solution if both players play perfectly.
• Simple and effective for small games like Tic-Tac-Toe.

• Limitations:
• For large games (e.g., chess), the number of possible states can become too large to
compute exhaustively. In such cases, algorithms like Alpha-Beta pruning are
used to reduce the number of nodes evaluated.
Alpha-beta pruning:
• It is an optimization technique for the Minimax algorithm used in decision-making, primarily in
game theory.
• It helps reduce the number of nodes that need to be evaluated in the game tree, making the
Minimax algorithm more efficient without affecting the outcome.
• Alpha-Beta pruning eliminates the need to explore subtrees that cannot possibly influence the
final decision. It "prunes" branches in the game tree, saving computational resources by ignoring
branches that are irrelevant for determining the best move.
• Terminology:
– Alpha (α): The best value (highest) that the maximizing player can guarantee up to that point.
– Beta (β): The best value (lowest) that the minimizing player can guarantee up to that point.
• Alpha and beta values are used to cut off branches of the game tree when a better option has
already been found.
Steps in Alpha-beta pruning:
• Maximizing Player (Alpha): The maximizing player tries to maximize the score.
Alpha is updated whenever a new best move is found for the maximizing player.
• Minimizing Player (Beta): The minimizing player tries to minimize the score.
Beta is updated whenever a new best move is found for the minimizing player.
• Pruning: If at any point during evaluation one player finds an option that is at least as
good for them as anything the opponent would ever allow along the current branch,
further evaluation of that branch is "pruned", i.e., stopped.
• Pruning condition: Prune when β ≤ α. At that point the current node can no longer produce
a value better than an option already available higher in the tree, so the subtree below
it can be ignored: it cannot influence the final decision.
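A minimal Python sketch of this, reusing the nested-list game trees from the minimax sketch earlier; the example values mirror the numeric walkthrough at the end of this section and are otherwise arbitrary:

```python
# Alpha-beta sketch over nested-list trees: prune a subtree as soon as
# beta <= alpha, since it can no longer influence the final decision.

def alphabeta(node, maximizing, alpha=float('-inf'), beta=float('inf')):
    if not isinstance(node, list):          # terminal node
        return node
    if maximizing:
        value = float('-inf')
        for child in node:
            value = max(value, alphabeta(child, False, alpha, beta))
            alpha = max(alpha, value)       # best guaranteed for Max so far
            if beta <= alpha:
                break                       # Min will never allow this branch
        return value
    else:
        value = float('inf')
        for child in node:
            value = min(value, alphabeta(child, True, alpha, beta))
            beta = min(beta, value)         # best guaranteed for Min so far
            if beta <= alpha:
                break                       # Max will never choose this branch
        return value

# Mirrors the walkthrough below: the first Min branch scores 3; in the second,
# seeing a 1 lets us skip its remaining children, since Max already has 3.
print(alphabeta([[3, 5], [1, 8]], maximizing=True))   # prints 3
```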
Example: Tic-Tac-Toe
• Consider a simplified game tree for a Tic-Tac-Toe scenario, where
Player X (Maximizing) and Player O (Minimizing) alternate
moves.
• Evaluation without Alpha-Beta Pruning:
– The Minimax algorithm evaluates all nodes in the tree, leading to an exhaustive search of
all possible moves.
Example: Tic-Tac-Toe
• Evaluation with Alpha-Beta Pruning:
• Alpha-Beta pruning can cut off unnecessary
evaluations:

– Start at the root node (X’s turn), and explore the first child.
– Evaluate the first branch completely, computing alpha (for X) and beta (for O).
– As soon as you find a move that results in a worse outcome than the already evaluated
branches (for the minimizing player O), prune the remaining branches in that part of
the tree.
Alpha-beta pruning logic
• At the top, it is X's (Max) turn, and X wants to maximize the score.
• The first branch is evaluated fully, giving a score of 3, so alpha = 3 at the root.
• The second branch is a node where O (Min) chooses. Its first option gives a score of 1,
which makes O's best option so far beta = 1.
• Since beta (1) ≤ alpha (3), the remaining options under this node can be pruned: O can
hold X to at most 1 here, while X already has a branch worth 3, so X will never choose
this branch.
