CS221 - Artificial Intelligence - Search - 4 Dynamic Programming

The minimal state to track the objective of traveling from city 1 to city n while visiting at least 3 odd cities is (minimum of number of odd cities visited or 3, current city). This captures just enough information - the number of odd cities visited up to 3, and the current city - to determine the optimal future actions.

Uploaded by

Ardiansyah Mochamad Nugraha


Search: dynamic programming

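Dynamic programming

[Figure: state s --Cost(s, a)--> state s' --FutureCost(s')--> end state]

Minimum cost path from state s to an end state:

$$
\text{FutureCost}(s) =
\begin{cases}
0 & \text{if } \text{IsEnd}(s) \\
\min_{a \in \text{Actions}(s)} \left[ \text{Cost}(s, a) + \text{FutureCost}(\text{Succ}(s, a)) \right] & \text{otherwise}
\end{cases}
$$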

CS221 2
• Now let’s see if we can avoid the exponential running time of tree search. Our first algorithm will be dynamic programming. We have already seen dynamic programming in specific contexts. Now we will use the search problem abstraction to define a single dynamic program for all search problems.
• First, let us think about the minimum cost path in the search tree recursively. Define FutureCost(s) as the cost of the minimum cost path from s to some end state. The minimum cost path from a state s to an end state must take a first action a, which results in another state s', from which we had better take a minimum cost path to the end state.
• Written in symbols, we have a nice recurrence. Throughout this course, we will see many recurrences of this form. The basic form is a base case (when s is an end state) and an inductive case, which takes the minimum over all possible actions a from s of the immediate action cost Cost(s, a) plus the future cost of the resulting state.
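The recurrence can be transcribed almost literally into code. The following is a minimal sketch, not the course's reference implementation: the four-city cost table `COST` and the helper names `is_end`, `actions`, `succ`, and `cost` are hypothetical stand-ins for IsEnd, Actions, Succ, and Cost.

```python
import math

# Hypothetical toy instance: cities 1..4, forward moves only.
N = 4
COST = {(1, 2): 1, (1, 3): 5, (1, 4): 10, (2, 3): 1, (2, 4): 6, (3, 4): 7}

def is_end(s):
    return s == N

def actions(s):
    # An action is the destination city; only forward moves are allowed.
    return range(s + 1, N + 1)

def succ(s, a):
    return a

def cost(s, a):
    return COST[(s, a)]

def future_cost(s, is_end, actions, succ, cost):
    """Minimum cost from s to an end state, by direct recursion (no caching)."""
    if is_end(s):
        return 0  # base case
    best = math.inf
    for a in actions(s):
        # Inductive case: immediate cost plus future cost of the successor.
        total = cost(s, a) + future_cost(succ(s, a), is_end, actions, succ, cost)
        best = min(best, total)
    return best

print(future_cost(1, is_end, actions, succ, cost))  # 7, via 1 -> 2 -> 4
```

Without caching, this version still explores the whole search tree; it only shows that the recurrence itself is a complete specification of the answer.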
Motivating task

Example: route finding

Find the minimum cost path from city 1 to city n, only moving forward. It costs c_ij to go from i to j.

[Figure: search tree for n = 7, rooted at city 1; the children of each node are all higher-numbered cities, so the same cities reappear in many subtrees.]

Observation: future costs only depend on current city

• Now let us see if we can avoid the exponential time. If we consider the simple route finding problem of traveling from city 1 to city n, the search tree grows exponentially with n.
• However, upon closer inspection, we note that this search tree has a lot of repeated structure. Moreover (and this is important), the future cost of a state (the minimum cost of reaching an end state) depends only on the current city! Therefore, all the subtrees rooted at city 5, for example, have the same minimum cost!
• If we can do that computation just once, we will save a great deal of time. This is the central idea of dynamic programming.
• We’ve already reviewed dynamic programming in the first lecture. The purpose here is to construct one generic dynamic programming solution that will work on any search problem. Again, this highlights the useful division between modeling (defining the search problem) and algorithms (performing the actual search).
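To make the exponential growth concrete, here is a small sketch that counts the nodes of the search tree when every forward move is allowed. The function name `tree_size` is an illustrative choice, not something from the lecture.

```python
def tree_size(city, n):
    """Number of nodes in the search tree rooted at `city` for cities 1..n."""
    if city == n:
        return 1  # a leaf: we have arrived
    # One node for `city` itself plus a full subtree for every later city.
    return 1 + sum(tree_size(j, n) for j in range(city + 1, n + 1))

for n in (5, 7, 10):
    # The tree has 2^(n-1) nodes, while there are only n distinct cities.
    print(n, tree_size(1, n))
```

For n = 7 this gives 64 tree nodes against 7 distinct cities, which is the gap that caching one result per city closes.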
Dynamic programming
State: current city (instead of the past sequence of actions)

[Figure: state graph with one node per city, 1 through 7.]

Exponential saving in time and space!

• Let us collapse all the nodes that have the same city into one. We no longer have a tree, but a directed acyclic graph with only n nodes rather than a number of nodes exponential in n.
• Note that dynamic programming is only useful if we can define a search problem where the number of states is small enough to fit in memory.
Dynamic programming
Algorithm: dynamic programming

def DynamicProgramming(s):
If already computed for s, return cached answer.
If IsEnd(s): return solution
For each action a ∈ Actions(s): ...

[semi-live solution: Dynamic Programming]

Assumption: acyclicity

The state graph defined by Actions(s) and Succ(s, a) is acyclic.

• The dynamic programming algorithm is exactly backtracking search with one twist. At the beginning of the function, we check whether we’ve already computed the future cost for s. If we have, we simply return it (which takes constant time if we use a hash map). Otherwise, we compute it and save it in the cache so we don’t have to recompute it. In this way, we compute the value of every state only once.
• For this particular example, the running time is O(n^2), the number of edges.
• One important point is that the graph must be acyclic for dynamic programming to work. If there are cycles, the computation of a future cost for s might depend on s', which might in turn depend on s. We would loop forever in this case. To deal with cycles, we need uniform cost search, which we will describe later.
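As an illustration of the caching twist described above, here is a minimal sketch of memoized backtracking over a generic search problem. It is one possible rendering, not the lecture's semi-live solution; the function signature, the 10-city instance, and the cost function (j - i)^2 + 3 are all assumptions made for the example.

```python
import math

def dynamic_programming(start, is_end, actions, succ, cost):
    """Return (minimum cost, list of actions) from `start` to an end state.

    Assumes the state graph is acyclic; with cycles the recursion would not terminate.
    """
    cache = {}  # state -> (future cost, best action sequence)

    def recurse(s):
        if s in cache:  # already computed: constant-time lookup
            return cache[s]
        if is_end(s):
            result = (0, [])
        else:
            result = (math.inf, [])
            for a in actions(s):
                sub_cost, sub_actions = recurse(succ(s, a))
                total = cost(s, a) + sub_cost
                if total < result[0]:
                    result = (total, [a] + sub_actions)
        cache[s] = result  # each state is solved exactly once
        return result

    return recurse(start)

# Hypothetical instance: cities 1..10, an action is the destination city.
n = 10
best_cost, best_path = dynamic_programming(
    start=1,
    is_end=lambda s: s == n,
    actions=lambda s: range(s + 1, n + 1),
    succ=lambda s, a: a,
    cost=lambda s, a: (a - s) ** 2 + 3,
)
print(best_cost, best_path)
```

Each of the n states is expanded once and each expansion loops over at most n actions, which matches the O(n^2) bound stated above.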
Dynamic programming

Key idea: state

A state is a summary of all the past actions sufficient to choose future actions optimally.

past actions (all cities): 1 3 4 6

state (current city): 6

• So far, we have only considered the example where the cost depends only on the current city. But let’s try to capture exactly what’s going on more generally.
• This is perhaps the most important idea of this lecture: state. A state is a summary of all the past actions sufficient to choose future actions optimally.
• What state is really about is forgetting the past. We can’t forget everything, because the action costs in the future might depend on what we did in the past. The more we forget, the fewer states we have, and the more efficient our algorithm. So the name of the game is to find the minimal set of states that suffices. It’s a fun game.
Handling additional constraints

Example: route finding

Find the minimum cost path from city 1 to city n, only moving forward. It costs c_ij to go from i to j.
Constraint: Can’t visit three odd cities in a row.

State: (whether previous city was odd, current city)

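[Figure: partial search tree over states (whether previous city was odd, current city)]
(n/a, 1) --3: c_13--> (odd, 3)
(n/a, 1) --4: c_14--> (odd, 4)
(odd, 3) --7: c_37--> (odd, 7)
(odd, 3) --4: c_34--> (odd, 4)
(odd, 4) --5: c_45--> (even, 5)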

• Let’s add a constraint that says we can’t visit three odd cities in a row. If we only keep track of the current city and try to move to a next city, we cannot enforce this constraint because we don’t know what the previous city was. So let’s add the previous city to the state.
• This will work, but we can actually make the state smaller. We only need to keep track of whether the previous city was odd-numbered to enforce this constraint.
• Note that in doing so, we have 2n states rather than n^2 states, which is a substantial saving. So the lesson is to pay attention to what information you actually need in the state.
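The smaller state can be sketched as follows. This is an illustrative example under assumptions: the 7-city instance, the cost function (j - i)^2 + 3, and the choice to encode the "n/a" entry of the start state as False are all hypothetical.

```python
import math
from functools import lru_cache

N = 7  # hypothetical number of cities

def cost(i, j):
    # Hypothetical stand-in for c_ij.
    return (j - i) ** 2 + 3

@lru_cache(maxsize=None)
def future_cost(prev_odd, city):
    """State: (whether the previous city was odd, current city)."""
    if city == N:
        return 0
    best = math.inf
    for nxt in range(city + 1, N + 1):
        # Previous, current, and next all odd would be three odd cities in a row.
        if prev_odd and city % 2 == 1 and nxt % 2 == 1:
            continue
        best = min(best, cost(city, nxt) + future_cost(city % 2 == 1, nxt))
    return best

# Start state (n/a, 1): there is no previous city, encoded here as False.
print(future_cost(False, 1))  # 22, e.g. via 1 -> 2 -> 4 -> 5 -> 7
```

With these costs the unconstrained optimum would be 1 -> 3 -> 5 -> 7 at cost 21, which the constraint rules out, so the one extra bit in the state changes the answer.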
Question (answer in chat)

Objective: travel from city 1 to city n, visiting at least 3 odd cities. What is the minimal state?

State graph
State: (min(number of odd cities visited, 3), current city)

[Figure: state graph with nodes labeled (number of odd cities visited, current city), for counts 1 to 3 and cities 1 to 6; edges move between the copies of each city.]

• Our first thought might be to remember how many odd cities we have visited so far (and the current city).
• But if we’re more clever, we can notice that once the number of odd cities is 3, we don’t need to keep track of whether that number goes up to 4 or 5, etc. So the state we actually need to keep is (min(number of odd cities visited, 3), current city). Thus, our state space is O(n) rather than O(n^2).
• We can visualize what augmenting the state does to the state graph. Effectively, we are copying each node 4 times, and the edges are redirected to move between these copies.
• Note that some states such as (2, 1) aren’t reachable (if you’re in city 1, it’s impossible to have visited 2 odd cities already); the algorithm will not touch those states, and that’s perfectly okay.
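A minimal sketch of this capped-counter state follows. The 6-city instance and the cost function (distance plus a penalty of 10 for entering an odd city) are hypothetical, chosen so that the cheapest unconstrained path would skip the odd cities.

```python
import math
from functools import lru_cache

N = 6  # hypothetical number of cities

def cost(i, j):
    # Hypothetical c_ij: distance plus a penalty of 10 for entering an odd city.
    return (j - i) + (10 if j % 2 == 1 else 0)

@lru_cache(maxsize=None)
def future_cost(num_odd, city):
    """State: (min(number of odd cities visited, 3), current city)."""
    if city == N:
        # Reaching city N only counts if at least 3 odd cities were visited.
        return 0 if num_odd >= 3 else math.inf
    best = math.inf
    for nxt in range(city + 1, N + 1):
        next_odd = min(num_odd + nxt % 2, 3)  # cap the counter at 3
        best = min(best, cost(city, nxt) + future_cost(next_odd, nxt))
    return best

# City 1 is odd and already visited, so the start state is (1, 1).
print(future_cost(1, 1))  # 25: the path must pass through cities 3 and 5
```

Only states reachable from (1, 1) ever enter the cache, which mirrors the remark that unreachable states such as (2, 1) are never touched.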
Question (answer in chat)

Objective: travel from city 1 to city n, visiting more odd than even cities. What is the minimal state?

• An initial guess might be to keep track of the number of even cities and the number of odd cities visited.
• But we can do better. We only have to keep track of the number of odd cities minus the number of even cities, together with the current city. We can write this more formally as (n_1 − n_2, current city), where n_1 is the number of odd cities visited so far and n_2 is the number of even cities visited so far.
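The difference-based state can be sketched in the same style. Again the 6-city instance and the cost function are hypothetical, and the variable name `diff` stands for n_1 − n_2.

```python
import math
from functools import lru_cache

N = 6  # hypothetical number of cities

def cost(i, j):
    # Hypothetical c_ij: distance plus a penalty of 10 for entering an odd city.
    return (j - i) + (10 if j % 2 == 1 else 0)

@lru_cache(maxsize=None)
def future_cost(diff, city):
    """State: (odd cities visited minus even cities visited, current city)."""
    if city == N:
        # Valid only if strictly more odd than even cities were visited.
        return 0 if diff > 0 else math.inf
    best = math.inf
    for nxt in range(city + 1, N + 1):
        step = 1 if nxt % 2 == 1 else -1
        best = min(best, cost(city, nxt) + future_cost(diff + step, nxt))
    return best

# City 1 is odd and already visited, so the start state is (1, 1).
print(future_cost(1, 1))  # 15, e.g. via 1 -> 3 -> 6
```

Going straight from 1 to 6 would cost only 5 but ends with one odd and one even city, so the difference in the state is what forces the detour.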
Summary
• State: summary of past actions sufficient to choose future actions optimally

• Dynamic programming: backtracking search with memoization (potentially exponential savings)

Dynamic programming only works for acyclic graphs... what if there are cycles?

Dynamic Programming Review
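[Figure: state s --Cost(s, a)--> state s' --FutureCost(s')--> end state]

$$
\text{FutureCost}(s) =
\begin{cases}
0 & \text{if } \text{IsEnd}(s) \\
\min_{a \in \text{Actions}(s)} \left[ \text{Cost}(s, a) + \text{FutureCost}(\text{Succ}(s, a)) \right] & \text{otherwise}
\end{cases}
$$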

Key idea: state

A state is a summary of all the past actions sufficient to choose future actions optimally.

