
CS 5/7320
Artificial Intelligence

Search with Uncertainty
AIMA Chapters 4.3-4.5

Slides by Michael Hahsler
with figures from the AIMA textbook

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Types of uncertainty we consider for now*

• Nondeterministic actions: The outcome of an action in a state is uncertain.

• No observations: Sensorless problem.

• Partially observable environments: The agent does not know in what state the environment is.

• Exploration: Unknown environments and online search.

* We will quantify uncertainty with probabilities later.
Remember: Solving Search Problems under Certainty

No uncertainty:
• Deterministic actions with a known transition model, e.g., Result(s1, a) = s4
• Full observability (we have sensors to see the whole environment)

State space: A state completely describes the environment and the agent. The initial state and the goal states are given.

The solution of the planning phase is a sequence of actions, also called a plan, that can be blindly followed: [Suck, Right, Suck]
Consequence of Uncertainty

The solution is typically not a fixed precomputed plan (sequence of actions), but a conditional plan (also called a strategy or policy) that depends on percepts.

Nondeterministic Actions

Nondeterministic Actions
The outcome of actions in the environment is nondeterministic, so the transition model needs to describe this uncertainty.

Example transition:

Results(s1, a) = {s2, s4, s5}

i.e., action a in s1 can lead to one of several states.


Example:
Erratic Vacuum World
Regular fully observable vacuum world, but the action 'Suck' is more powerful and nondeterministic:

a) On a dirty square: it cleans the square and sometimes also cleans dirt on adjacent squares.
b) On a clean square: it sometimes deposits some dirt on the square.
Example: Erratic Vacuum World

Start state

Results(1, Suck) = {5, 7}

Goal states

We need a conditional plan:

[Suck, if State = 5 then [Right, Suck] else []]
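To make the transition model concrete, here is a minimal Python sketch (not from the slides or the AIMA code) of the erratic vacuum world's nondeterministic Results function. It assumes states are written as (agent_location, dirt_left, dirt_right) tuples rather than the numbers 1-8 used in the figure.

# Hypothetical encoding of the erratic vacuum world (not the textbook's code).
# A state is (agent_loc, dirt_left, dirt_right) with agent_loc in {"L", "R"}.

def results(state, action):
    """Return the set of possible successor states of a nondeterministic action."""
    loc, dirt_l, dirt_r = state
    if action == "Left":
        return {("L", dirt_l, dirt_r)}
    if action == "Right":
        return {("R", dirt_l, dirt_r)}
    if action == "Suck":
        here_dirty = dirt_l if loc == "L" else dirt_r
        if here_dirty:
            # Cleans the current square; sometimes also cleans the adjacent square.
            clean_here = (loc, False, dirt_r) if loc == "L" else (loc, dirt_l, False)
            return {clean_here, (loc, False, False)}
        # On a clean square, Suck sometimes deposits dirt.
        deposit = (loc, True, dirt_r) if loc == "L" else (loc, dirt_l, True)
        return {state, deposit}
    return {state}

# Suck in "state 1" (agent left, both squares dirty) has two possible outcomes,
# mirroring Results(1, Suck) = {5, 7} on the slide.
print(results(("L", True, True), "Suck"))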
Finding a Conditional Plan: AND-OR Search Tree

[Figure: AND-OR search tree for the erratic vacuum world. OR nodes choose one action (e.g., Suck, Right); AND nodes contain all possible outcomes. A LOOP leaf means there is no need to continue the search there: the solution is the same as the one above it.]

The solution is shown with bold arrows: [Suck, if State = 5 then [Right, Suck] else []]

A solution is a subtree that
1. has only GOAL leaf nodes,
2. specifies one action at each OR node (state), and
3. includes every outcome of AND nodes.
AND-OR Tree Search: Idea

• Descend the tree by trying an action in each OR node and considering all resulting states of the AND nodes.
• Remove branches (actions) if we cannot find a subtree below them that leads only to goal nodes (see failure in the code on the next slide). Loop nodes can be ignored.
• Stop when we find a subtree that has only goal states in all leaf nodes.
• Construct the conditional plan that represents the subtree starting at the root node:
  [Suck, if State = 5 then [Right, Suck] else []]
AND-OR Recursive DFS Algorithm
= nested if-then-else statements

Annotations on the pseudocode:
• path is only maintained for cycle checking (don't follow loops using path).
• OR nodes: try all possible actions; failure means we found no action that leads to a goal-only subtree.
• AND nodes: try all possible outcomes (= belief state); none may fail. Fail if we find any non-goal subtree.

Notes:
• The DFS search tree is implicitly created using the call stack (recursive algorithm).
• DFS is not optimal! BFS and A* search can be used to find better solutions (e.g., the smallest subtree).
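The slide above preserves only the comments of the AIMA pseudocode (Figure 4.11). Below is a hedged Python sketch of the same recursive AND-OR depth-first search; the problem interface (initial_state, is_goal, actions, results) and the plan representation (a list of actions with a dict for the conditional branch) are assumptions of this sketch, not the textbook's exact API.

def and_or_search(problem):
    """Depth-first AND-OR search; returns a conditional plan or None (= failure)."""
    return or_search(problem, problem.initial_state, [])

def or_search(problem, state, path):
    if problem.is_goal(state):
        return []                                  # empty plan: already at a goal
    if state in path:
        return None                                # loop: don't follow cycles (path)
    for action in problem.actions(state):          # try all possible actions
        plan = and_search(problem, problem.results(state, action), [state] + path)
        if plan is not None:
            return [action] + plan
    return None            # failure: no action leads to a goal-only subtree

def and_search(problem, states, path):
    """All possible outcomes (= the resulting belief state) must succeed."""
    plans = {}
    for s in states:                               # try all possible outcomes
        plan = or_search(problem, s, path)
        if plan is None:
            return None                            # fail on any non-goal subtree
        plans[s] = plan
    if len(plans) == 1:                            # single outcome: no branch needed
        return next(iter(plans.values()))
    return [plans]     # conditional step: "if state == s then follow plans[s]"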
Use of Conditional Plans
• Planning is done by a goal-based agent.
• The conditional plan can be executed by a model-based reflex agent.

Example: After the initial action "Suck", the agent's state (= program counter) points to step 2.

Step  Program
1     [Suck,
2       if State = 5 then
3         [Right,
4           Suck]
        else
4b        []
      ]
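As a hedged illustration of how a model-based reflex agent could execute such a plan, the sketch below walks the nested plan produced by the AND-OR search sketch above; the plan format and the perceive/do_action callbacks are assumptions of this example, not part of the slides.

def execute_plan(plan, perceive, do_action):
    """Walk a conditional plan: plain steps are executed as actions; a dict step
    branches on the currently perceived state (the recursion plays the role of
    the agent's program counter)."""
    for step in plan:
        if isinstance(step, dict):
            # Branch point: pick the subplan matching the perceived state.
            execute_plan(step.get(perceive(), []), perceive, do_action)
        else:
            do_action(step)        # an ordinary action such as "Suck" or "Right"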
Search with no
Observations
Using Actions to
“Coerce” the World into
Known States
No Observations

Sensorless problem = unobservable environment, also called a conformant problem.

Why is this useful?

• Example: A doctor prescribes a broad-spectrum antibiotic instead of performing time-consuming blood work for a more specific antibiotic. This saves time and money.

• Basic idea: Find a solution (a sequence of actions) that works (reasonably well) from any state and then just blindly execute it (open-loop system).
Belief State
• The agent does not know exactly which state it is in.
• However, it may know that it is in one of a set of possible states. This set is called the belief state of the agent.
• Example: b = {s2, s4, s6}
Actions to Coerce the World into States
• Actions can reduce the number of possible states.
• Example: Deterministic vacuum world. The agent does not know its position or the dirt distribution.
• Initial belief state: {1, 2, 3, 4, 5, 6, 7, 8}

[Figures: the belief state after applying Right, and after applying Suck; the goal states are marked.]
Actions to Coerce the World into States
• The action sequence [Right, Suck, Left, Suck] coerces the world into the goal state 7. It works from any initial state!
• There are no observations, so there is no need for a conditional plan.
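A small sketch of this coercion, assuming a deterministic two-square vacuum world encoded as (agent_location, dirt_left, dirt_right) tuples instead of the state numbers 1-8: applying the fixed action sequence shrinks the belief state until a single (goal) state remains.

def result(state, action):
    """Deterministic vacuum world: Left/Right move the agent, Suck cleans its square."""
    loc, dirt_l, dirt_r = state
    if action == "Right":
        return ("R", dirt_l, dirt_r)
    if action == "Left":
        return ("L", dirt_l, dirt_r)
    if action == "Suck":
        return (loc, False, dirt_r) if loc == "L" else (loc, dirt_l, False)
    return state

# Initial belief state: all 8 states (agent position and dirt distribution unknown).
belief = {(loc, dl, dr) for loc in "LR" for dl in (True, False) for dr in (True, False)}

for action in ["Right", "Suck", "Left", "Suck"]:
    belief = {result(s, action) for s in belief}       # apply the action to every state
    print(action, "->", len(belief), "possible states")

print(belief)   # a single state: agent on the left, both squares clean (state 7)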
Example: The Reachable Belief-State Space for the Deterministic, Sensorless Vacuum World

The size of the belief-state space depends on the number of states N:

|P(S)| = 2^N = 2^8 = 256

Only a small fraction (12 belief states) is reachable from the initial belief state.

There are no observations, so we get a solution sequence from an initial belief state:
[Right, Suck, Left, Suck]
Finding a Solution Sequence
Note: The size of the belief-state space makes this impractical for larger problems!

Formulate as a regular search problem and solve it with DFS, BFS, or A*:

• States: all belief states (= the powerset P(S) of the set of states, of size 2^N for N states).
• Initial state: often the belief state consisting of all states.
• Actions: the actions of a belief state are the union of the possible actions of all the states it contains.
• Transition model: b' = Results(b, a) = {s' : s' = Result(s, a) and s ∈ b}
• Goal test: are all states in the belief state goal states?
• Simplifying property: if a belief state (e.g., b1 = {1,2,3,4,5}) is solvable (i.e., there is a sequence of actions that coerces all of its states into goal states), then belief states that are subsets (e.g., b2 = {2,5}) are also solved by the same action sequence. This is used to prune the search tree.

Other approach:
• Incremental belief-state search: generate a solution that works for one state and check whether it also works for all other states. If it does not, modify the solution slightly. This is similar to local search.
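A hedged sketch of this belief-state formulation, written against an abstract underlying problem (the names actions, result, and is_goal are assumptions): belief states are frozensets of physical states, so any standard graph search (BFS, A*) can run over them unchanged.

from itertools import chain

def belief_actions(belief, actions):
    """Actions of a belief state: the union of the actions of all member states."""
    return set(chain.from_iterable(actions(s) for s in belief))

def belief_result(belief, action, result):
    """Transition model over belief states: b' = { Result(s, a) : s in b }."""
    return frozenset(result(s, action) for s in belief)

def belief_is_goal(belief, is_goal):
    """Goal test: every physical state in the belief state must be a goal state."""
    return all(is_goal(s) for s in belief)

# Pruning property: if b1 is solvable, every subset b2 of b1 is solved by the same
# action sequence, so the search can discard belief states that are supersets of
# belief states already known to be solvable.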
Case Study

[Figure: a rectangular room with marked dimensions (1 m, 2 m, 3 m, 8 m), the goal location marked x, and the agent's starting area.]

The agent can move up, down, right, and left. The agent has no sensors and does not know its current location.

1. Can you navigate to the goal location? How?

2. What would you need to know about the environment?

3. What type of agent can do this?
Partially Observable
Environments
Using Observations to
Learn About the State
Percepts and Observability
• Many problems cannot be solved efficiently without sensing (e.g., the 8-puzzle).
• We need to see at least one square.

Percept function: Percept(s), where s is the state.

Example: Percept(s) = Tile7

• Fully observable: Percept(s) = s
• Sensorless: Percept(s) = null
• Partially observable: Percept(s) = o, where o is called an observation and tells us something about s.

Problem: Many states (different orders of the hidden tiles) can produce the same observation!
Use Observations to Learn About the State

Agents choose an action and then receive an observation.

Idea: Observations can be used to learn about the agent's state.

Assume we have a current belief state b (i.e., the set of states we could be in).

Prediction for action: Choose an action a and compute the new belief state that results from the action:

b̂ = Predict(b, a) = ⋃ over s ∈ b of Predict(s, a)

Update with observation: You receive an observation o and keep only the states that are consistent with the new observation. The belief after observing o is:

b_o = Update(b̂, o) = {s : s ∈ b̂ and Percept(s) = o}

Both steps in one: b ← Update(Predict(b, a), o)
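A brief sketch of this predict-update cycle, assuming a possible-outcomes function results(s, a) and a percept function percept(s) are supplied; the names mirror the notation above, but the code itself is illustrative only.

def predict(belief, action, results):
    """b_hat: every state reachable from some state in b by the chosen action."""
    return {s2 for s in belief for s2 in results(s, action)}

def update(belief, observation, percept):
    """Keep only the states in b_hat that are consistent with the observation."""
    return {s for s in belief if percept(s) == observation}

def estimate(belief, action, observation, results, percept):
    """Both steps in one: b <- Update(Predict(b, a), o)."""
    return update(predict(belief, action, results), observation, percept)

With a model of the local-sensing vacuum world plugged in for results and percept, this cycle reproduces the example on the next slide: Update(Predict({1, 3}, Right), [R, Dirty]) = {2}.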


Example: Deterministic Local-Sensing Vacuum World

Predict for action a, then update with observation o.

Observation received: [R, Dirty]

b ← Update(Predict(b, a), o)

Update(Predict({1, 3}, Right), [R, Dirty]) = {2}
Solving Partially Observable Problems

Use an AND-OR tree of belief states to create a conditional plan.

[Figure: AND-OR tree starting at the initial belief state. OR nodes choose an action; the predict step computes the predicted belief state; the update step branches on the possible percepts [L,Clean], [R,Dirty], and [R,Clean] (AND nodes).]

Solution: [Suck, Right, if b = {6} then Suck else []]

b = {6} is the result of the update with o = [R, Dirty].
State Estimation and Approximate Belief States

• Agents choose an action and then receive an observation from the environment.
• The agent keeps track of its belief state using the following update:

b ← Update(Predict(b, a), o)

• This process is often called
  • monitoring,
  • filtering, or
  • state estimation.

• The agent needs to be able to update its belief state following observations in real time! For many practical applications, there is only time to compute an approximate belief state. These approximate methods are used in control theory and reinforcement learning.
Case Study: Partially Observable 8-Puzzle

1. Give a problem description for each step:
• States:
• Initial state:
• Actions:
• Transition model:
• Goal test:
• Percept function:

2. The problem can be solved using an AND-OR tree, but is there an easier solution?

a. What type of agent do we use?

b. What algorithms can be used?
Exploration
Unknown Environment and
Online Search
Online Search
• Recall offline search: Create a plan using the state space as a model before
taking any action. The plan can be a sequence of actions or a conditional plan
to account for uncertainty.

• The agent uses the transition function to predict the consequence of actions.
What if the transition function is unknown?

• Online search explores the real world one action at a time. Prediction is
replaced by “act” and update by “observe.”

Act Observe Act Observe Act …


• Useful for
  • Real-time problems: When offline computation takes too long and there is a penalty for sitting around and thinking.
  • Nondeterministic domains: Only focus on what actually happens instead of planning for everything!
  • Unknown environments: The agent has no complete model of how the environment works. It needs to explore an unknown state space and/or learn what actions do, i.e., it needs to learn the transition function f : S × A → S
Design Considerations for Online Search

• Knowledge: What does the agent already know about the outcome of actions (the transition function)? E.g.,
  • Does going north and then south lead back to the same location?
  • Where are the walls in the maze?

• Safely explorable state space/world: There are no irreversible actions (e.g., traps, cliffs). At the very least, the agent needs to be able to avoid such actions.

• Exploration order: Expanding nodes in a local order is more efficient if you must execute the actions to get observations: use depth-first search with backtracking instead of BFS or A* search.
Online Search: Model-based Agent Program for an Unknown Environment

The environment is deterministic but
• only partially observable (Percept(s) = current location; the state space may be unknown), and
• the transition model (the function result) is unknown.

Approach: The algorithm builds the map result(s, a) → s' by trying all actions, and it backtracks when all actions in a state have been explored. It learns the result function (= the transition function).

Annotations on the pseudocode (a sketch follows below):
• untried is the "frontier".
• unbacktracked stores the current path.
• Record each transition that is found.
• Keep breadcrumbs to go back.
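The slide keeps only the margin annotations of the textbook's ONLINE-DFS-AGENT pseudocode (AIMA Figure 4.21). The sketch below is a hedged Python rendering of the same idea; the bookkeeping tables (result, untried, unbacktracked) follow the annotations above, while details such as the callable-agent interface are assumptions of this sketch.

class OnlineDFSAgent:
    """Hedged sketch: explores an unknown, deterministic, safely explorable
    environment by trying untried actions and backtracking along breadcrumbs."""

    def __init__(self, actions, is_goal):
        self.actions = actions           # actions(state) -> iterable of legal actions
        self.is_goal = is_goal
        self.result = {}                 # learned transition model: (s, a) -> s'
        self.untried = {}                # state -> actions not yet tried (the "frontier")
        self.unbacktracked = {}          # state -> breadcrumb states to go back to
        self.s = None                    # previous state
        self.a = None                    # previous action

    def __call__(self, s_prime):         # percept = the current state s'
        if self.is_goal(s_prime):
            return None                  # stop: goal reached
        if s_prime not in self.untried:
            self.untried[s_prime] = list(self.actions(s_prime))
        if self.s is not None and self.result.get((self.s, self.a)) != s_prime:
            self.result[(self.s, self.a)] = s_prime                    # record transition
            self.unbacktracked.setdefault(s_prime, []).append(self.s)  # drop a breadcrumb
        if self.untried[s_prime]:
            self.a = self.untried[s_prime].pop()          # explore a new action
        elif self.unbacktracked.get(s_prime):
            back_to = self.unbacktracked[s_prime].pop()
            # Pick the (already learned) action that leads back to the breadcrumb;
            # this assumes every action can be undone (safely explorable world).
            self.a = next(a for (s, a), s2 in self.result.items()
                          if s == s_prime and s2 == back_to)
        else:
            return None                  # nothing left to try and nowhere to go back
        self.s = s_prime
        return self.a

Each call receives the current state as the percept and returns the next action to execute, or None to stop.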


Case Study: DFS with Backtracking for an Unknown Maze

• We can only see adjacent squares and don't know the location of the goal!
• We cannot plan; we must explore by walking around! The transition function is unknown.
• We only know what we have already explored.
• A simple method is to store the path (unbacktracked) for backtracking, so we can get back to untried paths (untried ≈ the frontier) when we run into a dead end (i.e., use breadcrumbs).
Important concepts that you
should be able to explain and
use now…

• Difference between solution types:
  a. a fixed action sequence,
  b. a conditional plan (also called a strategy or policy), and
  c. exploration.
• What are belief states?
• How actions can be used to coerce the world into
known states.
• How observations can be used to learn about the
state: State estimation with repeated predict and
update steps.
• The use of AND-OR trees to solve small problems.
