0% found this document useful (0 votes)

4 views

02_agents

The document discusses intelligent agents as entities that perceive their environment through sensors and act upon it via actuators, outlining the characteristics and types of agents. It introduces the PEAS framework (Performance measure, Environment, Actuators, Sensors) for defining agent behavior and rationality, emphasizing the importance of maximizing expected performance. Additionally, it categorizes agents into types such as simple reflex, model-based reflex, goal-based, and utility-based agents, and highlights the role of learning in improving agent performance.

Uploaded by

wwahib2

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views

02_agents

Uploaded by

wwahib2

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 38

CS 5/7320

Artificial
Intelligence

Intelligent Agents
AIMA Chapter 2

Slides by Michael Hahsler

based on slides by Svetlana Lazepnik
with figures from the AIMA textbook.

This work is licensed under a Creative Commons Image: "Robot at the British Library Science Fiction Exhibition"
Attribution-ShareAlike 4.0 International License.
by BadgerGravling
Outline

PEAS
(Performance
What is an
measure, Environment
intelligent Rationality Agent types
Environment, types
agent?
Actuators,
Sensors)
Outline

PEAS
(Performance
What is an
measure, Environment
intelligent Rationality Agent types
Environment, types
agent?
Actuators,
Sensors)
What is an Agents?
• An agent is anything that can be viewed as perceiving its environment
through sensors and acting upon that environment through actuators.

• Control theory: A closed-loop control system (= feedback control system)

is a set of mechanical or electronic devices that automatically regulate a
process variable to a desired state or set point without human interaction.
The agent is called a controller.
• Softbot: Agent is a software program that runs on a host device.
Agent Function and Agent Program
The agent function maps from the set of all possible percept sequences 𝑃𝑃∗ to the
set of actions 𝐴𝐴 formulated as an abstract mathematical function.

𝒑𝒑

𝒂𝒂 = 𝒇𝒇(𝒑𝒑)
𝑓𝑓 ∶ 𝑃𝑃∗ → 𝐴𝐴
𝒂𝒂

The agent program is a concrete implementation of this function for a given

physical system.

Agent = architecture (hardware) + agent program (implementation of 𝑓𝑓)

• Sensors
• Memory
• Computational power
Example:
Vacuum-cleaner World
• Percepts:
Location and status,
e.g., [A, Dirty]
• Actions:
Most recent
Left, Right, Suck, NoOp
Percept 𝑝𝑝

Agent function: 𝑓𝑓 ∶ 𝑃𝑃∗ → 𝐴𝐴 Implemented agent program:

Percept Sequence Action function Vacuum-Agent([location, status])

[A, Clean] Right returns an action 𝑎𝑎
[A, Dirty] Suck
… if status = Dirty then return Suck
[A, Clean], [B, Clean] Left else if location = A then return Right
else if location = B then return Left
…
[A, Clean], [B, Clean], [A, Dirty] Suck
…

Problem: This table can become infinitively large!

Outline

PEAS
(Performance
What is an
measure, Environment
intelligent Rationality Agent types
Environment, types
agent?
Actuators,
Sensors)
Rational Agents: What is Good Behavior?
Foundation
• Consequentialism: Evaluate behavior by its consequences.
• Utilitarianism: Maximize happiness and well-being.

Definition of a rational agent:

“For each possible percept sequence, a rational agent should select an action that
maximizes its expected performance measure, given the evidence provided by the
percept sequence and the agent’s built-in knowledge.”

• Performance measure: An objective criterion for success of an agent's behavior (often

called utility function or reward function).
• Expectation: Outcome averaged over all possible situations that may arise.

This means:
• Rationality is only an ideal
• Rationality ≠ Omniscience (rational agents can make mistakes if percepts and
knowledge do not suffice to make a good decision)
• Rationality ≠ Perfection (rational agents maximize expected outcomes not actual
outcomes)
• It is rational to explore and learn (i.e., use percepts to supplement prior knowledge
and become autonomous)
Example:
Vacuum-cleaner World
• Percepts:
Location and status,
e.g., [A, Dirty]
• Actions:
Left, Right, Suck, NoOp
Agent function: Implemented agent program:

Percept Sequence Action function Vacuum-Agent([location, status])

[A, Clean] Right returns an action
[A, Dirty] Suck
… if status = Dirty then return Suck
[A, Clean], [B, Clean] Left else if location = A then return Right
else if location = B then return Left
…

What could be a performance measure?

Is this agent program rational?
Outline

PEAS
(Performance
What is an
measure, Environment
intelligent Rationality Agent types
Environment, types
agent?
Actuators,
Sensors)
Problem Specification: PEAS Performance
measure

Performance
Environment Actuators Sensors
measure

Components and Defines

Defines utility Defines
rules of how actions percepts
and what is available
affect the
rational actions
environment.
Example: Automated Taxi Driver

Performance
Environment Actuators Sensors
measure
• Safe • Roads • Steering • Cameras
• fast • other traffic wheel • sonar
• legal • pedestrians • accelerator • speedometer
• comfortable • customers • brake • GPS
trip • signal • Odometer
• maximize • horn • engine
profits sensors
• keyboard
Example: Spam Filter

Performance
Environment Actuators Sensors
measure
• Accuracy: • A user’s email • Mark as spam • Incoming
Minimizing account • delete messages
false • email server • etc. • other
positives, information
false about user’s
negatives account
Outline

PEAS
(Performance
What is an
measure, Environment
intelligent Rationality Agent types
Environment, types
agent?
Actuators,
Sensors)
Environment Types
Fully observable: The agent's sensors
give it access to the complete state of Partially observable: The agent cannot see all
the environment. The agent can “see” vs. aspects of the state. E.g., it can’t see through
the whole environment. walls

Stochastic: The next state cannot be determined

from the current state and the action (there is
Deterministic: The next state of the
some randomness).
environment is completely determined by vs.
Strategic: The environment is stochastic and
the current state and the agent’s action.
adversarial. It chooses actions strategically to
harm the agent. E.g., a game where the other
player is modeled as part of the environment.

Known: The agent knows the rules of the

Unknown: The agent cannot predict the outcome
environment and can predict the vs. of actions. It needs to learn the transition
outcome of actions. It knows the
function by trying actions.
transition function.
Environment Types
Static: The environment is not changing
Dynamic: The environment is changing while
while agent is deliberating. vs. the agent is deliberating.
Semidynamic: the environment is static,
but the agent's performance score
depends on how fast it acts.

Discrete: The environment provides a fixed Continuous: Percepts, actions, state variables or
number of distinct percepts, actions, and vs. time are continuous leading to an infinite state,
environment states. Time can also evolve in a percept or action space.
discrete or continuous fashion.

Episodic: Episode = a self-contained

Sequential: Actions now affect the outcomes
sequence of actions. The agent's choice of vs. later. E.g., learning makes problems
action in one episode does not affect the
sequential.
next episodes. The agent does the same task
repeatedly.

Single agent: An agent operating by itself in vs. Multi-agent: Agent cooperate or compete in the
an environment. same environment.
Examples of Different Environments

Word jumble Chess with Scrabble Taxi driving

solver a clock

Observable Fully Fully Partially Partially

Deterministic Stochastic
Deterministic Strategic Stochastic
+Strategic
Episodic? Episodic Episodic Episodic Sequential

Static Static Semidynamic Static Dynamic

Discrete Discrete Discrete Discrete Continuous

Single agent Single Multi* Multi* Multi*

* Can be models as a single agent problem with the other agent(s) in the environment.
Outline

PEAS
(Performance
What is an
measure, Environment
intelligent Rationality Agent types
Environment, types
agent?
Actuators,
Sensors)
Designing a Rational Agent
Remember the definition of a
rational agent:
𝑓𝑓 “For each possible percept sequence, a
action rational agent should select an action
that maximizes its expected
performance measure, given the
evidence provided by the percept
sequence and the agent’s built-in
knowledge.”

Percept to the
Agent Function agent function
• Assess Note: Everything
𝑓𝑓 outside the agent
performance function can be
measure seen as the
• Remember environment.
percept sequence Action from the
• Built-in knowledge agent function
Hierarchy of Agent Types

Utility-based agents

Goal-based agents

Model-based reflex agents

Simple reflex agents

Simple Reflex Agent
• Uses only built-in knowledge in the form of rules that select action only based
on the current percept. This is typically very fast!
• The agent does not know about the performance measure! But well-designed
rules can lead to good performance.
• The agent needs no memory and ignores all past percepts.

𝑎𝑎 = 𝑓𝑓(𝑝𝑝)

The interaction is a sequence: 𝑝𝑝0 , 𝑎𝑎0 , 𝑝𝑝1 , 𝑎𝑎1 , 𝑝𝑝2 , 𝑎𝑎2 , … 𝑝𝑝𝑡𝑡 , 𝑎𝑎𝑡𝑡 , …

Example: A simple vacuum cleaner that uses rules based on its current sensor input.
Model-based Reflex Agent
• Maintains a state variable to keeps track of aspects of the environment that
cannot be currently observed. I.e., it has memory and knows how the
environment reacts to actions.
• The state is updated using the percept.
• There is now more information for the rules to make better decisions.

𝑎𝑎 = 𝑓𝑓(𝑝𝑝, 𝑠𝑠)

The interaction is a sequence: 𝑠𝑠0 , 𝑎𝑎0 , 𝑝𝑝1 , 𝑠𝑠1 , 𝑎𝑎1 , 𝑝𝑝2 , 𝑠𝑠2 , 𝑎𝑎2 , 𝑝𝑝3 , … , 𝑝𝑝𝑡𝑡 , 𝑠𝑠𝑡𝑡 , 𝑎𝑎𝑡𝑡 , …

Example: A vacuum cleaner that remembers were it has already cleaned.

State Representation
States help to keep track of the environment and the agent in the environment. This is
often also called the system state. The representation can be
• Atomic: Just a label for a black box. E.g., A, B
• Factored: A set of attribute values called fluents.
E.g., [location = left, status = clean, temperature = 75 deg. F]

Action causes Variables describing the

transition system state are called
“fluents”

We often construct atomic labels from factored information. E.g.: If the agent’s state is
the coordinate x = 7 and y = 3, then the atomic state label could be the string “(7, 3)”.
With the atomic representation, we can only compare if two labels are the same. With
the factored state representation, we can reason more and calculate the distance
between states!

State Space: The set of all possible states 𝑆𝑆. This set is typically very large!
Old-school vs. Smart Thermostat

Old-school thermostat Smart thermostat

Percepts States Percepts States
Old-school vs. Smart Thermostat
Change
Set temperatur
temperature e when you
range are too
cold/warm.

Old-school thermostat Smart thermostat

Percepts Percepts States
States
• Temp: deg. F Factored states
• Outside temp. • Estimated
• Weather report time to cool
temperature: • Energy the house
Low, ok, high No states need curtailment • Someone
• Someone walking home?
by • How long till
• Someone changes someone is
temp. coming
• Day & time home?
• … • A/C: on, off
Goal-based Agent
• The agent has the task to reach a defined goal state and is then finished.
• The agent needs to move towards the goal. It can use search algorithms to
plan actions that lead to the goal.
• Performance measure: the cost to reach the goal.

𝑇𝑇

𝑎𝑎 = argmin𝑎𝑎0∈A � 𝑐𝑐𝑡𝑡 � 𝑠𝑠𝑇𝑇 ∈ 𝑆𝑆 𝑔𝑔𝑔𝑔𝑔𝑔𝑔𝑔

𝑡𝑡=0

Sum of the cost

of a planed sequence of
actions that leads to a
goal state

The interaction is a sequence: 𝑠𝑠0 , 𝑎𝑎0 , 𝑝𝑝1 , 𝑠𝑠1 , 𝑎𝑎1 , 𝑝𝑝2 , 𝑠𝑠2 , 𝑎𝑎2 , … , 𝑠𝑠 𝑔𝑔𝑔𝑔𝑔𝑔𝑔𝑔
cost
Example: Solving a puzzle. What action gets me closer to the solution?
Utility-based Agent
• The agent uses a utility function to evaluate the desirability of each possible
states. This is typically expressed as the reward of being in a state 𝑅𝑅(𝑠𝑠).
• Choose actions to stay in desirable states.
• Performance measure: The discounted sum of expected utility over time.
∞

𝑎𝑎 = arg𝑚𝑚𝑚𝑚𝑚𝑚𝑎𝑎0∈A 𝔼𝔼 � 𝛾𝛾 𝑡𝑡 𝑟𝑟𝑡𝑡
𝑡𝑡=0

Expected future
discounted reward

Techniques: Markov decision

processes, reinforcement learning

The interaction is a sequence: 𝑠𝑠0 , 𝑎𝑎0 , 𝑝𝑝1 , 𝑠𝑠1 , 𝑎𝑎1 . 𝑝𝑝2 , 𝑠𝑠2 , 𝑎𝑎2 , …
reward
Example: An autonomous Mars rover prefers states where its battery is not critically low.
Agents that Learn

The learning element modifies the agent program (reflex-based, goal-

based, or utility-based) to improve its performance.

How is the agent

currently performing?

Update the agent

Agent
program program

Exploration
Example: Smart Thermostat
Change
temperature
when you are
too
cold/warm.

Smart thermostat
Percepts States
• Temp: deg. F Factored states
• Outside temp. • Estimated
• Weather report time to cool
• Energy the house
curtailment • Someone
• Someone walking home?
by • How long till
• Someone changes someone is
temp. coming
• Day & time home?
• … • A/C: on, off
Example: Modern Vacuum Robot
Features are:
• Control via App
• Cleaning Modes
• Navigation
• Mapping
• Boundary blockers

Source: https://www.techhive.com/article/3269782/best-robot-
vacuum-cleaners.html
PEAS Description of a
Modern Robot Vacuum

Performance
Environment Actuators Sensors
measure
What Type of Intelligent Agent is a
Modern Robot Vacuum?

Does it collect utility over

Utility-based agents time? How would the utility for
each state be defined?
Is it learning?

Goal-based agents Does it have a goal state?

Does it store state information.

Model-based reflex agents How would they be defined
(atomic/factored)?

Does it use simple rules based

Simple reflex agents on the current percepts?

Check what applies

What Type of Intelligent
Agent is this?
PEAS Description of ChatGPT

Performance
Environment Actuators Sensors
measure
How does ChatGPT work?
What Type of Intelligent Agent is
ChatGPT?
Does it collect utility over
Utility-based agents time? How would the utility for
each state be defined?
Is it learning?

Goal-based agents Does it have a goal state?

Does it store state information.

Model-based reflex agents How would they be defined
(atomic/factored)?

Does it use simple rules based

Simple reflex agents
on the current percepts?

Answer the following questions:

Check what applies • Does ChatGPT pass the Touring test?
• Is ChatGPT a rational agent? Why?

We will talk about knowledge-based agents later.

Intelligent Systems as
Sets of Agents:
Self-driving Car

Make sure the passenger has a pleasant drive High-level

Utility-based agents (not too much sudden breaking = utility) planning
It should learn!

Goal-based agents Plan the route to the destination.

Remember where every other car is and

Model-based reflex agents calculate where they will be in the next few
seconds.
React to unforeseen issues like a child Low-level
Simple reflex agents running in front of the car quickly.
planning
Conclusion

Intelligent agents inspire the research areas of modern AI

Stay within given

Search for a goal Optimize functions constraints
(e.g., navigation). (e.g., utility). (constraint satisfaction problem;
e.g., reach the goal without
running out of power)

Learn a good agent

Deal with uncertainty program from data Sensing
(e.g., current traffic on the (e.g, natural language
road). and improve over time processing, vision)
(machine learning).

Aptitude Maths Oda
No ratings yet
Aptitude Maths Oda
100 pages
Lokfix E75 TDS 8072020
No ratings yet
Lokfix E75 TDS 8072020
5 pages
Perkins - Coleridge's Ideal Realism - An Alternative To The Doctors of The Absolute (1995)
No ratings yet
Perkins - Coleridge's Ideal Realism - An Alternative To The Doctors of The Absolute (1995)
16 pages
Prdaan List Electrical
No ratings yet
Prdaan List Electrical
54 pages
02 Agents
No ratings yet
02 Agents
38 pages
02_agents
No ratings yet
02_agents
36 pages
02_agents
No ratings yet
02_agents
41 pages
02_agents
No ratings yet
02_agents
68 pages
Lecture 6 AI
No ratings yet
Lecture 6 AI
19 pages
Lec02 Agents
No ratings yet
Lec02 Agents
49 pages
Agents 2pp
No ratings yet
Agents 2pp
15 pages
2.AIlec 3-4
No ratings yet
2.AIlec 3-4
38 pages
Chapter 2 AI
No ratings yet
Chapter 2 AI
33 pages
Chapter 2
No ratings yet
Chapter 2
28 pages
CH 2 Intelligent Agents
No ratings yet
CH 2 Intelligent Agents
29 pages
Intelligent Agents: Lecturer: DR - Nguyen Thanh Binh
No ratings yet
Intelligent Agents: Lecturer: DR - Nguyen Thanh Binh
39 pages
CSC 325 AI Lecture02 Problem Solving Fall2024 22092024 113649am
No ratings yet
CSC 325 AI Lecture02 Problem Solving Fall2024 22092024 113649am
72 pages
2
No ratings yet
2
57 pages
Chapter 2 Type of Agentsfinal Edition
No ratings yet
Chapter 2 Type of Agentsfinal Edition
57 pages
MODULE 1 chapter02
No ratings yet
MODULE 1 chapter02
59 pages
Ch 1 2 Agents Norvig Book
No ratings yet
Ch 1 2 Agents Norvig Book
50 pages
Week 2 Intelligent Agents UPDATED
No ratings yet
Week 2 Intelligent Agents UPDATED
55 pages
Lecture02 Intelligent Agents
No ratings yet
Lecture02 Intelligent Agents
45 pages
Is-Lecture 2 (Intelligent Agents)
No ratings yet
Is-Lecture 2 (Intelligent Agents)
51 pages
Intelliget Agent 2 How To Design
No ratings yet
Intelliget Agent 2 How To Design
24 pages
AI-Intelligent Agents: Dr. Azhar Mahmood
No ratings yet
AI-Intelligent Agents: Dr. Azhar Mahmood
45 pages
Cse473au22 Agents
No ratings yet
Cse473au22 Agents
20 pages
2 Chapter - 2 - AI Intelligent Agents
No ratings yet
2 Chapter - 2 - AI Intelligent Agents
44 pages
4 Unit Merged
No ratings yet
4 Unit Merged
121 pages
AI QB Answer Key
No ratings yet
AI QB Answer Key
22 pages
Session 2
No ratings yet
Session 2
14 pages
COMP6275 - Artificial Intelligence: Week 1
No ratings yet
COMP6275 - Artificial Intelligence: Week 1
30 pages
Chapter-2 (1)
No ratings yet
Chapter-2 (1)
41 pages
Unit 1 (Intelligent Agents)
No ratings yet
Unit 1 (Intelligent Agents)
43 pages
Lecture 002
No ratings yet
Lecture 002
30 pages
Artificial-Intelligence Chapter2
No ratings yet
Artificial-Intelligence Chapter2
26 pages
2. Chapter2 Agent
No ratings yet
2. Chapter2 Agent
31 pages
Intelligent Agents - Chapter 2
No ratings yet
Intelligent Agents - Chapter 2
64 pages
m2 Agentscomplete
No ratings yet
m2 Agentscomplete
45 pages
AI Lec2-1
No ratings yet
AI Lec2-1
24 pages
m2 Agents
No ratings yet
m2 Agents
42 pages
Lecture03-AI-UMT-Spring 24
No ratings yet
Lecture03-AI-UMT-Spring 24
21 pages
Lec03 Agents-1689184299238
No ratings yet
Lec03 Agents-1689184299238
18 pages
2 - Intelligent Agents
No ratings yet
2 - Intelligent Agents
33 pages
Agents and Environments
No ratings yet
Agents and Environments
20 pages
Intelligent Agents
No ratings yet
Intelligent Agents
62 pages
CoSc4142 AI Chapter 2 (1)
No ratings yet
CoSc4142 AI Chapter 2 (1)
33 pages
Module 1 Chapter 2
No ratings yet
Module 1 Chapter 2
60 pages
Agents and environments (1) cour 4
No ratings yet
Agents and environments (1) cour 4
35 pages
2023 Slide2 Agents Eng
No ratings yet
2023 Slide2 Agents Eng
35 pages
Lect02 Agent
No ratings yet
Lect02 Agent
44 pages
Artificial Intelligence Slide 2
No ratings yet
Artificial Intelligence Slide 2
38 pages
AI Lecture2 - 3
No ratings yet
AI Lecture2 - 3
43 pages
Artificial Intelligence Chapter2
No ratings yet
Artificial Intelligence Chapter2
26 pages
AI Lecture2 - 30-10-21
No ratings yet
AI Lecture2 - 30-10-21
58 pages
Chapter2 IntelligentAgents
No ratings yet
Chapter2 IntelligentAgents
53 pages
2024 Lecture02 IntelligentAgents
No ratings yet
2024 Lecture02 IntelligentAgents
62 pages
Lecture 03
No ratings yet
Lecture 03
13 pages
AI_CE415_Lecture02-2025
No ratings yet
AI_CE415_Lecture02-2025
47 pages
AI_Module 2
No ratings yet
AI_Module 2
25 pages
Chapter 2 Intelligent Agents
No ratings yet
Chapter 2 Intelligent Agents
59 pages
Lesson 10001
No ratings yet
Lesson 10001
29 pages
Intelligent Agents
No ratings yet
Intelligent Agents
36 pages
Geometric Feature Learning: Unlocking Visual Insights through Geometric Feature Learning
From Everand
Geometric Feature Learning: Unlocking Visual Insights through Geometric Feature Learning
Fouad Sabry
No ratings yet
Iec 61724-1 2017 Version-Selection of Pyranometers v2008
100% (1)
Iec 61724-1 2017 Version-Selection of Pyranometers v2008
4 pages
Photovoltaic System Performance - Part 1: Monitoring: Irish Standard I.S. EN 61724-1:2017
No ratings yet
Photovoltaic System Performance - Part 1: Monitoring: Irish Standard I.S. EN 61724-1:2017
20 pages
5peb1001 6656xb 213231 C
No ratings yet
5peb1001 6656xb 213231 C
1 page
Acet 22510 Door Entry Handset Data Sheet
No ratings yet
Acet 22510 Door Entry Handset Data Sheet
2 pages
Digital Control Strategy For Harmonic Compensation of Dynamic Voltage Restorer
No ratings yet
Digital Control Strategy For Harmonic Compensation of Dynamic Voltage Restorer
9 pages
15-Servo Drive System With Simulink and Simplorer PDF
No ratings yet
15-Servo Drive System With Simulink and Simplorer PDF
13 pages
Thesis Report OmarJimenez CE EWI
No ratings yet
Thesis Report OmarJimenez CE EWI
138 pages
06 - MVD - FIR - Design PDF
No ratings yet
06 - MVD - FIR - Design PDF
29 pages
Creating Axi-Lite Custom IP' in Vivado: Lab For COMP4601
100% (1)
Creating Axi-Lite Custom IP' in Vivado: Lab For COMP4601
33 pages
1727.UCD3138 Control Theory
100% (1)
1727.UCD3138 Control Theory
49 pages
Ijest10 02 12 191
No ratings yet
Ijest10 02 12 191
4 pages
Propped Cantilever Beam Tutorial Abaqus Final
0% (1)
Propped Cantilever Beam Tutorial Abaqus Final
36 pages
әдістеме 1 модуль
No ratings yet
әдістеме 1 модуль
10 pages
Name Contact
No ratings yet
Name Contact
8 pages
111 Grounding-Transformers-vs.-Ground-Switches-Poster - SargentLundy
No ratings yet
111 Grounding-Transformers-vs.-Ground-Switches-Poster - SargentLundy
1 page
SOA - Salary Loan
No ratings yet
SOA - Salary Loan
1 page
Bom
No ratings yet
Bom
3 pages
الممارسة الرياضية لدى الإناث في المؤسسات التربوية دراسة ميدانية للمتوسطات التابعة لولاية الجزائر شرق
No ratings yet
الممارسة الرياضية لدى الإناث في المؤسسات التربوية دراسة ميدانية للمتوسطات التابعة لولاية الجزائر شرق
16 pages
Case 13 Jordan Telecom Group
100% (2)
Case 13 Jordan Telecom Group
19 pages
Mvaj101 - Trip Relay
100% (1)
Mvaj101 - Trip Relay
2 pages
Lesson Plan 5 - Science Plants Needs
No ratings yet
Lesson Plan 5 - Science Plants Needs
6 pages
Income Tax Department Depreciation Rate
No ratings yet
Income Tax Department Depreciation Rate
16 pages
[MM] L1 _ Introduction to Marketing Management
No ratings yet
[MM] L1 _ Introduction to Marketing Management
12 pages
CRJ-00 and 00-Fire Protection
No ratings yet
CRJ-00 and 00-Fire Protection
30 pages
Acoustic Levitator
No ratings yet
Acoustic Levitator
26 pages
Board Resolution Adopting An Environmental Policy
No ratings yet
Board Resolution Adopting An Environmental Policy
2 pages
Account Statement PDF
No ratings yet
Account Statement PDF
12 pages
Advance Java Programming Lab PDF
No ratings yet
Advance Java Programming Lab PDF
22 pages
Core Java Bank Management Project
No ratings yet
Core Java Bank Management Project
15 pages
Types of Sections
No ratings yet
Types of Sections
15 pages
278 279 280 W1 Timetable 11OCT09 V2
No ratings yet
278 279 280 W1 Timetable 11OCT09 V2
2 pages
Geotechnical Engineering C2
No ratings yet
Geotechnical Engineering C2
8 pages
Taduran, BEED2A LP in Science
No ratings yet
Taduran, BEED2A LP in Science
7 pages
Draft MASS Code 5 June 2023
No ratings yet
Draft MASS Code 5 June 2023
61 pages
PM 251 (Local Government and Regional Administration)
No ratings yet
PM 251 (Local Government and Regional Administration)
12 pages
Change Management Policy
No ratings yet
Change Management Policy
7 pages
4. MINOR LOSSS
No ratings yet
4. MINOR LOSSS
5 pages

02_agents

Uploaded by

02_agents

Uploaded by

CS 5/7320

Slides by Michael Hahsler

• Control theory: A closed-loop control system (= feedback control system)

The agent program is a concrete implementation of this function for a given

Agent = architecture (hardware) + agent program (implementation of 𝑓𝑓)

Agent function: 𝑓𝑓 ∶ 𝑃𝑃∗ → 𝐴𝐴 Implemented agent program:

Percept Sequence Action function Vacuum-Agent([location, status])

Problem: This table can become infinitively large!

Definition of a rational agent:

• Performance measure: An objective criterion for success of an agent's behavior (often

Percept Sequence Action function Vacuum-Agent([location, status])

What could be a performance measure?

Components and Defines

Stochastic: The next state cannot be determined

Known: The agent knows the rules of the

Episodic: Episode = a self-contained

Word jumble Chess with Scrabble Taxi driving

Observable Fully Fully Partially Partially

Static Static Semidynamic Static Dynamic

Discrete Discrete Discrete Discrete Continuous

Single agent Single Multi* Multi* Multi*

Model-based reflex agents

Simple reflex agents

Example: A vacuum cleaner that remembers were it has already cleaned.

Action causes Variables describing the

Old-school thermostat Smart thermostat

Old-school thermostat Smart thermostat

𝑎𝑎 = argmin𝑎𝑎0∈A � 𝑐𝑐𝑡𝑡 � 𝑠𝑠𝑇𝑇 ∈ 𝑆𝑆 𝑔𝑔𝑔𝑔𝑔𝑔𝑔𝑔

Sum of the cost

Techniques: Markov decision

The learning element modifies the agent program (reflex-based, goal-

How is the agent

Update the agent

Does it collect utility over

Goal-based agents Does it have a goal state?

Does it store state information.

Does it use simple rules based

Check what applies

Goal-based agents Does it have a goal state?

Does it store state information.

Does it use simple rules based

Answer the following questions:

We will talk about knowledge-based agents later.

Make sure the passenger has a pleasant drive High-level

Goal-based agents Plan the route to the destination.

Remember where every other car is and

Intelligent agents inspire the research areas of modern AI

Stay within given

Learn a good agent

You might also like