Artificial Intelligence
Intelligent Agents
AIMA Chapter 2
Image: "Robot at the British Library Science Fiction Exhibition"
by BadgerGravling
Outline
• What is an intelligent agent?
• Rationality
• PEAS (Performance measure, Environment, Actuators, Sensors)
• Environment types
• Agent types
What is an Agent?
• An agent is anything that can be viewed as perceiving its environment through sensors and acting upon that environment through actuators.
• Control theory: A closed-loop control system (= feedback control system) is a set of mechanical or electronic devices that automatically regulates a process variable to a desired state or set point without human interaction. Here the agent is called a controller.
• Softbot: The agent is a software program that runs on a host device.
Agent Function and Agent Program
The agent function maps from the set of all possible percept sequences 𝑃∗ to the set of actions 𝐴, formulated as an abstract mathematical function:
𝑓 ∶ 𝑃∗ → 𝐴
Given the percept(s) 𝑝, the agent selects the action 𝑎 = 𝑓(𝑝).
The agent program is a concrete implementation of this function for a given physical system (with its sensors, memory, and computational power).
Agent = architecture (hardware) + agent program (implementation of 𝑓)
Agents
Definition of an agent: An agent is anything that can be viewed as perceiving the environment through sensors and acting upon the environment through actuators.
- A human agent has
  - eyes, ears, and other organs for sensors
  - hands, legs, and the vocal tract for actuators
- A robotic agent has
  - cameras and infrared range finders for sensors
  - various motors for actuators
- A software agent
  - receives keystrokes, file contents, and network packets as sensory inputs
  - acts on the environment by displaying on the screen, writing files, and sending network packets
Percept vs. percept sequence: A percept is the agent’s perceptual input at any given instant; a percept sequence is the complete history of everything the agent has ever perceived.
Agent Function vs. Agent Program
- The vacuum-cleaner world
  - has just two locations: squares A and B
- The vacuum agent
  - perceives which square it is in and whether there is dirt in the square
  - can choose to move left, move right, suck up the dirt, or do nothing
- Agent function
  - If the current square is dirty, then suck; otherwise move to the other square.
Example:
Vacuum-cleaner World
• Percepts: location and status, e.g., [A, Dirty]
• Actions: Left, Right, Suck, NoOp
Implemented agent program:
function Vacuum-Agent([location, status]) returns an action 𝑎
  if status = Dirty then return Suck
  else if location = A then return Right
  else if location = B then return Left
Agent function 𝑓 ∶ 𝑃∗ → 𝐴 as a table (the most recent percept is 𝑝):
Percept sequence                          Action
[A, Clean]                                Right
[A, Dirty]                                Suck
[A, Clean], [B, Clean]                    Left
[A, Clean], [B, Clean], [A, Dirty]        Suck
…                                         …
Problem: This table can become infinitely large!
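To make the distinction concrete, here is a minimal Python sketch (the table entries are the rows above; the function and variable names are illustrative, not part of AIMA): the agent function is a lookup table keyed by whole percept sequences, while the agent program is a short piece of code that computes the same action from the latest percept.

# Agent FUNCTION: an (in principle infinite) table from percept sequences to actions.
agent_function_table = {
    (("A", "Clean"),): "Right",
    (("A", "Dirty"),): "Suck",
    (("A", "Clean"), ("B", "Clean")): "Left",
    (("A", "Clean"), ("B", "Clean"), ("A", "Dirty")): "Suck",
}

# Agent PROGRAM: a finite implementation of the same behavior.
def vacuum_agent_program(percept):
    location, status = percept
    if status == "Dirty":
        return "Suck"
    return "Right" if location == "A" else "Left"

# Both agree on the percept sequence [A, Clean], [B, Clean], [A, Dirty]:
sequence = (("A", "Clean"), ("B", "Clean"), ("A", "Dirty"))
assert agent_function_table[sequence] == vacuum_agent_program(sequence[-1])

Every additional percept sequence would need its own row in the table, which is why the table blows up while the program stays small.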
Agent Function vs. Agent Program
- The agent function describes the agent’s behavior by mapping any given percept sequence to an action.
- To describe any given agent, we have to tabulate the agent function, and this will typically be a very large (potentially infinite) table.
- We can, in principle, construct this table by trying out all possible percept sequences and recording which action the agent does in response.
- This table is an external characterization of the agent.
- The agent function is an abstract mathematical description.
- The agent program is an internal implementation of the agent function for an artificial agent.
- It is a concrete implementation running within some physical system.
Good Behavior: The Concept of Rationality
We learned that we should design agents that “act rationally.”
• How do we define acting rationally so we can write programs?
• Should we consider the environment where the agent will be deployed?
Good Behavior: The Concept of Rationality
- A rational agent is one that does the right thing (this is not yet a definition of a rational agent).
- In other words, every entry in the table for the agent function is filled out correctly.
- What does it mean to do the right thing?
- We consider the consequences of the agent’s behavior:
  - When an agent is plunked down in an environment, it generates a sequence of actions according to the percepts it receives.
  - This sequence of actions causes the environment to go through a sequence of states.
  - If this sequence of environment states is desirable, the agent has performed well.
Good Behavior: The Concept of Rationality
- We are interested in environment states, not agent states.
- We should not define success in terms of the agent’s opinion:
  - The agent could achieve perfect rationality simply by deluding itself that its performance was perfect.
  - Human agents, for example, are notorious for “sour grapes”: believing that they did not really want something after not getting it.
Performance Measure of a Rational Vacuum-cleaner Agent
- Amount of dirt cleaned up
- Amount of time taken
- Amount of electricity consumed
- Amount of noise generated, etc.
Rational Agents: What is Good Behavior?
Foundation
• Consequentialism: Evaluate behavior by its consequences.
• Utilitarianism: Maximize happiness and well-being.
Definition of a rational agent:
“For each possible percept sequence, a rational agent should select an action that maximizes its expected performance measure, given the evidence provided by the percept sequence and the agent’s built-in knowledge.”
• Performance measure: An objective criterion for success of an agent's behavior (often called a utility function or reward function).
• Expectation: Outcome averaged over all possible situations that may arise.
Rule: Pick the action that maximizes the expected utility
𝑎 = argmax_{𝑎∈𝐴} 𝐸[𝑈(𝑎)]
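As a minimal sketch of this rule (the actions are from the vacuum world, but the outcome probabilities and utilities are made-up numbers for illustration): the expected utility of an action is the probability-weighted average of the utilities of its possible outcomes, and the agent picks the action with the largest value.

# Hypothetical outcome model: action -> list of (probability, utility) pairs.
outcomes = {
    "Suck":  [(0.9, 10), (0.1, 0)],   # usually removes the dirt
    "Right": [(1.0, 1)],
    "NoOp":  [(1.0, 0)],
}

def expected_utility(action):
    return sum(p * u for p, u in outcomes[action])

# a = argmax_a E[U(a)]
best_action = max(outcomes, key=expected_utility)
print(best_action, expected_utility(best_action))   # Suck 9.0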
Rational Agents
This means:
• Rationality is an ideal – it implies that no one can build a better agent.
• Rationality ≠ omniscience – rational agents can make mistakes if percepts and knowledge do not suffice to make a good decision.
• Rationality ≠ perfection – rational agents maximize expected outcomes, not actual outcomes.
• It is rational to explore and learn – i.e., use percepts to supplement prior knowledge and become autonomous.
• Rationality is often bounded by available memory, computational power, available sensors, etc.
Rule: Pick the action that maximizes the expected utility
𝑎 = argmax_{𝑎∈𝐴} 𝐸[𝑈(𝑎)]
Example:
Vacuum-cleaner World
• Percepts: location and status, e.g., [A, Dirty]
• Actions: Left, Right, Suck, NoOp
Implemented agent program:
function Vacuum-Agent([location, status]) returns an action
  if status = Dirty then return Suck
  else if location = A then return Right
  else if location = B then return Left
Agent function as a table:
Percept sequence                          Action
[A, Clean]                                Right
[A, Dirty]                                Suck
[A, Clean], [B, Clean]                    Left
…                                         …
What could be a performance measure?
Is this agent program rational?
Problem Specification: PEAS
• Performance measure: defines utility and what is rational.
• Environment: the components and the rules of how actions affect the environment.
• Actuators: define the available actions.
• Sensors: define the percepts.
Example: Automated Taxi Driver
• Performance measure: safe, fast, legal, comfortable trip, maximize profits.
• Environment: roads, other traffic, pedestrians, customers.
• Actuators: steering wheel, accelerator, brake, signal, horn.
• Sensors: cameras, sonar, speedometer, GPS, odometer, engine sensors, keyboard.
Example: Spam Filter
• Performance measure: accuracy (minimizing false positives and false negatives).
• Environment: a user’s email account, the email server.
• Actuators: mark as spam, delete, etc.
• Sensors: incoming messages, other information about the user’s account.
Types of Environment
1. Observability: fully (completely) observable, partially observable, hidden
2. Predictability: deterministic, strategic, stochastic
3. Interaction: episodic / sequential
4. Agents: single-agent, multi-agent (competitive, cooperative)
5. Time: static / dynamic
6. State: discrete, continuous (time, percepts, actions)
Environment Types
Fully observable vs. partially observable
• Fully observable: The agent's sensors give it access to the complete state of the environment. The agent can “see” the whole environment.
• Partially observable: The agent cannot see all aspects of the environment; e.g., it can’t see through walls.
Deterministic vs. stochastic vs. strategic
• Deterministic: Changes in the environment are completely determined by the current state of the environment and the agent’s action.
• Stochastic: Changes cannot be determined from the current state and the action (there is some randomness).
• Strategic: The environment is stochastic and adversarial. It chooses actions strategically to harm the agent; e.g., a game where the other player is modeled as part of the environment.
Known vs. unknown
• Known: The agent knows the rules of the environment and can predict the outcome of actions.
• Unknown: The agent cannot predict the outcome of actions.
Environment Types
Static vs. semidynamic vs. dynamic
• Static: The environment does not change while the agent is deliberating.
• Semidynamic: The environment is static, but the agent's performance score depends on how fast it acts.
• Dynamic: The environment changes while the agent is deliberating.
Discrete vs. continuous
• Discrete: The environment provides a fixed number of distinct percepts, actions, and environment states. Time can also evolve in a discrete or continuous fashion.
• Continuous: Percepts, actions, state variables, or time are continuous, leading to an infinite state, percept, or action space.
Episodic vs. sequential
• Episodic: An episode is a self-contained sequence of actions. The agent's choice of action in one episode does not affect the next episodes. The agent does the same task repeatedly.
• Sequential: Actions now affect the outcomes later; e.g., learning makes problems sequential.
Single-agent vs. multi-agent
• Single agent: An agent operating by itself in an environment.
• Multi-agent: Agents cooperate or compete in the same environment.
Examples of Different Environments
Properties: Observable | Deterministic | Episodic? | Static | Discrete | Single agent
• Chess with a clock: Fully | Deterministic game mechanics + strategic* | Episodic | Semidynamic | Discrete | Multi*
• Scrabble: Partially | Stochastic + strategic | Episodic | Static | Discrete | Multi*
• Taxi driving: Partially | Stochastic | Sequential | Dynamic | Continuous | Multi*
• Word jumble solver: Fully | Deterministic | Episodic | Static | Discrete | Single
* Can be modeled as a single-agent problem with the other agent(s) in the environment.
Designing a Rational Agent
Remember the definition of a rational agent:
“For each possible percept sequence, a rational agent should select an action that maximizes its expected performance measure, given the evidence provided by the percept sequence and the agent’s built-in knowledge.”
The agent function receives a percept 𝑝 and returns an action 𝑎. To be rational, the agent function must
• assess the performance measure,
• remember the percept sequence, and
• use its built-in knowledge.
Note: Everything outside the agent function can be seen as the environment.
Hierarchy of Agent Types
• Utility-based agents
• Goal-based agents
• Model-based reflex agents
• Simple reflex agents
• Learning agents
Simple Reflex Agent
• Uses only built-in knowledge in the form of rules that select the action based only on the current percept: 𝑎 = 𝑓(𝑝). This is typically very fast!
• The agent does not know about the performance measure! But well-designed rules can lead to good performance.
• The agent needs no memory and ignores all past percepts.
The interaction is a sequence: 𝑝0, 𝑎0, 𝑝1, 𝑎1, 𝑝2, 𝑎2, …, 𝑝𝑡, 𝑎𝑡, …
Example: A simple vacuum cleaner that uses rules based on its current sensor input.
Observe the world, choose an action, implement the action, done.
Problem: This approach runs into trouble if the environment is not fully observable.
Model-based Reflex Agent
• Maintains a state variable 𝑠 to keep track of aspects of the environment that cannot currently be observed. I.e., it has memory and knows how the environment reacts to actions (the transition function 𝑠′ = 𝑇(𝑠, 𝑎)):
  - information about how the world evolves independently of the agent, and
  - information about how the agent’s own actions affect the world.
• The state is updated using the percept.
• There is now more information for the rules to make better decisions: 𝑎 = 𝑓(𝑝, 𝑠).
The interaction is a sequence: 𝑠0, 𝑎0, 𝑝1, 𝑠1, 𝑎1, 𝑝2, 𝑠2, 𝑎2, 𝑝3, …, 𝑝𝑡, 𝑠𝑡, 𝑎𝑡, …
Example: A vacuum cleaner that remembers where it has already cleaned.
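A minimal sketch of a model-based reflex agent for the two-square vacuum world (the class and attribute names are illustrative assumptions, not from the slides): the agent keeps an internal state recording the last known status of each square and combines it with the current percept.

class ModelBasedVacuumAgent:
    """Keeps an internal model s: the last known status of squares A and B."""

    def __init__(self):
        self.model = {"A": "Unknown", "B": "Unknown"}   # internal state s

    def act(self, percept):                             # a = f(p, s)
        location, status = percept
        self.model[location] = status                   # update s using the percept
        if status == "Dirty":
            return "Suck"
        if all(v == "Clean" for v in self.model.values()):
            return "NoOp"                               # model says everything is clean
        return "Right" if location == "A" else "Left"

agent = ModelBasedVacuumAgent()
print(agent.act(("A", "Dirty")))   # Suck
print(agent.act(("A", "Clean")))   # Right (status of B still unknown)
print(agent.act(("B", "Clean")))   # NoOp  (both squares known to be clean)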
Transition Function
• The environment is modeled as a discrete dynamical system.
• Changes in the environment are a sequence of states 𝑠0, 𝑠1, …, 𝑠𝑇, where the index is the time step.
• States change because of:
  a. the system dynamics of the environment, and
  b. the actions of the agent.
• Both types of changes are represented by the transition function, written as 𝑠′ = 𝑇(𝑠, 𝑎), where
  𝑇: 𝑆 × 𝐴 → 𝑆
  𝑆 … set of states
  𝐴 … set of available actions
  𝑠 ∈ 𝑆 … current state
  𝑎 ∈ 𝐴 … an action
  𝑠′ ∈ 𝑆 … next state
• Example of a state diagram: two states, “Light is off” and “Light is on”; the action “switch on” moves the system from off to on, and “switch off” moves it back.
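The light-switch diagram can be written down directly as a transition function 𝑇: 𝑆 × 𝐴 → 𝑆. A minimal Python sketch (the state and action names follow the diagram):

# s' = T(s, a) for the two-state light example.
T = {
    ("off", "switch on"):  "on",
    ("off", "switch off"): "off",
    ("on",  "switch on"):  "on",
    ("on",  "switch off"): "off",
}

def transition(state, action):
    return T[(state, action)]

print(transition("off", "switch on"))   # on
print(transition("on", "switch off"))   # off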
State Representation
States help to keep track of the environment and of the agent in the environment. This is often also called the system state. An action causes a transition from one state to the next. The representation can be
• Atomic: just a label for a black box, e.g., A, B.
• Factored: a set of attribute values; the variables describing the system state are called fluents. E.g., [location = left, status = clean, temperature = 75 deg. F].
We often construct atomic labels from factored information. E.g., if the agent’s state is the coordinate x = 7 and y = 3, then the atomic state label could be the string “(7, 3)”.
With the atomic representation, we can only compare whether two labels are the same. With the factored state representation, we can reason more and, for example, calculate the distance between states!
The set of all possible states is called the state space 𝑆. This set is typically very large!
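A short sketch of the two representations using the coordinate example above (the Manhattan distance is just one illustrative choice of distance): atomic labels only support equality tests, while factored states allow further reasoning such as computing distances.

# Factored state: a set of fluents stored as a dictionary.
s1 = {"x": 7, "y": 3}
s2 = {"x": 2, "y": 5}

# Atomic state: an opaque label constructed from the factored information.
label1 = f"({s1['x']}, {s1['y']})"     # "(7, 3)"
label2 = f"({s2['x']}, {s2['y']})"

print(label1 == label2)                # atomic: we can only check equality -> False

# Factored: we can reason, e.g., compute the Manhattan distance between states.
distance = abs(s1["x"] - s2["x"]) + abs(s1["y"] - s2["y"])
print(distance)                        # 5 + 2 = 7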
Old-school vs. Smart Thermostat
Old-school thermostat
• Percepts: temperature (low, ok, high).
• States: no states needed.
• Behavior: you set a temperature range, and it changes the temperature when it is too cold/warm.
Smart thermostat
• Percepts: temperature in deg. F, outside temperature, weather report, energy curtailment, someone walking by, someone changing the temperature, day & time, …
• States (factored): estimated time to cool the house, someone home?, how long till someone is coming home?, A/C: on/off.
Goal-based Agent
• The agent has the task to reach a defined goal state and is then finished.
• The agent needs to move towards the goal. It can use search algorithms to plan a sequence of actions that leads to a goal state.
• Performance measure: the cost to reach the goal. The agent picks the plan that minimizes the summed cost of the planned action sequence while ending in a goal state:
𝑎 = argmin_{𝑎0,…,𝑎𝑇 ∈ 𝐴} Σ_{𝑡=0}^{𝑇} 𝑐𝑡 subject to 𝑠𝑇 ∈ 𝑆_goal
The interaction is a sequence: 𝑠0, 𝑎0, 𝑝1, 𝑠1, 𝑎1, 𝑝2, 𝑠2, 𝑎2, …, 𝑠_goal
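To make “plan a sequence of actions that reaches a goal state with minimum cost” concrete, here is a small breadth-first-search sketch on a hypothetical grid world (the grid size, walls, and unit step costs are illustrative assumptions; the search algorithms used by real goal-based agents are covered later):

from collections import deque

def plan(start, goal, walls, width=4, height=3):
    # Breadth-first search: returns a minimum-length action sequence to the goal.
    moves = {"Up": (0, -1), "Down": (0, 1), "Left": (-1, 0), "Right": (1, 0)}
    frontier = deque([(start, [])])
    visited = {start}
    while frontier:
        (x, y), actions = frontier.popleft()
        if (x, y) == goal:
            return actions                      # reached a state in the goal set
        for action, (dx, dy) in moves.items():
            nxt = (x + dx, y + dy)
            if (0 <= nxt[0] < width and 0 <= nxt[1] < height
                    and nxt not in walls and nxt not in visited):
                visited.add(nxt)
                frontier.append((nxt, actions + [action]))
    return None                                 # no plan reaches the goal

print(plan(start=(0, 0), goal=(3, 2), walls={(1, 1), (2, 1)}))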
Utility-based Agent
• The agent uses a utility function to evaluate the desirability of each possible state. This is typically expressed as the reward of being in a state, 𝑅(𝑠).
• Choose actions to stay in desirable states.
• Performance measure: the discounted sum of expected utility over time. Utility is the expected future discounted reward:
𝑎 = argmax_{𝑎0 ∈ 𝐴} 𝐸[ Σ_{𝑡=0}^{∞} 𝛾^𝑡 𝑟𝑡 ]
Techniques: Markov decision processes, reinforcement learning.
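As a minimal sketch of choosing the action with the highest expected discounted reward by one-step lookahead (the transition probabilities, rewards, next-state values, and the discount factor γ = 0.9 below are made-up numbers; computing these values properly is what Markov decision processes and reinforcement learning are about):

gamma = 0.9

# Hypothetical model: action -> list of (probability, immediate reward, value of next state).
model = {
    "Suck":  [(0.9, 10, 5), (0.1, 0, 0)],
    "Right": [(1.0, 0, 8)],
    "NoOp":  [(1.0, 0, 2)],
}

def expected_discounted_value(action):
    # Expected immediate reward plus discounted value of the resulting state.
    return sum(p * (r + gamma * v) for p, r, v in model[action])

best = max(model, key=expected_discounted_value)
print(best, round(expected_discounted_value(best), 2))   # Suck 13.05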
Agents that Learn
The learning element modifies the agent program (reflex-based, goal-based, or utility-based) to improve its performance.
The learning loop: check how the agent is currently performing, explore, and update the agent program.
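A tiny sketch of such a learning loop (the two-action example, the learning rate, and the exploration rate are illustrative assumptions): feedback from the environment is turned into a numeric score, the value estimates inside the agent program are updated, and occasional exploration tries actions that currently look worse.

import random

estimates = {"A": 0.0, "B": 0.0}    # the agent program's current value estimates
alpha, epsilon = 0.1, 0.1           # learning rate and exploration rate

def feedback(action):               # stands in for performance feedback from the environment
    return random.gauss(1.0 if action == "B" else 0.5, 0.1)

for _ in range(500):
    if random.random() < epsilon:                   # explore: try something else
        action = random.choice(list(estimates))
    else:                                           # exploit: use the current program
        action = max(estimates, key=estimates.get)
    reward = feedback(action)
    estimates[action] += alpha * (reward - estimates[action])   # update the program

print(max(estimates, key=estimates.get), estimates)   # tends to prefer action "B"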
Example: Smart Thermostat
• Percepts: temperature in deg. F, outside temperature, weather report, energy curtailment, someone walking by, someone changing the temperature, day & time, …
• States (factored): estimated time to cool the house, someone home?, how long till someone is coming home?, A/C: on/off.
• Action: change the temperature when you are too cold/warm.
Example: Modern Vacuum Robot
Features are:
• Control via app
• Cleaning modes
• Navigation
• Mapping
• Boundary blockers
Source: https://www.techhive.com/article/3269782/best-robot-vacuum-cleaners.html
PEAS Description of a Modern Robot Vacuum
Performance measure | Environment | Actuators | Sensors
What Type of Intelligent Agent is a Modern Robot Vacuum?
Check what applies:
• Simple reflex agent: Does it use simple rules based on the current percepts?
• Model-based reflex agent: Does it store state information? How would the states be defined (atomic/factored)?
• Goal-based agent: Does it have a goal state?
• Utility-based agent: Does it collect utility over time? How would the utility for each state be defined?
• Is it learning?
What Type of Intelligent Agent is this?
PEAS Description of ChatGPT
Performance measure | Environment | Actuators | Sensors
How does ChatGPT work?
What Type of Intelligent Agent is ChatGPT?
Check what applies:
• Simple reflex agent: Does it use simple rules based on the current percepts?
• Model-based reflex agent: Does it store state information? How would the states be defined (atomic/factored)?
• Goal-based agent: Does it have a goal state?
• Utility-based agent: Does it collect utility over time? How would the utility for each state be defined?
• Is it learning?
Answer the following questions:
• Does ChatGPT pass the Turing test?
• Is ChatGPT a rational agent? Why?
We will talk about knowledge-based agents later.
Intelligent Systems as Sets of Agents: Self-driving Car
• Simple reflex agent: react quickly to unforeseen issues like a child running in front of the car.
• Model-based reflex agent: remember where every other car is and calculate where they will be in the next few seconds.
• Goal-based agent: plan the route to the destination (high-level planning).
• Utility-based agent: make sure the passenger has a pleasant drive, e.g., not too much sudden braking (= utility); low-level planning.
It should learn!
Some Environment Types Revisited
Fully observable vs. partially observable
• Fully observable: The agent’s sensors always show the whole state.
• Partially observable: The agent only perceives part of the state and needs to remember or infer the rest.
Deterministic vs. stochastic
• Deterministic: Percepts are 100% reliable, and changes in the environment are completely determined by the current state of the environment and the agent’s action.
• Stochastic: Percepts are unreliable (noise distribution, sensor failure probability, etc.); this is called a stochastic sensor model. The transition function is stochastic, leading to transition probabilities and a Markov process.
Known vs. unknown
• Known: The agent knows the transition function.
• Unknown: The agent needs to learn the transition function by trying actions.
We will spend the whole semester discussing algorithms that can deal with environments that have different combinations of these three properties.
Conclusion
Intelligent agents inspire the research areas of modern AI:
• Search for a goal (e.g., navigation).
• Optimize functions (e.g., utility).
• Stay within given constraints (constraint satisfaction problem; e.g., reach the goal without running out of power).
• Deal with uncertainty (e.g., current traffic on the road).
• Learn a good agent program from data and improve over time (machine learning).
• Sensing (e.g., natural language processing, vision).
Different types of AI agents:
1. Simple reflex agents: These agents make decisions based solely on the current percept (immediate stimuli from the environment). They ignore the rest of the percept history. For example, a simple reflex agent might apply the brakes if it detects that the tail light of the vehicle in front of it is on.
2. Goal-based agents: Goal-based agents have a specific goal they aim to achieve. They select actions that improve progress toward that goal. For instance, a goal-based agent would take actions to reach its goal, even if they are not necessarily the best actions.
3. Learning agents: Learning agents adapt and improve their behavior over time by learning from their experiences and from feedback on their performance.
4. Hierarchical agents: These agents operate in a hierarchical manner, with different levels of decision-making. Higher-level goals guide lower-level actions. For example, a robot navigating a maze might have high-level goals (reach the exit) and low-level actions (move forward, turn left, etc.).
5. Collaborative agents: Collaborative agents work together in multi-agent systems to achieve common goals. They coordinate actions and communicate with each other. Examples include teams of robots working together or multiplayer game characters cooperating.
6. Agent program: An agent program implements the agent’s decision-making function. It maps percept sequences to actions.
Simple Reflex Agent
def simple_reflex_agent(percept):
    # Condition-action rule based only on the current percept.
    if percept == "tail light on":
        return "Apply brakes"
    else:
        return "Continue driving"

# Example usage:
current_percept = "tail light on"
action = simple_reflex_agent(current_percept)
print(f"Action: {action}")
class SimpleReflexAgent:
    def __init__(self):
        # Condition-action rules: map the perceived status to a handler.
        self.rules = {
            'dirty': self.clean,
            'clean': self.no_op
        }
        self.location = None

    def perceive(self, environment):
        # The environment is a (location, status) pair.
        self.location, status = environment
        return status

    def act(self, status):
        action = self.rules.get(status, self.no_op)
        return action()

    def clean(self):
        print(f"Agent cleaned the room at location {self.location}.")
        return "clean"

    def no_op(self):
        print(f"Agent did nothing in the room at location {self.location}.")
        return "no_op"

if __name__ == "__main__":
    environment_state = ("A", "dirty")
    agent = SimpleReflexAgent()
    while True:
        percept = agent.perceive(environment_state)
        action_result = agent.act(percept)
        if action_result == "clean":
            environment_state = ("A", "clean")
        else:
            break
Goal Based Agent
class GoalBasedAgent:
    def __init__(self, goal):
        self.goal = goal

    def select_action(self, percept):
        # Implement your logic here based on the goal and percept.
        # For demonstration purposes, let's assume a simple action:
        return "Take a step towards the goal"

# Example usage:
goal_agent = GoalBasedAgent("Reach the exit")
current_percept = "Obstacle ahead"
action = goal_agent.select_action(current_percept)
print(f"Action: {action}")
Editor's Notes
  • #1: https://slideplayer.com/slide/4380559/
  • #4: How do AI agents work? AI agents range from simple task-specific programs to sophisticated systems that combine perception, reasoning, and action capabilities. The most advanced agents in use today present the full potential of this technology, operating through a cycle of processing inputs, making decisions, and executing actions while continuously updating their knowledge. Perception and input processing AI agents begin by gathering and processing input from their environment. This could include parsing text commands, analyzing data streams, or receiving sensor data. The perception module converts raw inputs into a format the agent can understand and process. For example, when a customer submits a support request, an AI agent could process the ticket by analyzing text content, user history, and metadata like priority level and timestamp. Decision-making and planning Using machine learning models like NLP, sentiment analysis, and classification algorithms, agents evaluate their inputs against their objectives. These models work together: NLP first processes and understands the input text, sentiment analysis evaluates its tone and intent, and classification algorithms determine which category of response is most appropriate. This layered approach enables agents to process complex inputs and respond appropriately. They generate possible actions, assess potential outcomes, and select the most appropriate response based on their programming and current context. For instance, when handling the support ticket, the AI agent could evaluate content and urgency to determine whether to handle it directly or escalate to a human agent. Knowledge management Agents maintain and use knowledge bases that contain domain-specific information, learned patterns, and operational rules. Through Retrieval-Augmented Generation (RAG), agents can dynamically access and incorporate relevant information from their knowledge base when forming responses. In our support ticket example, the agent uses RAG to pull information from product documentation, past cases, and company policies to generate accurate, contextual solutions rather than relying solely on its training data. Action execution Once a decision is made, agents execute actions through their output interfaces. This could involve generating text responses, updating databases, triggering workflows, or sending commands to other systems. The action module ensures the chosen response is properly formatted and delivered. Continuing our example, the customer support agent might then send automated troubleshooting steps, route the ticket to a specialized department, or flag it for immediate human attention. Learning and adaptation Advanced AI agents can improve their performance over time through feedback loops and learning mechanisms. They analyze the outcomes of their actions, update their knowledge bases, and refine their decision-making processes based on success metrics and user feedback. Using reinforcement learning techniques, these agents develop optimal policies by balancing exploration (trying new approaches) with exploitation (using proven successful strategies). In the support scenario, the agent learns from resolution success rates and satisfaction scores to improve its future responses and routing decisions, treating each interaction as a learning opportunity to refine its decision-making model. 
Types of AI agents Businesses have a rich but complex landscape of AI agent options, ranging from simple task-specific automation tools to sophisticated multi-purpose assistants that can transform entire workflows. The choice or development of an AI agent depends on several factors—including technical complexity, implementation costs, and specific use cases—with some organizations opting for ready-to-use solutions while others invest in custom agents tailored to their unique needs. 1. Simple reflex agents Simple reflex agents are one of the most basic forms of artificial intelligence. These agents make decisions based solely on their current sensory input, responding immediately to environmental stimuli without needing memory or learning processes. Their behavior is governed by predefined condition-action rules, which specify how to react to particular inputs. Though they are limited in complexity, this straightforward approach makes them highly efficient and easy to implement, especially in environments where the range of possible actions is limited. Key components: Sensors: Much like human senses, these gather information from the environment. For a simple reflex agent, sensors are typically basic input devices that detect specific environmental conditions like temperature, light, or motion. Condition-action rules: These predefined rules determine how the agent responds to specific inputs. The logic is direct—if the agent detects a specific condition, it immediately performs a corresponding action. Actuators: These execute the decisions made by the agent, translating them into physical or digital responses that alter the environment in some way, such as activating a heating system or turning on lights. Use cases: Simple reflex agents are ideal for transparent, predictable environments with limited variables. Industrial safety sensors that immediately shut down machinery when detecting an obstruction in the work area. Automated sprinkler systems that activate based on smoke detection. Email auto-responders that send predefined messages based on specific keywords or sender addresses. 2. Model-based reflex agents Model-based reflex agents are a more advanced form of intelligent agents designed to operate in partially observable environments. Unlike simple reflex agents, which react solely based on current sensory input, model-based agents maintain an internal representation, or model, of the world. This model tracks how the environment evolves, allowing the agent to infer unobserved aspects of the current state. While these agents don’t actually “remember” past states in the way more advanced agents do, they use their world model to make better decisions about the current state. Key components: State tracker: Maintains information about the current state of the environment based on the world model and sensor history. World model: Contains two key types of knowledge, how the environment evolves independent of the agent, and how the agent’s actions affect the environment. Reasoning component: Uses the world model and current state to determine appropriate actions based on condition-action rules. Use cases: These agents are suitable for environments where the current state isn’t fully observable from sensor data alone. Smart home security systems: Using models of normal household activity patterns to distinguish between routine events and potential security threats. Quality control systems: Monitoring manufacturing processes by maintaining a model of normal operations to detect deviations. 
Network monitoring tools: Tracking network state and traffic patterns to identify potential issues or anomalies. 3. Goal-based agents Goal-based agents are designed to pursue specific objectives by considering the future consequences of their actions. Unlike reflex agents that act based on rules or world models, goal-based agents plan sequences of actions to achieve desired outcomes. They use search and planning algorithms to find action sequences that lead to their goals. Key components: Goal state: A clear description of what the agent aims to achieve Planning mechanism: The ability to search through possible sequences of actions that could lead to the goal. State evaluation: Methods to assess whether potential future states move closer to or further from the goal. Action selection: The process of choosing actions based on their predicted contribution toward reaching the goal. World model: Understanding of how actions change the environment, used for planning. Use cases: Goal-based agents are suited for tasks with clear, well-defined objectives and predictable action outcomes. Industrial robots: Following specific sequences to assemble products. Automated warehouse systems: Planning optimal paths to retrieve items. Smart heating systems: Planning temperature adjustments to reach desired comfort levels efficiently. Inventory management systems: Planning reorder schedules to maintain target stock levels. Task scheduling systems: Organizing sequences of operations to meet completion deadlines. 4. Learning agents A learning agent is an artificial intelligence system capable of improving its behavior over time by interacting with its environment and learning from its experiences. These agents modify their behavior based on feedback and experience, using various learning mechanisms to optimize their performance. Unlike simpler agent types, they can discover how to achieve their goals through experience rather than purely relying on pre-programmed knowledge. Key components: Performance element: The component that selects external actions, similar to the decision-making modules in simpler agents. Critic: Provides feedback on the agent’s performance by evaluating outcomes against standards, often using a reward or performance metric. Learning element: Uses critic’s feedback to improve the performance element, determining how to modify behavior to do better in the future. Problem generator: Suggests exploratory actions that might lead to new experiences and better future decisions. Use cases: Learning agents are suited for environments where optimal behavior isn’t known in advance and must be learned through experience. Industrial process control: Learning optimal settings for manufacturing processes through trial and error. Energy management systems: Learning patterns of usage to optimize resource consumption. Customer service chatbots: Improving response accuracy based on interaction outcomes. Quality control systems: Learning to identify defects more accurately over time. 5. Utility-based agents A utility-based agent makes decisions by evaluating the potential outcomes of its actions and choosing the one that maximizes overall utility. Unlike goal-based agents that aim for specific states, utility-based agents can handle tradeoffs between competing goals by assigning numerical values to different outcomes. Key components: Utility function: A mathematical function that maps states to numerical values, representing the desirability of each state. 
State evaluation: Methods to assess current and potential future states in terms of their utility. Decision mechanism: Processes for selecting actions that are expected to maximize utility. Environment model: Understanding of how actions affect the environment and resulting utilities. Use cases: Utility-based agents are suited for scenarios requiring balance between multiple competing objectives. Resource allocation systems: Balancing machine usage, energy consumption, and production goals. Smart building management: Optimizing between comfort, energy efficiency, and maintenance costs. Scheduling systems: Balancing task priorities, deadlines, and resource constraints. 6. Hierarchical agents Hierarchical agents are structured in a tiered system, where higher-level agents manage and direct the actions of lower-level agents. This architecture breaks down complex tasks into manageable subtasks, allowing for more organized control and decision-making. Key components: Task decomposition: Breaks down complex tasks into simpler subtasks that can be managed by lower-level agents. Command hierarchy: Defines how control and information flow between different levels of agents. Coordination mechanisms: Ensures different levels of agents work together coherently. Goal delegation: Translates high-level objectives into specific tasks for lower-level agents. Use cases: Hierarchical agents are best suited for systems with clear task hierarchies and well-defined subtasks. Manufacturing control systems: Coordinating different stages of production processes. Building automation: Managing basic systems like HVAC and lighting through layered control. Robotic task planning: Breaking down simple robotic tasks into basic movements and actions. 7. Multi-agent System (MAS) A multi-agent system involves multiple autonomous agents interacting within a shared environment, working independently or cooperatively to achieve individual or collective goals. While often confused with more advanced AI systems, traditional MAS focuses on relatively simple agents interacting through basic protocols and rules. Types of multi-agent systems: Cooperative systems: Agents share information and resources to achieve common goals. For example, multiple robots working together on basic assembly tasks. Competitive systems: Agents compete for resources following defined rules. Like multiple bidding agents in a simple auction system. Mixed systems: Combines both cooperative and competitive behaviors, such as agents sharing some information while competing for limited resources. Key components: Communication protocols: Define how agents exchange information. Interaction rules: Specify how agents can interact and what actions are permitted. Resource management: Methods for handling shared resources between agents. Coordination mechanisms: Systems for organizing agent activities and preventing conflicts. Use cases: MAS is best suited for scenarios with clear interaction rules and relatively simple agent behaviors. Warehouse management: Multiple robots coordinating to move and sort items. Basic manufacturing: Coordinating simple assembly tasks between multiple machines. Resource allocation: Managing shared resources like processing time or storage space.
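To make the utility-based idea above concrete, here is a small sketch that scores candidate settings for a smart building by a weighted utility over comfort and energy cost and picks the highest-scoring one. The candidate states, weights, and scoring function are invented for illustration; a real system would estimate them from sensor data and operating costs.

# Illustrative utility-based choice: balance comfort against energy cost.
candidate_settings = [
    {"name": "eco",    "comfort": 0.60, "energy_cost": 0.20},
    {"name": "normal", "comfort": 0.80, "energy_cost": 0.50},
    {"name": "max",    "comfort": 0.95, "energy_cost": 0.90},
]

def utility(state, w_comfort=1.0, w_energy=0.8):
    # Higher comfort raises utility; higher energy cost lowers it (made-up weights).
    return w_comfort * state["comfort"] - w_energy * state["energy_cost"]

best = max(candidate_settings, key=utility)
print(f"Chosen setting: {best['name']} (utility = {utility(best):.2f})")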
  • #15: Rational Agents and Ethical Foundations in Decision-Making When designing rational agents in artificial intelligence, key ethical principles and decision-making concepts guide their behavior. These include consequentialism, utilitarianism, and specific definitions related to rational agency. 1. Consequentialism Consequentialism evaluates actions based on their outcomes. For AI systems, this means assessing decisions by the results they produce rather than the intentions or processes behind them. Example: A delivery drone chooses a flight path that avoids high-traffic areas. The decision is deemed rational and ethical because it minimizes the risk of accidents (positive consequence). 2. Utilitarianism Utilitarianism, a subset of consequentialism, focuses on maximizing happiness or well-being for the greatest number of people. In AI, this principle can be applied to design systems that prioritize outcomes beneficial to society. Example: A healthcare scheduling AI allocates hospital resources to maximize the number of patients treated effectively. It chooses strategies that improve overall patient outcomes, aligning with utilitarian principles. 3. Definition of a Rational Agent A rational agent is defined as: "For each possible percept sequence, a rational agent should select an action that maximizes its expected performance measure, given the evidence provided by the percept sequence and the agent’s built-in knowledge." Key terms in this definition include: Percept Sequence: The history of inputs or observations received by the agent. Example: For a self-driving car, a percept sequence includes sensor readings like traffic data, road conditions, and GPS signals. Action Selection: The agent evaluates possible actions and selects the one that maximizes its expected performance measure. Maximizing Expected Performance: The goal is to choose actions that, on average, lead to the best outcomes under uncertainty. 4. Performance Measure A performance measure is an objective criterion for evaluating the success of an agent’s behavior. Often referred to as a utility function or reward function, it defines what the agent aims to achieve. Example: A chess-playing AI might use a performance measure based on winning, drawing, or losing games. The utility function assigns higher rewards to strategies that increase the likelihood of winning. 5. Expectation Expectation represents the average outcome across all possible situations that might arise, accounting for uncertainty and variability in the environment. Example: A weather-predicting AI calculates the expected rainfall by averaging data from multiple models and historical trends. It uses this expectation to guide decisions like irrigation schedules. Connecting Rational Agents to Ethical Foundations Rational agents embody the principles of consequentialism and utilitarianism when their performance measures align with ethical goals: Consequentialism in Rational Agents: Agents evaluate actions by their consequences using performance measures. Utilitarianism in Rational Agents: Agents prioritize actions that maximize aggregate utility (e.g., happiness, well-being). Practical Applications and Challenges In designing rational agents: Performance Measure Selection: Designers must carefully define performance measures to reflect ethical and societal goals. Poorly defined measures can lead to unintended consequences. Example: An AI trained to maximize profits might exploit users unless fairness and well-being are included in the utility function. 
Handling Uncertainty: Rational agents rely on expectations, which can be influenced by incomplete or biased data. By combining rationality with ethical foundations like consequentialism and utilitarianism, AI systems can act in ways that are both intelligent and aligned with societal values.
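The "expectation" used in the definition above is just a probability-weighted average over possible outcomes. The numbers below are invented for a delivery-drone route choice; they only illustrate how "maximizing expected performance" is computed, not real data.

# Expected performance = sum over outcomes of probability * value (made-up numbers).
routes = {
    # route name: list of (probability, performance score) pairs
    "coastal": [(0.7, 10), (0.3, 2)],   # usually fast, sometimes delayed by storms
    "inland":  [(0.9, 7),  (0.1, 5)],   # slower but more predictable
}

def expected_performance(outcomes):
    return sum(p * value for p, value in outcomes)

for name, outcomes in routes.items():
    print(f"{name}: expected performance = {expected_performance(outcomes):.2f}")

best = max(routes, key=lambda r: expected_performance(routes[r]))
print(f"Rational choice under these assumptions: {best}")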
  • #16: 1. Rationality as an Ideal Rationality is an ideal standard of behavior for agents. It implies that an agent is designed to make the best possible decisions based on its goals, knowledge, and capabilities. However, this does not mean it is possible to design an agent that no one could ever improve upon. Example: Imagine a chess-playing AI like AlphaZero. While it may play at a superhuman level, someone might still develop a better chess AI in the future. Rationality, in this context, is about maximizing performance within current constraints, not achieving unassailable perfection. 2. Rationality ≠ Omniscience Being rational does not mean the agent is all-knowing. Mistakes can occur if the agent lacks sufficient information or if the percepts (inputs) it receives are ambiguous or incomplete. Rationality is about making the best decision based on the available information, not guaranteeing flawless outcomes. Example: A self-driving car may encounter a situation where sensors fail to detect a hidden obstacle due to poor lighting. Based on the percepts it receives, the car makes a rational decision, but it could still make a mistake because it is not omniscient. 3. Rationality ≠ Perfection A rational agent is designed to maximize expected outcomes, not necessarily achieve the best possible outcome in every single instance. This distinction highlights the probabilistic nature of decision-making under uncertainty. Example: A delivery drone chooses a route to avoid predicted bad weather. Despite this rational choice, unexpected storms may still delay the delivery. The decision was rational because it maximized the expected outcome based on available weather forecasts. 4. Exploration and Learning are Rational A rational agent should explore and learn to enhance its decision-making capabilities over time. Using percepts (observations) to supplement prior knowledge allows the agent to adapt to changing environments and become more autonomous. Example: A robotic vacuum cleaner might initially map a new house incorrectly. Over time, as it gathers perceptual data about the layout, it refines its map and becomes more efficient at cleaning. This process of exploration and learning is rational behavior. 5. Bounded Rationality Rationality is often constrained by the agent's physical and computational limitations, such as memory capacity, processing power, or sensor accuracy. This concept, known as bounded rationality, acknowledges that agents operate within practical limitations. Example: A smartphone assistant like Siri cannot process complex, multi-sentence user queries instantly due to hardware and software constraints. It operates rationally within these limits, providing the best response it can based on its programming and resources. In summary, rationality is an idealized concept guiding the design of intelligent agents. It focuses on making the best possible decisions given the agent's goals, knowledge, and constraints, without requiring omniscience or perfection. Rational agents are designed to explore, learn, and adapt within their bounded capacities.
  • #17: Analysis of the Vacuum-Cleaner World
1. Performance Measure
The performance measure defines the success of the agent's behavior. In the context of the vacuum-cleaner world, possible performance measures could include: The total amount of dirt cleaned over time. Minimizing the time taken to clean all dirty locations. Energy efficiency, such as minimizing unnecessary moves or actions.
For example: Performance Measure = Total Dirt Cleaned: The agent's score increases each time it sucks up dirt. Performance Measure = Cleanliness of the Environment: The agent aims to keep both locations consistently clean. Performance Measure = Efficiency: The agent minimizes actions while ensuring both locations are clean.
2. Rationality of the Agent Program
To determine if the agent program is rational, we evaluate whether it selects actions that maximize the expected performance measure based on its percepts and built-in knowledge.
Agent Behavior Analysis: If the agent perceives that the current location is dirty, it executes the Suck action. If the location is clean and the agent is at A, it moves to B (action = Right). If the location is clean and the agent is at B, it moves to A (action = Left).
Strengths: The agent reacts appropriately to dirt by cleaning it (Suck when the status is dirty). The agent ensures it visits both locations by moving back and forth between them when no dirt is detected.
Limitations: The agent does not adapt dynamically if a location becomes dirty after it has been cleaned. The program may waste energy by continuously moving between A and B, even when the environment is clean.
Conclusion on Rationality
The agent program is rational if the performance measure is focused solely on cleaning dirt when it is detected. In that case, the agent maximizes cleanliness based on its percepts and knowledge. However, the program is not rational if the performance measure includes energy efficiency or minimizing unnecessary actions, as it moves between locations even when no dirt is present.
Improvements for Better Rationality
To enhance the rationality of the agent:
Include a NoOp Action: The agent could remain idle if no dirt is detected, conserving energy. Example: if location is clean and no new dirt is detected, return NoOp.
Incorporate Memory: The agent could remember previously cleaned locations and revisit them only if percepts indicate new dirt. Example: Maintain a "clean status" for each location in memory.
Dynamic Exploration: The agent could periodically check locations for dirt instead of continuously moving.
With these improvements, the agent would better align with performance measures that value efficiency and adaptivity.

def Vacuum_Agent(location, status):
    """
    Vacuum-Agent function that takes the current location and status as input
    and returns an appropriate action.

    :param location: Current location of the vacuum ('A' or 'B')
    :param status: Current status of the location ('Clean' or 'Dirty')
    :return: Action to be performed ('Suck', 'Right', or 'Left')
    """
    if status == "Dirty":
        return "Suck"
    elif location == "A":
        return "Right"
    elif location == "B":
        return "Left"

# Example usage
location = "A"
status = "Dirty"
action = Vacuum_Agent(location, status)
print(f"Location: {location}, Status: {status} -> Action: {action}")

location = "A"
status = "Clean"
action = Vacuum_Agent(location, status)
print(f"Location: {location}, Status: {status} -> Action: {action}")

location = "B"
status = "Clean"
action = Vacuum_Agent(location, status)
print(f"Location: {location}, Status: {status} -> Action: {action}")
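Following up on the improvements suggested above, here is one possible sketch of a vacuum agent that remembers the last observed status of each square and returns NoOp once it believes both squares are clean. It is only an illustration of the "memory + NoOp" idea under the stated assumptions (two squares, statuses do not change behind the agent's back), not the textbook agent program.

# Sketch of an improved vacuum agent with memory and a NoOp action (illustrative).
class StatefulVacuumAgent:
    def __init__(self):
        # Remember the last observed status of each square; None means unknown.
        self.believed_status = {"A": None, "B": None}

    def __call__(self, location, status):
        self.believed_status[location] = status
        if status == "Dirty":
            return "Suck"
        other = "B" if location == "A" else "A"
        # Move only if the other square might still be dirty.
        if self.believed_status[other] != "Clean":
            return "Right" if location == "A" else "Left"
        return "NoOp"

agent = StatefulVacuumAgent()
print(agent("A", "Dirty"))   # Suck
print(agent("A", "Clean"))   # Right (B's status is still unknown)
print(agent("B", "Clean"))   # NoOp (both squares now believed clean)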
  • #19: PEAS: Performance, Environment, Actuators, and Sensors
The PEAS framework specifies the problem a rational agent is designed to solve. It helps define the task environment for an agent by breaking it into four components:
Performance Measure (P): Specifies the criteria used to evaluate the agent's success.
Environment (E): Describes the external setting or world in which the agent operates.
Actuators (A): The mechanisms or tools the agent uses to perform actions in the environment.
Sensors (S): The mechanisms or tools the agent uses to perceive the environment.
Example: Vacuum-Cleaner World
1. Performance Measure (P): The agent is evaluated based on: Amount of dirt cleaned. Minimizing energy usage (e.g., minimizing unnecessary moves). Maintaining a clean environment over time. Example: A higher score is achieved by cleaning all dirt quickly and efficiently.
2. Environment (E): A simplified grid world with two locations: A and B. Each location can have a status of Dirty or Clean. The agent operates in this deterministic and observable environment.
3. Actuators (A): The agent has the following actuators: Suck: Cleans dirt from the current location. Move Right: Moves to the location on the right (from A to B). Move Left: Moves to the location on the left (from B to A). NoOp: Performs no action (useful when the environment is clean).
4. Sensors (S): The agent's sensors provide: Location: The current position of the agent (either A or B). Status: The cleanliness of the current location (Dirty or Clean).
PEAS Summary for Vacuum-Cleaner World
- Performance Measure: Maximize cleanliness, minimize energy usage and unnecessary movements.
- Environment: Two locations (A, B), each either Dirty or Clean.
- Actuators: Suck, Move Left, Move Right, NoOp.
- Sensors: Location (A or B), Status (Clean or Dirty).
This specification ensures clarity in designing and evaluating the agent's behavior.
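One convenient way to keep a PEAS specification alongside the code is as a plain data structure. The dictionary below simply restates the vacuum-world PEAS description given above; the field names themselves are an arbitrary choice for this sketch.

# The vacuum-world PEAS description captured as a plain data structure (illustrative).
vacuum_peas = {
    "performance_measure": ["maximize cleanliness", "minimize energy usage",
                            "minimize unnecessary movements"],
    "environment": {"locations": ["A", "B"], "statuses": ["Clean", "Dirty"]},
    "actuators": ["Suck", "Left", "Right", "NoOp"],
    "sensors": ["location", "status"],
}

for component, description in vacuum_peas.items():
    print(f"{component}: {description}")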
  • #20: PEAS Example: Automated Taxi Driver
The PEAS framework for an automated taxi driver defines the problem space and helps structure its design. Here's how the PEAS framework applies:
1. Performance Measure (P): The performance measure evaluates the success of the taxi driver. Examples include: Passenger satisfaction (e.g., smooth and safe ride). Minimized travel time for passengers. Fuel efficiency (e.g., minimizing fuel consumption or battery usage for electric taxis). Adherence to traffic rules to avoid accidents or penalties. Maximizing profits (e.g., efficient routing to pick up more passengers). A composite score could combine these factors, balancing safety, efficiency, and customer satisfaction.
2. Environment (E): The environment describes the world in which the taxi operates: Urban city environment with streets, traffic signals, pedestrians, and other vehicles. Dynamic conditions, such as changing weather, traffic jams, and road closures. Passenger-related elements, such as pick-up and drop-off locations. Geographical boundaries, like the taxi's operating area.
3. Actuators (A): The actuators are the components that allow the taxi to act in its environment: Steering wheel: For navigating the taxi. Accelerator and brakes: For controlling speed and stopping. Gear system: For moving forward or reversing. Indicators and horn: For communication with other road users. Doors: For allowing passengers to enter and exit. Display screen: To communicate with passengers (e.g., route, fare, and time).
4. Sensors (S): The sensors provide the taxi with information about the environment and its internal state: GPS: For location tracking and navigation. Cameras: For detecting road signs, lanes, pedestrians, and vehicles. LiDAR or Radar: For sensing nearby objects and distances. Speedometer: To measure the taxi's speed. Fuel gauge: To monitor fuel or battery levels. Microphones: To detect audio signals (e.g., honking or verbal instructions from passengers). Internal sensors: For detecting passenger presence (e.g., weight sensors on seats).
PEAS Summary for Automated Taxi Driver
- Performance Measure: Passenger satisfaction, safe driving, efficient fuel usage, rule adherence, profits.
- Environment: Urban streets, traffic, pedestrians, passengers, dynamic weather, and road conditions.
- Actuators: Steering wheel, accelerator, brakes, gears, indicators, horn, doors, display.
- Sensors: GPS, cameras, LiDAR, radar, speedometer, fuel gauge, microphones, internal sensors.
This PEAS specification helps guide the development of an automated taxi system by clearly defining what it needs to achieve, the challenges it faces, and the tools it can use.
  • #21: PEAS Example: Spam Filter
The PEAS framework applied to a spam filter provides a structured approach to understanding its design and function.
1. Performance Measure (P): The performance measure evaluates the effectiveness of the spam filter. Examples include: Accuracy: Correctly classifying emails as spam or non-spam. Precision: Minimizing false positives (legitimate emails marked as spam). Recall: Minimizing false negatives (spam emails marked as legitimate). Efficiency: Speed of processing and classifying emails. User satisfaction: Ensuring minimal user intervention for reclassification.
2. Environment (E): The environment represents the system and external factors the spam filter interacts with: Email inboxes: Containing legitimate and spam emails. Users: Who may manually classify or correct spam decisions. Email servers: Providing incoming email streams. Attackers: Sending spam emails with evolving techniques (e.g., obfuscation, phishing).
3. Actuators (A): The actuators are the actions the spam filter can perform: Marking emails: Classify emails as either "Spam" or "Not Spam." Moving emails: Relocating spam emails to a designated spam folder. Alerting users: Notifying users about potentially harmful emails (e.g., phishing attempts). Learning updates: Adapting its classification model based on user feedback.
4. Sensors (S): The sensors provide the spam filter with the necessary input to make decisions: Email content: Text, links, and attachments in the email body. Email metadata: Sender information, subject lines, and timestamps. User feedback: Corrections to the spam classification (e.g., marking spam emails as legitimate and vice versa). Spam databases: External sources for known spam patterns or blacklisted senders. Behavior patterns: Frequency of emails from certain senders, suspicious keywords, or abnormal activity.
PEAS Summary for Spam Filter
- Performance Measure: High accuracy, low false positives/negatives, speed, user satisfaction.
- Environment: Email inbox, users, email servers, spammers, evolving spam techniques.
- Actuators: Mark emails, move to spam folder, notify users, update learning models.
- Sensors: Email content, metadata, user feedback, spam databases, behavior patterns.
This PEAS framework helps design a spam filter that is effective, adaptable to new threats, and user-friendly while minimizing errors and improving classification accuracy over time.
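The spam-filter performance measures listed above (accuracy, precision, recall) can be computed directly from counts of classification outcomes. The counts in the sketch below are invented purely to show the arithmetic.

# Spam-filter performance measures from a made-up confusion matrix.
tp = 90    # spam correctly marked as spam (true positives)
fp = 5     # legitimate mail wrongly marked as spam (false positives)
fn = 10    # spam that reached the inbox (false negatives)
tn = 895   # legitimate mail correctly delivered (true negatives)

accuracy  = (tp + tn) / (tp + tn + fp + fn)
precision = tp / (tp + fp)   # of everything marked spam, how much really was spam
recall    = tp / (tp + fn)   # of all real spam, how much was caught

print(f"accuracy={accuracy:.3f}, precision={precision:.3f}, recall={recall:.3f}")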
  • #24: Explanation with Real-Time Examples
1. Observability
Fully Observable: The agent has complete and accurate knowledge of the environment at any given time. Example: A chess game where the board and positions of all pieces are visible to both players.
Partially Observable: The agent has limited information about the environment due to missing data or constraints. Example: A poker game where players can see their cards but not their opponents' cards.
Hidden (Unobservable): The agent has no direct information about the environment and must infer it entirely. Example: Diagnosing a malfunction in a system without any direct sensors or logs, relying solely on external symptoms.
2. Predictability
Deterministic: The outcome of actions is entirely predictable and has no uncertainty. Example: A calculator performing mathematical operations (e.g., 2 + 2 always equals 4).
Strategic: The outcome depends on the actions of other agents in the environment. Example: A game of chess where the next move depends on the strategy of the opponent.
Stochastic: The outcome involves randomness or uncertainty. Example: Weather forecasting, where outcomes are influenced by probabilistic factors.
3. Interaction
Episodic: The agent's actions in one episode do not affect future episodes. Each task is independent. Example: Image classification tasks where each image is evaluated independently.
Sequential: The agent's actions in one step influence future states and actions. Example: Autonomous driving, where current decisions (e.g., turning left) affect future scenarios (e.g., encountering traffic).
4. Agents
Single-Agent: The agent operates in an environment without interaction with other agents. Example: A robot vacuum cleaner operating in a house without interference from other robots.
Multi-Agent: Multiple agents interact in the environment. Competitive: Agents compete against each other to achieve their goals. Example: A soccer match where teams compete to score goals. Cooperative: Agents work together to achieve a shared objective. Example: Multiple robots collaborating to move a heavy object.
5. Time
Static: The environment remains unchanged unless acted upon by the agent. Example: A crossword puzzle where the clues and grid remain static.
Dynamic: The environment changes over time, even if the agent takes no action. Example: A stock trading environment where prices fluctuate constantly.
6. State
Discrete: The environment consists of a finite number of states, actions, or percepts. Example: A board game like tic-tac-toe where there are limited positions and moves.
Continuous: The environment has an infinite number of possible states, actions, or percepts. Example: Controlling a drone in a 3D space where position and velocity are continuous variables.
Real-Time Example Summary
- Observability: Fully (Chess), Partial (Poker), Hidden (System Diagnosis).
- Predictability: Deterministic (Calculator), Strategic (Chess), Stochastic (Weather).
- Interaction: Episodic (Image Classification), Sequential (Autonomous Driving).
- Agents: Single-Agent (Vacuum Cleaner), Multi-Agent (Soccer Match, Collaboration).
- Time: Static (Crossword Puzzle), Dynamic (Stock Market).
- State: Discrete (Tic-Tac-Toe), Continuous (Drone Control).
This classification aids in designing agents that are tailored to specific environments and tasks.
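These six dimensions can be recorded as a small structured description per task, which is useful when deciding what kind of agent design a problem calls for. The two examples below re-encode entries consistent with the summary above (a crossword puzzle and automated taxi driving); the class and field names are choices made for this sketch.

# Encoding task environments along the six dimensions discussed above (illustrative).
from dataclasses import dataclass

@dataclass
class TaskEnvironment:
    observability: str    # "fully", "partially", or "hidden"
    predictability: str   # "deterministic", "strategic", or "stochastic"
    interaction: str      # "episodic" or "sequential"
    agents: str           # "single" or "multi"
    time: str             # "static" or "dynamic"
    state: str            # "discrete" or "continuous"

crossword = TaskEnvironment("fully", "deterministic", "sequential", "single", "static", "discrete")
taxi = TaskEnvironment("partially", "stochastic", "sequential", "multi", "dynamic", "continuous")

print(crossword)
print(taxi)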
  • #38: Artificial Intelligence (AI) agents are designed to perform specific tasks by making decisions based on data and predefined rules. These agents vary in complexity and functionality, depending on their design and purpose. Let’s explore the different types of AI agents, including Learning Agents, with real-time examples to make the concepts clearer: 1. Simple Reflex Agents Description: These are the most basic AI agents that operate based on current data and predefined condition-action rules. They don’t have memory or the ability to learn from past experiences. Real-Time Example: Smart Thermostat: A smart thermostat adjusts the temperature based on the current room temperature. If the room is too hot, it turns on the cooling system; if it’s too cold, it turns on the heating system. It doesn’t consider past temperatures or future predictions—it simply reacts to the current condition. 2. Model-Based Reflex Agents Description: These agents are more advanced than simple reflex agents. They maintain an internal model of the world, which helps them make decisions even when the environment is partially observable. Real-Time Example: Self-Driving Cars: A self-driving car uses sensors and cameras to create an internal model of its surroundings. If a pedestrian suddenly crosses the road, the car uses its internal model to predict the pedestrian’s movement and decides whether to brake or steer away. It doesn’t just react to the current sensor data but also considers the context of the situation. 3. Goal-Based Agents Description: These agents make decisions based on how close they are to achieving a specific goal. They plan and take actions to reach that goal. Real-Time Example: Fitness App: A fitness app like MyFitnessPal tracks your daily calorie intake and exercise. If your goal is to lose 10 pounds in a month, the app suggests workouts and meal plans based on your progress. It constantly evaluates how close you are to your goal and adjusts its recommendations accordingly. 4. Utility-Based Agents Description: These agents go beyond just achieving goals—they also consider the quality of the outcomes. They use utility functions to evaluate different states and choose actions that maximize expected utility. Real-Time Example: Investment Advisor Algorithm: Platforms like Betterment or Wealthfront use utility-based AI agents to manage investments. They don’t just aim to grow your portfolio; they also consider factors like risk tolerance, market conditions, and tax implications. The algorithm evaluates different investment options and chooses the one that maximizes your financial well-being while minimizing risk. 5. Learning Agents Description: These agents are the most advanced type of AI agents. They have the ability to learn from past experiences and improve their performance over time. They consist of four main components: Learning Element: Responsible for improving the agent’s performance. Performance Element: Makes decisions based on what the agent has learned. Critic: Provides feedback on the agent’s actions. Problem Generator: Suggests new actions to explore and learn from. Real-Time Example: Recommendation Systems: Platforms like Netflix or Spotify use learning agents to suggest movies, shows, or music based on your past behavior. For example, if you frequently watch action movies, the system learns your preferences and recommends similar content. Over time, it improves its recommendations by analyzing your feedback (e.g., whether you liked or skipped a suggestion). 
Comparison of AI Agents in a Real-Time Scenario: Smart Home System Imagine a smart home system that uses different types of AI agents to manage various tasks: Simple Reflex Agent: Example: A motion sensor light turns on when it detects movement in a room and turns off after a few minutes of inactivity. It doesn’t consider past movements or future predictions—it simply reacts to the current condition. Model-Based Reflex Agent: Example: A smart security camera detects a person at the door. It uses an internal model to determine if the person is a family member (based on facial recognition) or a stranger. If it’s a stranger, it sends an alert to the homeowner. Goal-Based Agent: Example: A smart thermostat aims to maintain a comfortable temperature while minimizing energy usage. It plans ahead by learning your daily routine and adjusting the temperature before you wake up or return home. Utility-Based Agent: Example: A smart energy management system evaluates the cost of electricity at different times of the day. It decides when to run appliances like the washing machine or dishwasher to minimize energy costs while ensuring your tasks are completed on time. Learning Agent: Example: A smart home assistant like Amazon Alexa learns your daily habits over time. If you regularly ask for the weather forecast at 7 AM, it starts providing the information automatically without being asked. It also improves its responses based on your feedback and usage patterns. Conclusion AI agents come in various forms, each suited for different tasks and levels of complexity. Simple reflex agents are ideal for straightforward tasks, while model-based, goal-based, and utility-based agents handle more complex scenarios by considering internal models, goals, and the quality of outcomes. Learning agents take this a step further by improving their performance over time through experience. By understanding these types of agents, we can design smarter systems that improve efficiency, accuracy, and user experience in real-world applications like smart homes, fitness tracking, financial management, and personalized recommendations.
  • #40: Simple reflex agents
The simplest kind of agent is the simple reflex agent. These agents select actions on the basis of the current percept, ignoring the rest of the percept history. For example, the vacuum agent whose agent function is tabulated in Figure 2.3 is a simple reflex agent, because its decision is based only on the current location and on whether that location contains dirt. An agent program for this agent is shown in Figure 2.8.
Notice that the vacuum agent program is very small indeed compared to the corresponding table. The most obvious reduction comes from ignoring the percept history, which cuts down the number of possibilities from 4^T to just 4. A further, small reduction comes from the fact that, when the current square is dirty, the action does not depend on the location.
Imagine yourself as the driver of the automated taxi. If the car in front brakes, and its brake lights come on, then you should notice this and initiate braking. In other words, some processing is done on the visual input to establish the condition we call “The car in front is braking.” Then, this triggers some established connection in the agent program to the action “initiate braking.” We call such a connection a condition–action rule, written as
if car-in-front-is-braking then initiate-braking.
Humans also have many such connections, some of which are learned responses (as for driving) and some of which are innate reflexes (such as blinking when something approaches the eye). In the course of the book, we will see several different ways in which such connections can be learned and implemented.
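The structure described here (interpret the current percept, match a condition-action rule, return that rule's action) can be sketched roughly as follows. The rule list and the default rule are simplified stand-ins for the book's pseudocode, not a reproduction of it.

# Rough sketch of a simple reflex agent: current percept -> matched rule -> action.
rules = [
    # (condition, action) pairs; each condition tests only the current percept.
    (lambda p: p.get("car_in_front_is_braking"), "initiate-braking"),
    (lambda p: True, "keep-driving"),   # default rule when nothing else matches
]

def simple_reflex_agent(percept):
    state = percept                    # interpret-input is trivial in this sketch
    for condition, action in rules:
        if condition(state):           # rule-match: first matching rule wins
            return action

print(simple_reflex_agent({"car_in_front_is_braking": True}))    # initiate-braking
print(simple_reflex_agent({"car_in_front_is_braking": False}))   # keep-driving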
  • #42: Model-based reflex agents
The most effective way to handle partial observability is for the agent to keep track of the part of the world it can’t see now. That is, the agent should maintain some sort of internal state that depends on the percept history and thereby reflects at least some of the unobserved aspects of the current state. For the braking problem, the internal state is not too extensive—just the previous frame from the camera, allowing the agent to detect when two red lights at the edge of the vehicle go on or off simultaneously. For other driving tasks such as changing lanes, the agent needs to keep track of where the other cars are if it can’t see them all at once.
Updating this internal state information as time goes by requires two kinds of knowledge to be encoded in the agent program. First, we need some information about how the world evolves independently of the agent—for example, that an overtaking car generally will be closer behind than it was a moment ago. Second, we need some information about how the agent’s own actions affect the world—for example, that when the agent turns the steering wheel clockwise, the car turns to the right or that after driving for five minutes northbound on the freeway one is usually about five miles north of where one was five minutes ago. This knowledge about “how the world works”—whether implemented in simple Boolean circuits or in complete scientific theories—is called a model of the world. An agent that uses such a model is called a model-based agent.
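As a rough sketch of this structure, the agent below keeps one piece of internal state (the previously observed brake-light reading) and combines it with the current percept to decide whether the car in front has just started braking. The percept format and the one-frame model are assumptions made for the example.

# Sketch of a model-based reflex agent: internal state tracks what the current
# percept alone cannot reveal (here, whether brake lights just switched on).
class ModelBasedBrakingAgent:
    def __init__(self):
        self.prev_brake_lights = None   # internal state: last observed frame

    def update_state(self, percept):
        # Model: lights that are on now but were off a frame ago mean "braking".
        just_started_braking = percept["brake_lights_on"] and self.prev_brake_lights is False
        self.prev_brake_lights = percept["brake_lights_on"]
        return just_started_braking

    def __call__(self, percept):
        if self.update_state(percept):
            return "initiate-braking"
        return "keep-driving"

agent = ModelBasedBrakingAgent()
print(agent({"brake_lights_on": False}))   # keep-driving
print(agent({"brake_lights_on": True}))    # initiate-braking (lights just came on)
print(agent({"brake_lights_on": True}))    # keep-driving (no new braking event)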
  • #48: Goal-based agents
Knowing about the current state of the environment is not always enough to decide what to do. For example, at a road junction, the taxi can turn left, turn right, or go straight on. The correct decision depends on where the taxi is trying to get to. In other words, as well as a current state description, the agent needs some sort of goal information that describes situations that are desirable—for example, being at the passenger’s destination. The agent program can combine this with information about the results of possible actions (the same information as was used to update internal state in the reflex agent) in order to choose actions that achieve the goal. Figure 2.13 shows the goal-based agent’s structure.
Sometimes goal-based action selection is straightforward, when goal satisfaction results immediately from a single action. Sometimes it will be trickier, when the agent has to consider long sequences of twists and turns to find a way to achieve the goal. Search (Chapters 3 to 6) and planning (Chapters 11 and 12) are the subfields of AI devoted to finding action sequences that achieve the agent’s goals.
Notice that decision making of this kind is fundamentally different from the condition–action rules described earlier, in that it involves consideration of the future—both “What will happen if I do such-and-such?” and “Will that make me happy?” In the reflex agent designs, this information is not explicitly represented, because the built-in rules map directly from percepts to actions. The reflex agent brakes when it sees brake lights. A goal-based agent, in principle, could reason that if the car in front has its brake lights on, it will slow down. Given the way the world usually evolves, the only action that will achieve the goal of not hitting other cars is to brake.
Although the goal-based agent appears less efficient, it is more flexible because the knowledge that supports its decisions is represented explicitly and can be modified. If it starts to rain, the agent can update its knowledge of how effectively its brakes will operate; this will automatically cause all of the relevant behaviors to be altered to suit the new conditions. For the reflex agent, on the other hand, we would have to rewrite many condition–action rules. The goal-based agent’s behavior can easily be changed to go to a different location. The reflex agent’s rules for when to turn and when to go straight will work only for a single destination; they must all be replaced to go somewhere new.
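Action selection that "considers the future" can be illustrated with a tiny breadth-first search over a made-up road map: the agent looks for a sequence of turns that reaches its goal rather than reacting to the current junction alone. The map, start, and goal below are invented for the example.

# Tiny goal-based planner: search for an action sequence that reaches the goal.
from collections import deque

# Hypothetical road map: junction -> {action: next junction}
road_map = {
    "A": {"left": "B", "straight": "C"},
    "B": {"right": "D"},
    "C": {"left": "D", "straight": "E"},
    "D": {"straight": "destination"},
    "E": {},
}

def plan(start, goal):
    # Breadth-first search over action sequences.
    frontier = deque([(start, [])])
    visited = {start}
    while frontier:
        state, actions = frontier.popleft()
        if state == goal:
            return actions
        for action, next_state in road_map.get(state, {}).items():
            if next_state not in visited:
                visited.add(next_state)
                frontier.append((next_state, actions + [action]))
    return None   # no sequence of actions reaches the goal

print(plan("A", "destination"))   # ['left', 'right', 'straight']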
  • #50: Utility-based agents
Goals alone are not really enough to generate high-quality behavior in most environments. For example, there are many action sequences that will get the taxi to its destination (thereby achieving the goal) but some are quicker, safer, more reliable, or cheaper than others. Goals just provide a crude binary distinction between “happy” and “unhappy” states, whereas a more general performance measure should allow a comparison of different world states according to exactly how happy they would make the agent if they could be achieved. Because “happy” does not sound very scientific, the customary terminology is to say that if one world state is preferred to another, then it has higher utility for the agent.
A utility function maps a state (or a sequence of states) onto a real number, which describes the associated degree of happiness. A complete specification of the utility function allows rational decisions in two kinds of cases where goals are inadequate. First, when there are conflicting goals, only some of which can be achieved (for example, speed and safety), the utility function specifies the appropriate tradeoff. Second, when there are several goals that the agent can aim for, none of which can be achieved with certainty, utility provides a way in which the likelihood of success can be weighed up against the importance of the goals.
In Chapter 16, we will show that any rational agent must behave as if it possesses a utility function whose expected value it tries to maximize. An agent that possesses an explicit utility function therefore can make rational decisions, and it can do so via a general-purpose algorithm that does not depend on the specific utility function being maximized. In this way, the “global” definition of rationality—designating as rational those agent functions that have the highest performance—is turned into a “local” constraint on rational-agent designs that can be expressed in a simple program.
The utility-based agent structure appears in Figure 2.14. Utility-based agent programs appear in Part V, where we design decision-making agents that must handle the uncertainty inherent in partially observable environments.
  • #51: Learning agents
We have described agent programs with various methods for selecting actions. We have not, so far, explained how the agent programs come into being. In his famous early paper, Turing (1950) considers the idea of actually programming his intelligent machines by hand. He estimates how much work this might take and concludes “Some more expeditious method seems desirable.” The method he proposes is to build learning machines and then to teach them. In many areas of AI, this is now the preferred method for creating state-of-the-art systems. Learning has another advantage, as we noted earlier: it allows the agent to operate in initially unknown environments and to become more competent than its initial knowledge alone might allow. In this section, we briefly introduce the main ideas of learning agents. In almost every chapter of the book, we will comment on opportunities and methods for learning in particular kinds of agents. Part VI goes into much more depth on the various learning algorithms themselves.
A learning agent can be divided into four conceptual components, as shown in Figure 2.15. The most important distinction is between the learning element, which is responsible for making improvements, and the performance element, which is responsible for selecting external actions. The performance element is what we have previously considered to be the entire agent: it takes in percepts and decides on actions. The learning element uses feedback from the critic on how the agent is doing and determines how the performance element should be modified to do better in the future.
The design of the learning element depends very much on the design of the performance element. When trying to design an agent that learns a certain capability, the first question is not “How am I going to get it to learn this?” but “What kind of performance element will my agent need to do this once it has learned how?” Given an agent design, learning mechanisms can be constructed to improve every part of the agent.
The critic tells the learning element how well the agent is doing with respect to a fixed performance standard. The critic is necessary because the percepts themselves provide no indication of the agent’s success. For example, a chess program could receive a percept indicating that it has checkmated its opponent, but it needs a performance standard to know that this is a good thing; the percept itself does not say so. It is important that the performance standard be fixed. Conceptually, one should think of it as being outside the agent altogether, because the agent must not modify it to fit its own behavior.
The last component of the learning agent is the problem generator. It is responsible for suggesting actions that will lead to new and informative experiences. The point is that if the performance element had its way, it would keep doing the actions that are best, given what it knows. But if the agent is willing to explore a little, and do some perhaps suboptimal actions in the short run, it might discover much better actions for the long run. The problem