2__Agents
2__Agents
CHAPTER 2
Oliver Schulte
Summer2011
https://www2.cs.sfu.ca/CourseCentral/310/oschulte/
chapter2.ppt
Outline
2
• Human agent:
– eyes, ears, and other organs for sensors;
– hands, legs, mouth, and other body parts for
actuators
• Robotic agent:
– cameras and infrared range finders for sensors
– various motors for actuators
Demo:
http://www.ai.sri.com/~oreilly/aima3ejava/aima3ejavademos.
html
• Rationality
– Performance measuring success
– Agents prior knowledge of environment
– Actions that agent can perform
– Agent’s percept sequence to date
Extremes
No autonomy – ignores environment/data
Complete autonomy – must act randomly/no
program
Example: baby learning to crawl
Ideal: design agents to have some autonomy
Possibly become more autonomous with experience
PEAS
10
• PEAS: Performance measure, Environment,
Actuators, Sensors
Non-deterministic environments
Have aspects beyond the control of the agent
Utility functions have to guess at changes in world
In non-episodic environments:
Agent has to plan ahead:
Current choice will affect future actions
Cross Word Poker Backgammon Taxi driver Part picking robot Image analysis
Sequential Sequential Sequential Sequential Episodic Episodic
Cross Word Poker Backgammon Taxi driver Part picking robot Image analysis
Discrete Discrete Discrete Conti Conti Conti
Cross Word Poker Backgammon Taxi driver Part picking robot Image analysis
Single Multi Multi Multi Single Single
Fully
Observable
yes
no
Deterministic no
yes
Certainty: Uncertainty
Search
Thermostat.
• Have a goal
A destination to get to
Performance element
is what was previously
the whole agent
Input sensor
Output action
Learning element
Modifies
performance
element.
Problem generator
Tries to solve the
problem differently
instead of optimizing.
Suggests exploring
new actions -> new
problems.
Artificial Intelligence a modern approach
Learning agents(Taxi driver)
34
Performance element
How it currently drives
Taxi driver Makes quick left turn across 3 lanes
Critics observe shocking language by passenger and other
drivers and informs bad action
Learning element tries to modify performance elements for
future
Problem generator suggests experiment out something
called Brakes on different Road conditions
Exploration vs. Exploitation
Learning experience can be costly in the short run
shocking language from other drivers
Less tip
Fewer passengers
Artificial Intelligence a modern approach
The Big Picture: AI for Model-Based
Agents
35
Planning
Action Reinforcement
Decision
Theory Learning
Game Theory
Knowledg Learnin
e Machine g
Logic
Learning
Probabilit
y Statistics
Heuristic
s
Inference
Artificial Intelligence a modern approach
The Picture for Reflex-Based Agents
36
Action
Reinforcement
Learning
Learnin
g
• Studied in AI, Cybernetics, Control Theory,
Biology, Psychology.