ppt Module-1 chapter-2.ppt - Google Slides
ppt Module-1 chapter-2.ppt - Google Slides
CHAPTER 1
BY
DR. MANJUNATHA P B
Outline
2
• Human agent:
– eyes, ears, and other organs for sensors;
– hands, legs, mouth, and other body parts for actuators
• Robotic agent:
– cameras and infrared range finders for sensors
– various motors for actuators
Demo:
http://www.ai.sri.com/~oreilly/aima3ejava/aima3ejavademos.h
tml
• Rationality
– Performance measuring success
– Agents prior knowledge of environment
– Actions that agent can perform
– Agent’s percept sequence to date
⚫ Extremes
⚪ No autonomy – ignores environment/data
⚪ Complete autonomy – must act randomly/no program
⚫ Non-deterministic environments
⚪ Have aspects beyond the control of the agent
Cross Word Poker Backgammon Taxi driver Part picking robot Image analysis
Deterministic Stochastic Stochastic Stochastic Stochastic Deterministic
⚫ In non-episodic environments:
⚪ Agent has to plan ahead:
Current choice will affect future actions
Cross Word Poker Backgammon Taxi driver Part picking robot Image analysis
Sequential Sequential Sequential Sequential Episodic Episodic
Cross Word Poker Backgammon Taxi driver Part picking robot Image analysis
Discrete Discrete Discrete Conti Conti Conti
Cross Word Poker Backgammon Taxi driver Part picking robot Image analysis
Single Multi Multi Multi Single Single
Fully
Observable
yes
no
Deterministic no
yes
Certainty: Uncertainty
Search
⚪ Utility-based agents
⚪ All these can be turned into learning agents
⚪ http://www.ai.sri.com/~oreilly/aima3ejava/aima3ejavad
emos.html
⚪ Thermostat.
⚫ Chess – openings, endings
⚪ Lookup table (not a good idea in general)
• Have a goal
⚪ A destination to get to
• Reflex agent breaks when it sees brake lights. Goal based agent
reasons
– Brake light -> car in front is stopping -> I should stop -> I should use brake
⚫ Performance element is
what was previously the
whole agent
⚫ Input sensor
⚫ Output action
⚫ Learning element
⚫ Modifies performance
element.
⚫ Problem generator
⚫ Tries to solve the problem
differently instead of
optimizing.
⚫ Suggests exploring new
actions -> new problems.
⚪ Performance element
How it currently drives
⚪ Taxi driver Makes quick left turn across 3 lanes
Critics observe shocking language by passenger and other drivers
and informs bad action
Learning element tries to modify performance elements for future
Problem generator suggests experiment out something called
Brakes on different Road conditions
⚪ Exploration vs. Exploitation
Learning experience can be costly in the short run
shocking language from other drivers
Less tip
Fewer passengers
Planning
Decision Theory
Action Reinforcement
Learning
Game Theory
Knowledge Learning
Logic Machine Learning
Probability Statistics
Heuristics
Inference
Action
Reinforcement
Learning
Learning