Unit 1: Reinforcement Learning
There are four main elements of reinforcement learning:
1. Policy
2. Reward Signal
3. Value Function
4. Model
1) Policy: A policy defines the way an agent behaves at a given time. It maps the perceived states of the environment to the actions to be taken in those states. The policy is the core element of RL, as it alone can define the behavior of the agent. In some cases, it may be a simple function or a lookup table, while in other cases it may involve more extensive computation. A policy may be deterministic or stochastic:

For deterministic policy: a = π(s)
For stochastic policy: π(a | s) = P[At = a | St = s]
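
As an illustration of the two kinds of policy, here is a minimal Python sketch (not part of the original notes); the state names, action names, and probabilities are invented purely for demonstration.

import random

# Deterministic policy: a = pi(s), each state maps to exactly one action.
deterministic_policy = {"low_battery": "recharge", "high_battery": "search"}

def act_deterministic(state):
    return deterministic_policy[state]

# Stochastic policy: pi(a | s) = P[At = a | St = s], a probability
# distribution over actions for each state.
stochastic_policy = {
    "low_battery":  {"recharge": 0.9, "search": 0.1},
    "high_battery": {"recharge": 0.1, "search": 0.9},
}

def act_stochastic(state):
    actions, probs = zip(*stochastic_policy[state].items())
    return random.choices(actions, weights=probs, k=1)[0]

print(act_deterministic("low_battery"))  # always "recharge"
print(act_stochastic("high_battery"))    # "search" about 90% of the time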
2) Reward Signal: The goal of reinforcement learning is defined by the reward signal. At each step, the environment sends an immediate signal to the learning agent, and this signal is known as a reward signal. These rewards are given according to the good and bad actions taken by the agent, and the agent's main objective is to maximize the total reward it receives for good actions. The reward signal can also change the policy: if an action selected by the agent leads to a low reward, the policy may be adjusted to select other actions in the future.
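
To make concrete how a low reward can push the agent away from an action, here is a small hypothetical sketch (not from the notes) in which the agent keeps a running value estimate per action and greedily prefers whichever has earned more reward; the action names and step size are assumptions.

# Minimal sketch: one value estimate per action, nudged toward each observed reward.
values = {"action_a": 0.0, "action_b": 0.0}
alpha = 0.1  # step size (assumed value)

def update(action, reward):
    # Repeated low rewards lower the estimate, so the greedy choice below
    # drifts away from that action.
    values[action] += alpha * (reward - values[action])

def greedy_action():
    return max(values, key=values.get)

update("action_a", reward=-1.0)  # a bad outcome for action_a ...
update("action_b", reward=+1.0)  # ... and a good one for action_b
print(greedy_action())           # -> "action_b"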
3) Value Function: The value function gives information about how good the situation and action are and how much reward an agent can expect. A reward indicates the immediate signal for each good and bad action, whereas a value function specifies which states and actions are good for the future. The value function depends on the reward: without reward there could be no value, and the purpose of estimating values is to achieve more rewards.
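
One common way to formalize "how much reward an agent can expect" is the discounted return; the short sketch below is an illustrative assumption (the reward sequence and discount factor are made up) showing how the value of a state can be much larger than its immediate reward.

def discounted_return(rewards, gamma=0.9):
    # Value of a state as the discounted sum of the rewards that follow it.
    g = 0.0
    for t, r in enumerate(rewards):
        g += (gamma ** t) * r
    return g

# The immediate reward is small (1), but the rewards that follow are large,
# so the value of being in this state is much higher than 1.
future_rewards = [1, 5, 10, 10]
print(discounted_return(future_rewards))  # 1 + 0.9*5 + 0.81*10 + 0.729*10 = 20.89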
4) Model: The last element of reinforcement learning is the model, which mimics the behavior of the environment. With the help of the model, one can make inferences about how the environment will behave; for example, given a state and an action, the model can predict the next state and the next reward. The model is used for planning, which means it provides a way to choose a course of action by considering possible future situations before they are actually experienced. Approaches that solve RL problems with the help of a model are termed model-based methods, whereas an approach that works without a model is called a model-free approach.
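
As a minimal sketch of what "model" means here (an assumed example, not the notes' own), the environment is represented by a lookup table from (state, action) to a predicted (next state, reward), and planning simply queries that table before acting.

# Hypothetical one-step model of the environment: (state, action) -> (next_state, reward)
model = {
    ("s0", "left"):  ("s1", 0.0),
    ("s0", "right"): ("s2", 1.0),
}

def plan_one_step(state, actions=("left", "right")):
    # Model-based planning: query the model for every action and pick the one
    # with the best predicted reward, without touching the real environment.
    best_action, best_reward = None, float("-inf")
    for a in actions:
        _next_state, reward = model[(state, a)]
        if reward > best_reward:
            best_action, best_reward = a, reward
    return best_action

print(plan_one_step("s0"))  # -> "right", since the model predicts the higher reward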
Reinforcement Learning Applications
1. Robotics: RL is used for robot navigation, walking, juggling, etc.
2. Game Playing: RL is used in games such as tic-tac-toe, chess, etc.
3. Manufacturing: In automobile manufacturing companies, robots use deep reinforcement learning to pick goods and put them into containers.
Challenges of Reinforcement Learning
The agent learns based on the current state of the environment, and for a constantly changing environment it becomes difficult for the agent to keep its learned policy up to date.
3. Design of Reward Structure: For any real-world use case of RL, designing a suitable reward structure is a key challenge. Consider the game of chess: our agent begins to play the game with an absolute trial-and-error approach, so the rewards have to be designed carefully to guide that exploration toward good moves.
With continued research and development in RL, we'll surely break through all the challenges and resistance the technology currently faces in this sphere.