Data Driven Control IEEE Paper
Abstract
This paper investigates a data-driven control approach for autonomous systems using reinforcement
learning (RL). The project focuses on stabilizing an inverted pendulum and robotic arm through
Q-learning, a model-free RL algorithm. The system dynamics, including torque and angular
velocities, are represented discretely to enable optimal control policy learning. Simulations
demonstrate the successful stabilization of both systems, showing the effectiveness of RL in control applications.
Keywords
Reinforcement learning, Q-learning, data-driven control, inverted pendulum, robotic arm, autonomous systems
I. Introduction
Optimal control has been a fundamental topic in control theory for decades, applied in numerous
fields such as robotics, missile guidance, and energy systems. However, for complex nonlinear
systems, traditional control methods often fall short. This paper presents a reinforcement learning
approach to stabilize two classic systems: an inverted pendulum and a robotic arm. Using
Q-learning, the agent learns to apply optimal torques to stabilize both systems, demonstrating the potential of data-driven methods for controlling nonlinear dynamics.
II. Related Work
Reinforcement learning, particularly Q-learning, has seen extensive application in control systems
due to its ability to learn optimal policies without requiring a model of the environment. Previous
studies have demonstrated its effectiveness in stabilizing various control systems. This work contributes to this line of research by applying Q-learning to autonomous systems with unknown
dynamics.
III. System Description
Two benchmark systems are considered:
1. Inverted Pendulum: A classic example of a nonlinear system, where the goal is to stabilize the pendulum in its upright position.
2. Robotic Arm: A multi-joint system where the objective is to apply optimal torques to achieve a desired joint configuration.
In both systems, the state is represented by the angular displacement and velocity, which are
discretized for use with Q-learning. The torque is treated as a discrete action, and the reward function penalizes deviation from the desired equilibrium, as formalized in the sketch below.
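One possible formalization of this discretized problem, given here only as an illustrative sketch (the weights c1 and c2 are assumptions and not values reported in this paper), is

    s = (θ, θ̇) ∈ S,   a = τ ∈ A,   r(s, a) = −(θ² + c1 θ̇² + c2 τ²),

where S and A are finite sets obtained by discretizing the angle/velocity ranges and the admissible torque range, and the reward is largest at the desired equilibrium (θ, θ̇) = (0, 0).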
IV. Methodology
A. Q-Learning Framework
Q-learning is a model-free reinforcement learning algorithm where the agent learns the optimal
policy by updating a Q-table based on observed rewards. The algorithm operates in discrete time
steps: at each step, the agent selects an action, observes the resulting reward, and updates its Q-table accordingly. Three design choices define the framework (a concrete sketch follows this list):
1. State Representation: The continuous state space (angular displacement and velocity) is discretized into a finite number of bins.
2. Action Representation: The torque applied to the pendulum and robotic arm is discretized into a finite set of admissible torque levels.
3. Reward Function: A custom reward function is used to penalize large deviations from equilibrium.
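To make these three choices concrete, the following Python sketch shows one possible discretization and reward function; the bin counts, torque range, and reward weights are illustrative assumptions rather than the exact values used in this work.

    import numpy as np

    # Illustrative discretization; the exact bin counts, torque levels, and
    # reward weights used in this work are not specified in the text above.
    N_THETA, N_OMEGA = 21, 21                        # bins for angle and angular velocity
    theta_bins = np.linspace(-np.pi, np.pi, N_THETA)
    omega_bins = np.linspace(-8.0, 8.0, N_OMEGA)
    torques = np.linspace(-2.0, 2.0, 5)              # finite set of admissible torques (N*m)

    def discretize(theta, omega):
        """Map a continuous state (theta, omega) to a pair of bin indices."""
        i = int(np.clip(np.digitize(theta, theta_bins) - 1, 0, N_THETA - 1))
        j = int(np.clip(np.digitize(omega, omega_bins) - 1, 0, N_OMEGA - 1))
        return i, j

    def reward(theta, omega, tau):
        """Penalize deviation from the upright equilibrium and large control effort."""
        return -(theta**2 + 0.1 * omega**2 + 0.001 * tau**2)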
B. Training Process
The agent is trained for 10,000 episodes, during which it explores and exploits different actions; the exploration rate decays over time. The Q-values are updated using the Bellman equation, written out below.
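The update referred to above is the standard Q-learning form of the Bellman equation,

    Q(s_k, a_k) <- Q(s_k, a_k) + α [ r_k + γ max_{a'} Q(s_{k+1}, a') − Q(s_k, a_k) ],

where α is the learning rate and γ the discount factor. A minimal Python sketch of the training loop, reusing the discretization and reward helpers above and assuming hypothetical reset_pendulum/step_pendulum environment functions (the learning rate, discount factor, epsilon schedule, and episode length are illustrative assumptions), is:

    import numpy as np

    alpha, gamma = 0.1, 0.99                         # assumed learning rate and discount factor
    eps, eps_decay, eps_min = 1.0, 0.999, 0.05       # assumed decaying exploration schedule
    Q = np.zeros((N_THETA, N_OMEGA, len(torques)))   # Q-table over discretized states and actions

    for episode in range(10_000):
        theta, omega = reset_pendulum()              # hypothetical: sample an initial condition
        for step in range(200):                      # assumed episode length
            s = discretize(theta, omega)
            # Epsilon-greedy selection: explore with probability eps, otherwise exploit.
            if np.random.rand() < eps:
                a = np.random.randint(len(torques))
            else:
                a = int(np.argmax(Q[s]))
            theta, omega = step_pendulum(theta, omega, torques[a])  # hypothetical dynamics step
            r = reward(theta, omega, torques[a])
            s_next = discretize(theta, omega)
            # Bellman (Q-learning) update.
            Q[s + (a,)] += alpha * (r + gamma * np.max(Q[s_next]) - Q[s + (a,)])
        eps = max(eps_min, eps * eps_decay)          # exploration rate decays over time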
V. Results
A. Pendulum Stabilization
The Q-learning agent successfully stabilized the inverted pendulum, bringing both the angular displacement and velocity to zero. The agent was able to adapt to various initial conditions, consistently returning the pendulum to its upright equilibrium.
B. Robotic Arm Stabilization
Similarly, the robotic arm was stabilized by the RL agent, which successfully applied torques to the joints to reach and maintain the desired configuration.
VI. Conclusion
This paper presented a data-driven, Q-learning-based approach to stabilizing autonomous systems such as the inverted pendulum and robotic arm. The results indicate that RL can effectively stabilize nonlinear and uncertain systems, providing a foundation for real-world deployment. Future work includes:
- Testing the trained policies on real-world systems to evaluate their performance in physical
environments.