Reinforcement Learning and Transfer Learning: Simulation-Robot System For Object-Handling
Introduction
elements, e.g.: energy used and/or maintenance of components. Considering these
mishaps, recent analyses seek to make use of similar, previously processed data
so that the optimal decision can be reached more quickly.
An important paradigm through which robots develop acceptable control over their
own motors, and the one adopted in this project, is Reinforcement Learning, a
topic in the field of Machine Learning that enables computer programs and robots
to fulfill a given task through trial and error. Several tasks can currently be
taught this way, such as walking, folding shirts, and handling objects.
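The trial-and-error loop at the heart of Reinforcement Learning can be sketched with tabular Q-learning on a toy task. The environment below (a one-dimensional "reach the object" chain), its actions, and its hyperparameters are illustrative assumptions, not the project's actual setup:

```python
import random

# Illustrative trial-and-error learning: tabular Q-learning on a toy
# 1-D "reach the object" task. States 0..4; the agent starts at 0 and is
# rewarded only on reaching state 4. All names and numbers are assumptions.
N_STATES = 5
ACTIONS = [-1, +1]            # move left / move right
ALPHA, GAMMA, EPS = 0.5, 0.9, 0.2

Q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}

def step(state, action):
    """Deterministic transition; reward 1.0 on reaching the goal state."""
    nxt = min(max(state + action, 0), N_STATES - 1)
    return nxt, (1.0 if nxt == N_STATES - 1 else 0.0)

random.seed(0)
for _ in range(200):                       # 200 trial episodes
    s = 0
    while s != N_STATES - 1:
        # epsilon-greedy: explore occasionally, otherwise exploit
        if random.random() < EPS:
            a = random.choice(ACTIONS)
        else:
            a = max(ACTIONS, key=lambda act: Q[(s, act)])
        s2, r = step(s, a)
        # Q-learning update: move Q(s, a) toward r + gamma * max_b Q(s', b)
        Q[(s, a)] += ALPHA * (r + GAMMA * max(Q[(s2, b)] for b in ACTIONS) - Q[(s, a)])
        s = s2

# The greedy policy learned purely by trial and error: move right everywhere.
policy = {s: max(ACTIONS, key=lambda act: Q[(s, act)]) for s in range(N_STATES - 1)}
print(policy)
```

Real object-handling replaces this small table with high-dimensional visual states, which is where neural-network function approximators become necessary.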
Another sphere of study explored here is Transfer Learning. This area of
research is based on reusing the expertise obtained from optimizing one task to
improve another, provided the second task belongs to a related domain. This
approach offers several benefits over older methods, chief among them a
reduction of the processing load that would otherwise make the problem unfeasible.
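As a rough illustration of this reuse of expertise, the sketch below trains a Q-table on a short "source" chain task and uses it to warm-start a longer, related "target" task. The tasks, sizes, and hyperparameters are invented for illustration and are not the project's setup:

```python
import random

# Sketch of knowledge transfer: a Q-table learned on a short "source"
# chain task initializes learning on a longer "target" chain, so the
# target starts from useful values instead of zeros.
ACTIONS = [-1, +1]
ALPHA, GAMMA, EPS = 0.5, 0.9, 0.2

def train(n_states, episodes, q=None, seed=0):
    """Tabular Q-learning on a chain; the goal is the last state."""
    rng = random.Random(seed)
    old = q or {}
    # transferred values seed the new table; unseen states start at zero
    q = {(s, a): old.get((s, a), 0.0) for s in range(n_states) for a in ACTIONS}
    for _ in range(episodes):
        s = 0
        while s != n_states - 1:
            if rng.random() < EPS:
                a = rng.choice(ACTIONS)
            else:
                a = max(ACTIONS, key=lambda act: q[(s, act)])
            s2 = min(max(s + a, 0), n_states - 1)
            r = 1.0 if s2 == n_states - 1 else 0.0
            q[(s, a)] += ALPHA * (r + GAMMA * max(q[(s2, b)] for b in ACTIONS) - q[(s, a)])
            s = s2
    return q

def greedy_is_optimal(q, n_states):
    """In this chain, moving right is optimal from every non-goal state."""
    return all(q[(s, 1)] > q[(s, -1)] for s in range(n_states - 1))

q_source = train(5, 200)                  # learn the source task well
q_transfer = train(8, 50, q=q_source)     # warm start on the target task
q_scratch = train(8, 50)                  # cold start, same small budget
```

With the same small budget, the warm-started table typically reaches a sound greedy policy sooner than the cold start; in the project, the transferred knowledge would come from simulation rather than from a smaller version of the same chain.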
The combination of these two groups of techniques, used in similar ways in the
referenced articles ([1], [2], [3], and [4]), will be implemented to carry out
the proposed scientific investigations. The first is the development of
efficient Reinforcement Learning algorithms that optimize object-handling tasks
through self-learning with vision-based robots (based on activities from
settings such as the Amazon Picking Challenge). The second is the exploration
of different ways of transferring simulation-acquired knowledge to a real
robot, with the aim of minimizing the computational cost of learning the task
and, ultimately, yielding better results.
To achieve the proposed goals, this research will proceed through four steps of
scientific development:
1. Contextualization:
This step will be responsible for maintaining scientific coherence by reviewing
the academic literature, so that it is possible to build a map of the tools and
knowledge that could and should be used during the development of the project.
Some examples follow: classical Reinforcement Learning techniques, different
Neural Network architectures, Neural Networks applied to Reinforcement
Learning, and Evolutionary Algorithms for optimization.
Estimated duration: 3 months.
3. Test phase:
With the environment built, every viable approach found will be applied,
provided it fits within the time allotted for the project and appears to yield
promising results. It will then be possible to test new methods, such as Deep
Neural Networks, to evaluate the obtained algorithms.
Estimated duration: 6 months.
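As one small illustration of the Deep Neural Network direction mentioned in the test phase, the sketch below trains a tiny multilayer perceptron by backpropagation on XOR, a task no single linear unit can solve. The architecture, learning rate, and epoch count are illustrative assumptions only:

```python
import math
import random

# Sketch of the neural-network direction: a tiny multilayer perceptron
# trained by backpropagation on XOR. Sizes and hyperparameters are
# assumptions for illustration, not the project's configuration.
random.seed(1)

def sig(x):
    return 1.0 / (1.0 + math.exp(-x))

# one hidden layer with 2 units, one sigmoid output unit
W1 = [[random.uniform(-1, 1) for _ in range(2)] for _ in range(2)]
b1 = [0.0, 0.0]
W2 = [random.uniform(-1, 1) for _ in range(2)]
b2 = 0.0
DATA = [([0, 0], 0), ([0, 1], 1), ([1, 0], 1), ([1, 1], 0)]
LR = 0.5

def forward(x):
    h = [sig(W1[j][0] * x[0] + W1[j][1] * x[1] + b1[j]) for j in range(2)]
    y = sig(W2[0] * h[0] + W2[1] * h[1] + b2)
    return h, y

def total_loss():
    return sum((forward(x)[1] - t) ** 2 for x, t in DATA)

loss_before = total_loss()
for _ in range(5000):
    for x, t in DATA:
        h, y = forward(x)
        dy = 2 * (y - t) * y * (1 - y)           # error gradient at the output
        for j in range(2):
            dh = dy * W2[j] * h[j] * (1 - h[j])  # backpropagated to hidden unit j
            W2[j] -= LR * dy * h[j]
            for i in range(2):
                W1[j][i] -= LR * dh * x[i]
            b1[j] -= LR * dh
        b2 -= LR * dy
loss_after = total_loss()
```

In the actual test phase, much deeper networks would approximate value functions over camera images rather than fit a four-point truth table; the training loop, however, follows the same gradient-descent pattern.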