HW1 Questions

This homework assignment for Deep Reinforcement Learning focuses on training agents to make decisions through trial and error in various environments. Students will implement algorithms, create custom environments, and analyze performance metrics, with tasks including solving predefined environments and designing a grid-world problem. The assignment aims to enhance understanding of RL concepts and algorithm performance, with a submission deadline of February 16th, 2025.

Deep Reinforcement Learning

Professor Mohammad Hossein Rohban

Homework 1:

Introduction to RL
Designed By:

MohammadHasan Abbasi

Spring 2025
Deep Reinforcement Learning [Spring 2025]

Preface
Welcome to your first homework!
Reinforcement Learning (RL) is a fundamental branch of artificial intelligence that focuses on training
agents to make sequential decisions through trial and error. Unlike supervised learning, RL does not rely
on labeled data but instead learns optimal policies by interacting with an environment, receiving feedback
in the form of rewards.
This assignment is designed to provide hands-on experience with RL modeling, algorithm implementation,
and performance evaluation. Students will explore RL concepts through predefined environments and
custom-designed settings. The homework is structured into the following sections:
• Predefined Environments: You will select and solve two RL environments from a given set,
implementing stable-baselines3 algorithms such as PPO and DQN.
• Custom Environments: You will design and modify RL environments, experimenting with changes
in rewards, states, and environment dynamics.
• Algorithm Comparison: You will analyze the performance of different RL algorithms in terms of
sample efficiency, reward accumulation, and hyperparameter sensitivity.
• Pygame Tutorial: A short tutorial on using Pygame to create custom RL environments, helping
students understand environment modeling from scratch.
The goal of this assignment is not only to implement RL models but also to develop an intuition for how
different RL methods perform under varying conditions. By the end of this homework, students should be
able to:
• Design and implement RL environments using OpenAI Gym/Gymnasium and Pygame.
• Train RL agents using stable-baselines3 algorithms.
• Compare RL algorithms based on efficiency and performance metrics.
• Understand the impact of environment design on learning outcomes.
We hope this assignment enhances your understanding of RL and encourages further exploration into deep
reinforcement learning.

Grading
The grading will be based on the following criteria, with a total of 100 points:

Task                                            Points
Task 1: Solving Predefined Environments         45
Task 2: Creating Custom Environments            45
Clarity and Quality of Code                     5
Clarity and Quality of Report                   5
Bonus 1: Writing a wrapper for a known env      10
Bonus 2: Implementing Pygame env                20
Bonus 3: Writing your report in LaTeX           10

Notes:
• Include well-commented code and relevant plots in your notebook.
• Clearly present all comparisons and analyses in your report.
• Ensure reproducibility by specifying all dependencies and configurations.

Acknowledgement
We would like to thank Negin Hashemi and Alireza Nobakht from the QA team for their valuable feedback
on this homework.

Submission
The deadline for this homework is 1403/11/28 (February 16th 2025) at 11:59 PM.
Please submit your work by following the instructions below:
• Place your solution alongside the Jupyter notebook(s).
– Your written solution must be a single PDF file named HW1_Solution.pdf.
– If there is more than one Jupyter notebook, put them in a folder named Notebooks.
• Zip all the files together with the following naming format:
DRL_HW1_[StudentNumber]_[FullName].zip

– Replace [FullName] and [StudentNumber] with your full name and student number,
respectively. Your [FullName] must be in CamelCase with no spaces.
• Submit the zip file through Quera in the appropriate section.
• A LaTeX template is provided for writing your homework solution. There is a 10-point bonus
for writing your solution in LaTeX using this template and including your LaTeX source code in your
submission, named HW1_Solution.zip.
• If you have any questions about this homework, please ask them in the Homework section of our
Telegram Group.
• If you use any references, consult anyone, or use AI tools while writing your answers, please mention
them in the appropriate section. In general, by registering for this course you agree to adhere to all
of the course rules.
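The archive naming rule above can be checked with a small helper. This is only a sketch: the function name `archive_name` and the sample student number and name are hypothetical, and it assumes the name parts are separated by spaces.

```python
def archive_name(student_number: str, full_name: str) -> str:
    """Build DRL_HW1_[StudentNumber]_[FullName].zip with FullName in CamelCase."""
    # Capitalize the first letter of each space-separated part and join without spaces.
    camel = "".join(part[:1].upper() + part[1:] for part in full_name.split())
    return f"DRL_HW1_{student_number}_{camel}.zip"


# Hypothetical student number and name, for illustration only.
print(archive_name("401234567", "sara ahmadi"))  # DRL_HW1_401234567_SaraAhmadi.zip
```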

Keep up the great work and best of luck with your submission!
Contents
1 Setup Instructions
1.1 Environment Setup
1.2 Submission Requirements

2 Problem Descriptions
2.1 Task 1: Solving Predefined Environments
2.1.1 Instructions
2.2 Task 2: Creating Custom Environments
2.2.1 Instructions
2.3 Task 3: Pygame for RL environment (Bonus)
2.3.1 Instructions

1 Setup Instructions
Before starting this assignment, ensure that your environment is correctly set up with the required libraries
and dependencies. The practical component of this homework will be completed in the provided Jupyter
Notebook (HW1_Notebook.ipynb), which is attached along with this document.

1.1 Environment Setup

You may use Google Colab if you do not want to set up a local environment. If you work locally, also
upload the notebook to Colab and confirm that it runs there. To run the provided notebook and complete
the assignment, follow these steps:
1. Install Python and Required Libraries: Ensure that you have Python 3.8+ installed. We recommend
using a virtual environment for package management.
python -m venv rl_homework_env
source rl_homework_env/bin/activate # On Linux/macOS
rl_homework_env\Scripts\activate # On Windows

Next, install the required dependencies:

pip install -r requirements.txt

The provided requirements.txt file contains all necessary packages, including:

• gymnasium (for RL environments)
• stable-baselines3 (for RL algorithms)
• pygame (for environment customization)
• matplotlib, seaborn (for visualization)
• numpy, pandas (for data processing)
• jupyterlab or notebook (for running Jupyter notebooks)
2. Download and Open the Notebook: Navigate to the directory where you extracted the homework
files and launch the Jupyter Notebook:
jupyter notebook HW1_Notebook.ipynb

3. Verify Installations: Run the first few cells in the notebook to ensure that all dependencies are
correctly installed.
4. Additional Notes:
• If you face issues with Gymnasium environments, ensure that the necessary dependencies (such
as pygame and mujoco for specific environments) are installed.
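Before opening the notebook, a quick check like the following can confirm that the listed packages are visible to the active Python environment. It is a minimal sketch using only the standard library; the list `REQUIRED` and the helper `missing_packages` are names chosen here for illustration, and the check only tests that each package can be located, not that its version matches requirements.txt.

```python
from importlib.util import find_spec

# Import names for the packages listed above
# (stable-baselines3 is imported as stable_baselines3).
REQUIRED = ["gymnasium", "stable_baselines3", "pygame", "matplotlib",
            "seaborn", "numpy", "pandas"]


def missing_packages(names):
    """Return the names that cannot be found in the active environment."""
    return [name for name in names if find_spec(name) is None]


missing = missing_packages(REQUIRED)
print("All packages found." if not missing else "Missing: " + ", ".join(missing))
```

If anything is reported missing, re-run the pip install step inside the activated virtual environment.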


1.2 Submission Requirements

• Ensure that your Jupyter Notebook runs without errors before submission.
• Include all code, outputs, and plots within the notebook.
• Submit a ZIP file containing:
– The completed HW1_Notebook.ipynb.
– Any additional scripts used for custom environments.
If you encounter any issues, please reach out on the course forum or contact the teaching assistants.


2 Problem Descriptions
This homework consists of multiple tasks that will guide you through designing, implementing, and
analyzing reinforcement learning environments and algorithms. You will work with both predefined
environments and custom ones, applying different RL algorithms from stable-baselines3 and comparing
their performance.

2.1 Task 1: Solving Predefined Environments

Objective: Solve two predefined reinforcement learning environments using stable-baselines3 algorithms.

2.1.1 Instructions:
• Choose two environments from the following options:
1. CartPole - https://gymnasium.farama.org/environments/classic_control/cart_pole/
2. Frozen Lake - https://gymnasium.farama.org/environments/toy_text/frozen_lake/
3. Flappy Bird (Custom Environment) - No official Gymnasium link
4. Taxi - https://gymnasium.farama.org/environments/toy_text/taxi/
• Implement and train agents using at least two RL algorithms (e.g., PPO, DQN).
• Write a wrapper that modifies the reward function, and plot how the change affects training. (Bonus)
• Record and analyze:
– Learning curves (reward over episodes)
– Sample efficiency (training time and required episodes)
– Algorithm hyperparameters and their effect
• Visualize and compare the results.
• Write down your thoughts on how well supervised learning (SL) suits problems like these. Suggest
the kinds of problems in which SL fails or is impractical, and explain why. (Bonus)
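For the reward-wrapper bonus item in the list above, the idea can be sketched without any dependencies. Gymnasium provides a `gymnasium.RewardWrapper` base class whose `reward` method can be overridden, and a real solution would most likely subclass it. The version below is a plain-Python stand-in that only assumes the wrapped environment follows Gymnasium's `step` convention of returning `(obs, reward, terminated, truncated, info)`. The class names, the `scale` and `step_penalty` parameters, and the toy environment are all hypothetical.

```python
class ScaledRewardWrapper:
    """Wrap an env whose step() returns (obs, reward, terminated, truncated, info)."""

    def __init__(self, env, scale=1.0, step_penalty=0.0):
        self.env = env
        self.scale = scale
        self.step_penalty = step_penalty
        self.raw_rewards = []     # rewards as emitted by the wrapped env
        self.shaped_rewards = []  # rewards after shaping, kept for plotting

    def reset(self, **kwargs):
        return self.env.reset(**kwargs)

    def step(self, action):
        obs, reward, terminated, truncated, info = self.env.step(action)
        shaped = self.scale * reward - self.step_penalty
        self.raw_rewards.append(reward)
        self.shaped_rewards.append(shaped)
        return obs, shaped, terminated, truncated, info


class ConstantRewardEnv:
    """Hypothetical stand-in env: reward 1.0 per step, terminates after 3 steps."""

    def __init__(self):
        self.t = 0

    def reset(self, **kwargs):
        self.t = 0
        return 0, {}

    def step(self, action):
        self.t += 1
        return self.t, 1.0, self.t >= 3, False, {}


env = ScaledRewardWrapper(ConstantRewardEnv(), scale=2.0, step_penalty=0.5)
env.reset()
done = False
while not done:
    _, _, terminated, truncated, _ = env.step(0)
    done = terminated or truncated
print(env.raw_rewards, env.shaped_rewards)  # [1.0, 1.0, 1.0] [1.5, 1.5, 1.5]
```

Keeping both the raw and shaped rewards makes it straightforward to plot the two series side by side and discuss how the shaping changes what the agent is optimizing.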
Deliverables:
• Code implementation in the attached Jupyter Notebook.
• Graphs comparing the performance of different algorithms.
• A brief discussion of the findings in the report.
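When preparing the comparison graphs, raw per-episode rewards are usually noisy, so a smoothed curve and a simple sample-efficiency number are easier to compare across algorithms. The sketch below shows one possible way to compute both. The helper names, the window size, and the threshold are assumptions made for illustration, not requirements of the assignment.

```python
def moving_average(values, window):
    """Mean of every run of `window` consecutive values (len(values) - window + 1 outputs)."""
    if window < 1 or window > len(values):
        raise ValueError("window must be between 1 and len(values)")
    running = sum(values[:window])
    averages = [running / window]
    for i in range(window, len(values)):
        # Slide the window: add the new value, drop the oldest one.
        running += values[i] - values[i - window]
        averages.append(running / window)
    return averages


def episodes_to_threshold(values, window, threshold):
    """1-based episode at which the moving average first reaches `threshold`, else None."""
    for i, avg in enumerate(moving_average(values, window)):
        if avg >= threshold:
            return i + window
    return None


# Made-up episode returns, for illustration only.
episode_returns = [10, 20, 30, 40, 50, 60]
print(moving_average(episode_returns, 3))             # [20.0, 30.0, 40.0, 50.0]
print(episodes_to_threshold(episode_returns, 3, 40))  # 5
```

Plotting the smoothed series for each algorithm on shared axes, and reporting the episode count at which each one crosses the same threshold, gives a like-for-like comparison.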


2.2 Task 2: Creating Custom Environments

Objective: Design and implement custom reinforcement learning environments with Gymnasium.

2.2.1 Instructions:
• In this question, you are required to model a custom 4×4 grid-world problem as a Markov Decision
Process (MDP). You must define the following components:
– State Space (S): The set of all possible states the agent can be in.
– Action Space (A): The set of all possible actions the agent can take.
– Reward Function (R): The reward the agent receives for taking an action in a given state.
– Transition Probability (P ): The probability of transitioning to a new state given the current
state and action. If the environment is deterministic, this can be omitted.
• Train agents using at least one algorithm and evaluate performance.
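The four MDP components listed above can be made concrete with a small deterministic example. This is a sketch under stated assumptions rather than the expected solution: the start and goal cells, the reward values, the step limit, and the class name are all hypothetical. It follows the shape of Gymnasium's `reset`/`step` interface but does not subclass `gymnasium.Env`; to train with stable-baselines3, the environment would need to subclass `gymnasium.Env` and declare `observation_space` and `action_space` (for instance with `gymnasium.spaces.Discrete`).

```python
class GridWorld4x4:
    """Deterministic 4x4 grid world. States S = {0..15}, numbered row by row."""

    SIZE = 4
    # Action space A: 0 = up, 1 = down, 2 = left, 3 = right (row delta, column delta).
    ACTIONS = {0: (-1, 0), 1: (1, 0), 2: (0, -1), 3: (0, 1)}
    START, GOAL = 0, 15  # assumed start and goal cells

    def __init__(self, step_reward=-1.0, goal_reward=10.0, max_steps=50):
        self.step_reward = step_reward
        self.goal_reward = goal_reward
        self.max_steps = max_steps
        self.state = self.START
        self.steps = 0

    def transition(self, state, action):
        """P is deterministic: each (state, action) pair has exactly one next state."""
        row, col = divmod(state, self.SIZE)
        d_row, d_col = self.ACTIONS[action]
        # A move that would leave the grid keeps the agent in place.
        row = min(max(row + d_row, 0), self.SIZE - 1)
        col = min(max(col + d_col, 0), self.SIZE - 1)
        return row * self.SIZE + col

    def reward(self, state, action, next_state):
        """R: a bonus for reaching the goal, a small cost for every other step."""
        return self.goal_reward if next_state == self.GOAL else self.step_reward

    def reset(self):
        self.state, self.steps = self.START, 0
        return self.state, {}

    def step(self, action):
        next_state = self.transition(self.state, action)
        reward = self.reward(self.state, action, next_state)
        self.state = next_state
        self.steps += 1
        terminated = next_state == self.GOAL
        truncated = not terminated and self.steps >= self.max_steps
        return next_state, reward, terminated, truncated, {}


env = GridWorld4x4()
state, _ = env.reset()
total = 0.0
for action in [3, 3, 3, 1, 1, 1]:  # right three times, then down three times
    state, reward, terminated, truncated, _ = env.step(action)
    total += reward
print(state, total, terminated)  # 15 5.0 True
```

Separating `transition` and `reward` from `step` keeps P and R explicit, which makes it easier to report them in the write-up and to modify one without touching the other.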
Deliverables:
• Code for the environment.
• Training results, learning curves, and observations.
• A short discussion on implementation and how modifications affect learning.


2.3 Task 3: Pygame for RL environment (Bonus)

Objective: Learn to use Pygame for RL environment customization.

2.3.1 Instructions:
• Complete a step-by-step Pygame tutorial provided in the notebook.
• Extend the example by adding new:
– Obstacles
– Rewards
– Action mechanics
• Experiment with different agent interactions.
Deliverables:
• Updated Pygame environment.
• Screenshots of modifications and explanations.


