Final
BY
ENGINEERING PHYSICS-II STUDENTS
This is to certify that the mini project presented in this report, on the topic “Mastering the
Art of Reinforcement Learning”, was carried out by us under the supervision of Azazul
Haque Sir.
Roll No. Name
52. SHINDE SWAYAM VITTHAL
53. SINGH MOHAN PAWAN
54. SONI VINEET
55. SONIGRA DISHA BHAVESH
Understanding Reinforcement Learning
Drawbacks-
Q-Learning can be slow, often needing many iterations to find the
optimal policy. It faces the exploration-exploitation dilemma and,
because it relies on a table, does not work well with continuous
state or action spaces.
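The difficulty with continuous spaces can be illustrated with simple arithmetic. The sketch below assumes a continuous state is forced into a table by splitting each state dimension into a fixed number of bins. The function name and the example numbers (4 state dimensions, 2 actions) are hypothetical and chosen only for illustration.

```python
def q_table_entries(bins_per_dimension, n_state_dimensions, n_actions):
    """Number of Q-values a table needs after discretizing each state dimension."""
    return (bins_per_dimension ** n_state_dimensions) * n_actions


# Hypothetical example: 4 continuous state dimensions, 2 actions.
for bins in (10, 100):
    print(bins, q_table_entries(bins, n_state_dimensions=4, n_actions=2))
# 10 bins per dimension  -> 20000 entries
# 100 bins per dimension -> 200000000 entries
```

A finer discretization quickly makes the table too large to fill by trial and error, which is one reason tabular Q-Learning is usually limited to small, discrete problems.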
Q-Learning is a popular RL algorithm used to learn optimal policies in unknown
environments. It uses a Q-table to store an estimate of the expected cumulative reward for each
state-action pair. Through an iterative process of exploration and exploitation, Q-Learning
updates these Q-values until they converge, after which the agent can act by choosing the
highest-valued action in each state.
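The process described above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation. The environment (a five-cell corridor with a reward of 1 at the right end), the function names, and the hyperparameters (alpha, gamma, epsilon) are all assumptions made for this example. The update used is the standard Q-Learning rule: Q(s, a) <- Q(s, a) + alpha * [r + gamma * max Q(s', a') - Q(s, a)].

```python
import random

N_STATES = 5          # corridor cells 0..4; cell 4 is the goal (terminal)
ACTIONS = (0, 1)      # 0 = move left, 1 = move right


def step(state, action):
    """Deterministic toy environment: returns (next_state, reward, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def choose_action(q_row, epsilon, rng):
    """Epsilon-greedy: explore with probability epsilon, otherwise exploit."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    best = max(q_row)
    # Break ties at random so the untrained agent does not always pick "left".
    return rng.choice([a for a in ACTIONS if q_row[a] == best])


def train_q_learning(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-Learning on the toy corridor; returns the learned Q-table."""
    rng = random.Random(seed)
    q_table = [[0.0 for _ in ACTIONS] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            action = choose_action(q_table[state], epsilon, rng)
            next_state, reward, done = step(state, action)
            # Bootstrapped target: no future value beyond the terminal state.
            target = reward + (0.0 if done else gamma * max(q_table[next_state]))
            q_table[state][action] += alpha * (target - q_table[state][action])
            state = next_state
    return q_table


if __name__ == "__main__":
    q = train_q_learning()
    # Greedy action per non-terminal state after training.
    policy = [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]
    print(policy)
```

After training, the greedy policy should select action 1 (move right) in every non-terminal cell, since that is the shortest path to the reward. The epsilon parameter controls the exploration-exploitation trade-off: a larger value means more random moves, a smaller value means more reliance on the current Q-table.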
Deep Reinforcement Learning (DRL) combines RL with deep neural networks, enabling the
agent to learn directly from raw sensory input such as image pixels. DRL has achieved notable
results in complex tasks such as playing Atari games, and it is being applied to autonomous
driving. Because a neural network can generalize across similar inputs, DRL extends RL to
problems whose state spaces are too large for a Q-table.
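The core idea, replacing the Q-table with a network that maps a state to one Q-value per action, can be sketched as follows. This is a deliberately small illustration under stated assumptions: the class name TinyQNetwork, the single tanh hidden layer, and the learning rate are hypothetical. It is not deep, and it omits components that practical DRL systems commonly use, such as experience replay and a separate target network.

```python
import math
import random


class TinyQNetwork:
    """One-hidden-layer network mapping a state vector to one Q-value per action."""

    def __init__(self, n_inputs, n_hidden, n_actions, seed=0):
        rng = random.Random(seed)
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_inputs)]
                   for _ in range(n_hidden)]
        self.b1 = [0.0] * n_hidden
        self.w2 = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
                   for _ in range(n_actions)]
        self.b2 = [0.0] * n_actions

    def forward(self, state):
        """Return (hidden activations, Q-values) for a state vector."""
        hidden = [math.tanh(sum(w * x for w, x in zip(row, state)) + b)
                  for row, b in zip(self.w1, self.b1)]
        q_values = [sum(w * h for w, h in zip(row, hidden)) + b
                    for row, b in zip(self.w2, self.b2)]
        return hidden, q_values

    def q_values(self, state):
        return self.forward(state)[1]

    def td_update(self, state, action, reward, next_state, done,
                  gamma=0.99, lr=0.05):
        """One semi-gradient TD step; returns the TD error before the update."""
        # The target is treated as a constant (no gradient flows through it).
        target = reward if done else reward + gamma * max(self.q_values(next_state))
        hidden, q = self.forward(state)
        error = target - q[action]
        for j, h in enumerate(hidden):
            # dQ/d(pre-activation of hidden unit j), using the old output weight.
            grad_hidden = self.w2[action][j] * (1.0 - h * h)
            for i, x in enumerate(state):
                self.w1[j][i] += lr * error * grad_hidden * x
            self.b1[j] += lr * error * grad_hidden
            self.w2[action][j] += lr * error * h
        self.b2[action] += lr * error
        return error


if __name__ == "__main__":
    net = TinyQNetwork(n_inputs=2, n_hidden=4, n_actions=2)
    state = [0.5, -0.2]
    before = net.td_update(state, action=1, reward=1.0,
                           next_state=[0.0, 0.0], done=True)
    after = 1.0 - net.q_values(state)[1]
    print(abs(before), abs(after))  # the error should shrink after the update
```

Unlike a table lookup, each update here also shifts the Q-values of nearby states, which is what lets a network generalize to inputs it has never seen.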
Applications of Reinforcement Learning