Reinforcement Learning
Business
Algorithms, Applications, and Relevance in Consumer Markets
What is Reinforcement Learning?
Definition: A type of machine learning where an agent learns to make
decisions by performing actions in an environment to maximize
cumulative reward.
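One common way to make "cumulative reward" precise is the discounted return. The notation below is an assumption added for illustration and does not appear in the slides: R_t is the reward received at step t, and γ (gamma) is a discount factor between 0 and 1 that controls how much future rewards count relative to immediate ones.

```latex
G_t = R_{t+1} + \gamma R_{t+2} + \gamma^2 R_{t+3} + \dots
    = \sum_{k=0}^{\infty} \gamma^k R_{t+k+1},
\qquad 0 \le \gamma \le 1
```

With γ = 1 and a finite episode this reduces to a plain sum of rewards; with γ < 1, rewards that arrive later contribute less.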
Key Components:
Agent: The learner or decision maker.
Environment: What the agent interacts with.
Actions: What the agent can do.
State: The current situation of the agent.
Reward: Feedback from the environment.
Comparison:
Feature                  | Supervised Learning                  | Reinforcement Learning
Learning Goal            | Map inputs to outputs                | Maximize cumulative reward
Training Data            | Labeled input-output pairs           | State, action, reward, next state
Feedback                 | Direct and immediate                 | Delayed and based on actions
Environment Interaction  | No interaction, static data          | Continuous interaction with environment
Examples of Applications | Image classification, spam detection | Game playing, robot navigation
Key Takeaway:
• Supervised Learning: Best for problems with clear, labeled data where the goal is to predict outputs.
• Reinforcement Learning: Suitable for scenarios involving sequential decision-making where the goal is to learn optimal strategies through trial and error.
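To make the "Training Data" row of the comparison concrete, here is a minimal Python sketch of what a single training record might look like in each setting. The class names, field names, and values are hypothetical and chosen only for illustration.

```python
from typing import NamedTuple


class LabeledExample(NamedTuple):
    """Supervised learning: one input paired with its known correct output."""
    features: list[float]
    label: str


class Transition(NamedTuple):
    """Reinforcement learning: one step of interaction with the environment."""
    state: int
    action: str
    reward: float
    next_state: int


# Hypothetical values, for illustration only.
spam_example = LabeledExample(features=[0.9, 0.1, 3.0], label="spam")
robot_step = Transition(state=4, action="forward", reward=-1.0, next_state=5)
```

The supervised record already contains the right answer (the label). The reinforcement learning record only says what happened after one action, so the agent has to work out from many such records which actions lead to high cumulative reward.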
How Reinforcement Learning Works
Process:
1. Agent observes the current state.
2. Agent takes an action based on a policy.
3. Environment responds with the next state and a reward.
4. Agent updates its policy based on the reward.
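The process above can be sketched as a short Python loop. The slides do not name a specific algorithm, so this example assumes tabular Q-learning with an epsilon-greedy policy, which is one common choice among many. The toy environment (a five-cell corridor with a reward at the right end), the function names, and the parameter values are all hypothetical.

```python
import random

N_STATES = 5        # corridor cells 0..4; cell 4 is the goal (hypothetical toy task)
ACTIONS = [-1, +1]  # step left or step right


def step(state, action):
    """Environment: return (next_state, reward, done) for the chosen action."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def choose_action(q, state, epsilon, rng):
    """Policy: usually pick the best-known action, sometimes explore at random."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    best = max(q[state].values())
    return rng.choice([a for a in ACTIONS if q[state][a] == best])


def train(episodes=200, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Run the observe/act/respond/update loop and return the learned Q-table."""
    rng = random.Random(seed)
    q = {s: {a: 0.0 for a in ACTIONS} for s in range(N_STATES)}
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Agent observes the current state and picks an action from its policy.
            action = choose_action(q, state, epsilon, rng)
            # Environment responds with the next state and a reward.
            next_state, reward, done = step(state, action)
            # Agent updates its value estimates, and so the policy derived from them.
            future = 0.0 if done else gamma * max(q[next_state].values())
            q[state][action] += alpha * (reward + future - q[state][action])
            state = next_state
    return q


q_table = train()
# Greedy policy for the non-goal cells: the action with the highest learned value.
greedy_policy = {s: max(ACTIONS, key=lambda a: q_table[s][a])
                 for s in range(N_STATES - 1)}
```

After training, `greedy_policy` should map every non-goal cell to `+1` (step right), since that is the shortest route to the only reward. Note that the update here adjusts value estimates rather than the policy directly; the policy changes because it is derived from those estimates.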