Introduction to Reinforcement Learning

Reinforcement Learning (RL) is a machine learning method where an agent learns to make decisions through interactions with its environment, receiving feedback in the form of rewards or penalties. Key components of RL include the agent, environment, actions, states, rewards, and policies, with applications in areas like gaming and self-driving cars. The Markov Decision Process (MDP) formalizes RL problems, and Q-learning is a widely used algorithm for learning optimal policies.

Uploaded by

jayanthp1

Introduction to Reinforcement Learning

Presenter: Sepideh Nikookar
Advisor: Prof. Senjuti Basu Roy

Department of Computer Science

NJIT

Dec 1, 2022
What is Reinforcement Learning?

• Reinforcement learning is a type of machine learning in which an intelligent agent interacts with an environment and learns how to act within it.

• For each good action, the agent receives positive feedback (a reward); for each bad action, it receives negative feedback (a penalty).

• RL addresses problems where decision making is sequential and the goal is long-term.

• The agent repeatedly does the following three things:

• takes an action,
• changes state or remains in the same state,
• gets feedback.

By repeating these steps, the agent learns about and explores the environment.
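The three-step loop above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `CorridorEnv`, its reward values, and `run_episode` are hypothetical names invented here, and the agent acts at random rather than learning.

```python
import random


class CorridorEnv:
    """Hypothetical toy environment: positions 0..4, with the goal at position 4."""

    def __init__(self, length=5):
        self.length = length
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        # action is -1 (left) or +1 (right); the walls keep the agent in place
        self.state = min(max(self.state + action, 0), self.length - 1)
        done = self.state == self.length - 1
        # positive feedback at the goal, a small penalty for every other step
        reward = 1.0 if done else -0.1
        return self.state, reward, done


def run_episode(env, rng, max_steps=100):
    """Run one episode with a purely random agent (no learning yet)."""
    state = env.reset()
    total_reward = 0.0
    steps = 0
    done = False
    while not done and steps < max_steps:
        action = rng.choice([-1, 1])            # 1. take an action
        state, reward, done = env.step(action)  # 2. change state or stay put
        total_reward += reward                  # 3. get feedback
        steps += 1
    return state, total_reward, steps


if __name__ == "__main__":
    print(run_episode(CorridorEnv(), random.Random(0)))
```

A learning agent would replace the random choice with a policy that is improved from the feedback it collects.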

Reinforcement Learning Example

• An example of a state is your cat sitting; you then say a specific word to cue the cat to walk.
• Your cat reacts by performing an action, transitioning from one “state” (sitting) to another “state” (walking).
• The cat's reaction is an action, and the policy is the method of selecting an action given a state, in expectation of better outcomes.
• After the transition, the cat may receive a reward or a penalty in return.
Reinforcement Learning Applications

[Figure: applications of reinforcement learning]
Supervised vs. Unsupervised vs. Reinforcement Learning

Criteria | Supervised ML | Unsupervised ML | Reinforcement ML
Definition | Learns by using labelled data | Trained using unlabelled data without any guidance | Works on interacting with the environment
Type of problems | Regression and classification | Association and clustering | Exploitation or exploration
Algorithms | Linear Regression, Logistic Regression, SVM, KNN, etc. | K-Means, C-Means, Apriori | Q-Learning, SARSA
Aim | Calculate outcomes | Discover underlying patterns | Learn a series of actions
Application | Risk Evaluation, Forecast Sales | Recommendation System, Anomaly Detection | Self Driving Cars, Gaming, Healthcare
Terms Used in Reinforcement Learning

Term | Definition
Agent | An entity that can perceive/explore the environment and act upon it.
Environment | The situation in which an agent is present or by which it is surrounded.
Action | Actions are the moves taken by an agent within the environment.
State | The situation returned by the environment after each action taken by the agent.
Reward | Feedback returned to the agent from the environment to evaluate the agent's action.
Policy | The strategy the agent applies to choose the next action based on the current state.
Value | The expected long-term return, weighted by the discount factor, as opposed to the short-term reward.
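To make the Value entry concrete, here is a minimal sketch of a discounted return, the quantity whose expectation the value measures. The function name `discounted_return` and the sample rewards are illustrative assumptions, not part of the original slides.

```python
def discounted_return(rewards, gamma):
    """Sum of rewards where the k-th future reward is weighted by gamma**k."""
    g = 0.0
    # Work backwards so each step applies one more factor of gamma
    for r in reversed(rewards):
        g = r + gamma * g
    return g


if __name__ == "__main__":
    # Three rewards of 1 with gamma = 0.5: 1 + 0.5 + 0.25
    print(discounted_return([1.0, 1.0, 1.0], 0.5))  # 1.75
```

With a discount factor close to 0 the agent values mostly the immediate reward; with a factor close to 1, distant rewards count almost as much as immediate ones.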
Elements of Reinforcement Learning
There are four main elements of Reinforcement Learning, which are given below.

1. Policy:
A policy defines how an agent behaves at a given time. It maps the perceived states of the environment to the actions to be taken in those states.

The policy-based approach has two main types of policy:

• Deterministic: For a given state, the policy (π) always produces the same action.
• Stochastic: For a given state, the policy produces an action according to a probability distribution.

2. Reward Signal: The goal of RL is defined by the reward signal. Rewards are given according to the good and bad actions taken by the learning agent. The main objective is to maximize the total reward the agent accumulates.

3. Value Function: Gives information about how good a situation and an action are, and how much reward an agent can expect.

4. Model: Mimics the behavior of the environment.
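The difference between the two policy types can be sketched as follows. The states and actions reuse the cat example from earlier; the specific mappings and probabilities are hypothetical values chosen only for illustration.

```python
import random

# Deterministic policy: each state maps to exactly one action (hypothetical values)
deterministic_policy = {"sitting": "walk", "walking": "sit"}

# Stochastic policy: each state maps to a probability distribution over actions
stochastic_policy = {
    "sitting": {"walk": 0.8, "sit": 0.2},
    "walking": {"walk": 0.5, "sit": 0.5},
}


def act_deterministic(state):
    """Always returns the same action for a given state."""
    return deterministic_policy[state]


def act_stochastic(state, rng):
    """Samples an action according to the state's action probabilities."""
    actions = list(stochastic_policy[state])
    weights = [stochastic_policy[state][a] for a in actions]
    return rng.choices(actions, weights=weights, k=1)[0]


if __name__ == "__main__":
    rng = random.Random(0)
    print(act_deterministic("sitting"))
    print([act_stochastic("sitting", rng) for _ in range(5)])
```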

Reinforcement Learning Categories

• Value Based
  • No Policy
  • Value Function

• Policy Based
  • Policy
  • No Value Function

• Actor Critic
  • Policy
  • Value Function

• Model Free
  • Policy and/or Value Function
  • No Model

• Model Based
  • Policy and/or Value Function
  • Model
State Representation

We can represent the agent state using a Markov state, which contains all the required information from the history. A state $S_t$ is a Markov state if it satisfies the following condition:

$$P[S_{t+1} \mid S_t] = P[S_{t+1} \mid S_1, \dots, S_t]$$

The Markov state follows the Markov property, which says that, given the present, the future is independent of the past.

RL in this setting assumes a fully observable environment, where the agent can observe the environment's state and act to reach a new state. The complete process is known as a Markov Decision Process.
Markov Decision Process

• A Markov Decision Process, or MDP, is used to formalize reinforcement learning problems.

An MDP is a tuple of four elements, $(S, A, P_a, R_a)$:

• A finite set of states $S$
• A finite set of actions $A$
• Rewards $R_a(s, s')$ received after transitioning from state $s$ to state $s'$ due to action $a$
• Transition probabilities $P_a(s, s')$ of moving from state $s$ to state $s'$ under action $a$
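One possible way to hold the four MDP elements in code is plain dictionaries, as in the sketch below. The two-state example, its probabilities, its rewards, and the helper names `reward` and `sample_transition` are all assumptions made for illustration.

```python
import random

# Hypothetical two-state MDP
STATES = ["sitting", "walking"]   # finite set of states
ACTIONS = ["stay", "move"]        # finite set of actions

# Transition probabilities: P[(s, a)] maps each next state s' to its probability
P = {
    ("sitting", "stay"): {"sitting": 1.0},
    ("sitting", "move"): {"walking": 0.9, "sitting": 0.1},
    ("walking", "stay"): {"walking": 1.0},
    ("walking", "move"): {"sitting": 0.9, "walking": 0.1},
}

# Rewards: R[(s, a, s')]; transitions not listed give a reward of 0
R = {("sitting", "move", "walking"): 1.0}


def reward(s, a, s_next):
    return R.get((s, a, s_next), 0.0)


def sample_transition(s, a, rng):
    """Draw the next state from P[(s, a)] and return it with its reward."""
    next_states = list(P[(s, a)])
    probs = [P[(s, a)][n] for n in next_states]
    s_next = rng.choices(next_states, weights=probs, k=1)[0]
    return s_next, reward(s, a, s_next)


if __name__ == "__main__":
    print(sample_transition("sitting", "move", random.Random(0)))
```

Because the next state is drawn using only the current state and action, this representation respects the Markov property by construction.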
Reinforcement Learning Algorithms
• RL algorithms are used mainly in AI and gaming applications. The most widely used algorithm is:

Q-Learning: Q-learning is a popular model-free reinforcement learning algorithm based on the Bellman equation.

The main objective of Q-learning is to learn a policy that tells the agent which action to take under which circumstances in order to maximize the reward.

It is an off-policy RL algorithm that attempts to find the best action to take in the current state.

$$Q^{new}(s_t, a_t) = Q(s_t, a_t) + \alpha \times \left(r_t + \gamma \times \max_{a} Q(s_{t+1}, a) - Q(s_t, a_t)\right)$$
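The update rule above can be turned into a small tabular Q-learning sketch. Everything beyond the update itself is an assumption made for illustration: the five-position corridor, its rewards, the epsilon-greedy action choice, and the hyperparameter values are not taken from the slides or the accompanying notebook.

```python
import random
from collections import defaultdict

ACTIONS = [-1, 1]   # hypothetical actions: move left or right
GOAL = 4            # hypothetical corridor with positions 0..4


def step(state, action):
    """Toy environment: +1.0 on reaching the goal, -0.1 for any other move."""
    next_state = min(max(state + action, 0), GOAL)
    done = next_state == GOAL
    return next_state, (1.0 if done else -0.1), done


def q_update(Q, s, a, r, s_next, done, alpha, gamma):
    """Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))."""
    # A terminal state has no future value
    best_next = 0.0 if done else max(Q[(s_next, b)] for b in ACTIONS)
    Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])


def train(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    rng = random.Random(seed)
    Q = defaultdict(float)  # every Q-value starts at 0
    for _ in range(episodes):
        s, done = 0, False
        while not done:
            # Epsilon-greedy: explore with probability epsilon, otherwise exploit
            if rng.random() < epsilon:
                a = rng.choice(ACTIONS)
            else:
                a = max(ACTIONS, key=lambda b: Q[(s, b)])
            s_next, r, done = step(s, a)
            q_update(Q, s, a, r, s_next, done, alpha, gamma)
            s = s_next
    return Q


if __name__ == "__main__":
    Q = train()
    print({s: max(ACTIONS, key=lambda b: Q[(s, b)]) for s in range(GOAL)})
```

The update is off-policy in the sense described above: the target uses the maximum over next actions, regardless of which action the epsilon-greedy behaviour actually takes next.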

Please open your Jupyter Notebook for the hands-on experience.
