0% found this document useful (0 votes)

29 views32 pages

Q-Learning in Reinforcement Learning

Reinforcement learning addresses how autonomous agents can learn optimal actions to achieve goals by interacting with an environment. Q-learning is a reinforcement learning method where an agent learns a Q-function that evaluates each state-action pair to estimate the maximum reward achievable. The agent takes actions in an environment, observes the results and rewards, and updates the Q-values for each state-action pair to learn an optimal policy that maximizes long-term rewards.

Uploaded by

Lahari bilimale

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

29 views32 pages

Q-Learning in Reinforcement Learning

Uploaded by

Lahari bilimale

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Module-V Chapter 8

Reinforcement Learning
By
Pramod Kumar PM
Vivekananda College of Engineering
Technology, Puttur
Module 5 - Outline

Chapter 13: Reinforcement Learning

1. Introduction
2. The Learning Task
3. Q Learning
4. Summary

2
Introduction
▪ Reinforcement learning addresses the question of
• how an autonomous agent that senses and
• acts in its environment
• can learn to choose optimal actions to achieve its goals.

▪ Applications
• learning to control a mobile robot
• learning to optimize operations in factories
• learning to play board games.

3
Introduction

4
Introduction
▪ Consider building a learning robot called as agent.
▪ It has
• a set of sensors to observe the state of its environment
Ex: Camera, Sonar
• a set of actions it can perform to alter this state
Ex: “Move forward”, “Turn Right”
▪ Its task is to learn a control strategy, or policy, for choosing
actions that achieve its goals.
▪ For example, the robot may have a goal of docking onto its
battery charger whenever its battery level is low.

5
Introduction
▪ The goals of the agent can be defined by a reward function
▪ Reward function assigns a numerical value - an immediate
payoff -to each distinct action the agent may take from each
distinct state.
▪ For example, the goal of docking to the battery charger can
be captured by
• assigning a positive reward (e.g., +100) to state-action
transitions that immediately result in a connection to the
charger and
• a reward of zero to every other state-action transition.

6
Introduction
▪ This reward function
• may be built into the robot, or
• known only to an external teacher who provides the
reward value for each action performed by the robot.

▪ The task of the robot is to perform sequences of actions,

observe their consequences, and learn a control policy.

▪ The control policy we desire is one that, from any initial state,
chooses actions that maximize the reward accumulated over
time by the agent.

7
Robot learning

8
Introduction

9
Introduction

10
Module 5 - Outline

Chapter 13: Reinforcement Learning

1. Introduction
2. The Learning Task
3. Q Learning
4. Summary

11
The Learning Task

12
Source: Wikipedia

13
The Learning Task

14
The Learning Task

15
Illustrative Example

16
Illustrative
Example

17
Module 5 - Outline

Chapter 13: Reinforcement Learning

1. Introduction
2. The Learning Task
3. Q Learning
4. Summary

18
Q Learning

19
Q Learning

20
Q Learning

21
Q learning

22
Q Learning Algorithm

23
Q Learning Algorithm

24
Illustrative Example

25
Relationship to Dynamic
Programming

28
Relationship to Dynamic
Programming

29
Module 5 - Outline

Chapter 13: Reinforcement Learning

1. Introduction
2. The Learning Task
3. Q Learning
4. Summary

30
Summary
▪ Reinforcement learning
• Learning control strategies for autonomous agents.
• It assumes that training information is available in the form
of a real-valued reward signal given for each state-action
transition.
• The goal of the agent is to learn an action policy that
maximizes the total reward it will receive from any starting
state.

31
Summary
▪ The reinforcement learning algorithms addressed in this
chapter fit a problem setting known as a Markov decision
process.
▪ In Markov decision processes, the outcome of applying any
action to any state depends only on this action and state (and
not on preceding actions or states).
▪ Markov decision processes cover a wide range of problems
including many robot control, factory automation, and
scheduling problems.

32
Summary
▪ Q learning is one form of reinforcement learning in which the
agent learns an evaluation function over states and actions.

▪ Evaluation function Q(s, a) is defined as the

• maximum expected, discounted, cumulative reward
• the agent can achieve by applying action a to state s.

▪ Advantage - it can-be employed even when the learner has

no prior knowledge of how its actions affect its environment.

33
Thank You

Instance-Based & Reinforcement Learning
No ratings yet
Instance-Based & Reinforcement Learning
41 pages
Reinforcement Learning & Genetic Algorithms
No ratings yet
Reinforcement Learning & Genetic Algorithms
62 pages
Types of Reinforcement Explained
No ratings yet
Types of Reinforcement Explained
12 pages
Overview of Reinforcement Learning
No ratings yet
Overview of Reinforcement Learning
21 pages
MLT U-5 ONE SHOT Noes
No ratings yet
MLT U-5 ONE SHOT Noes
30 pages
MLT U-5 ONE SHOT Notes
No ratings yet
MLT U-5 ONE SHOT Notes
42 pages
Reinforcement Learning Basics Explained
No ratings yet
Reinforcement Learning Basics Explained
13 pages
Unit 05 Machine Learning
No ratings yet
Unit 05 Machine Learning
21 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
6 pages
Fundamentals of Reinforcement Learning
No ratings yet
Fundamentals of Reinforcement Learning
33 pages
ML UNIT 6 Notes
No ratings yet
ML UNIT 6 Notes
11 pages
Machine Learning Unit 5
No ratings yet
Machine Learning Unit 5
8 pages
Reinforcement Learning Concepts Explained
No ratings yet
Reinforcement Learning Concepts Explained
29 pages
Reinforcement Learning Fundamentals
No ratings yet
Reinforcement Learning Fundamentals
28 pages
Reinforcement Learning & MDP Overview
No ratings yet
Reinforcement Learning & MDP Overview
19 pages
Understanding Reinforcement Learning Concepts
No ratings yet
Understanding Reinforcement Learning Concepts
38 pages
Reinforcement Learning Overview at IIITM
No ratings yet
Reinforcement Learning Overview at IIITM
64 pages
Introduction to Reinforcement Learning
No ratings yet
Introduction to Reinforcement Learning
18 pages
Understanding Reinforcement Learning Basics
No ratings yet
Understanding Reinforcement Learning Basics
5 pages
MLT PPTUnit 5
No ratings yet
MLT PPTUnit 5
51 pages
Reinforcement Learning Techniques Explained
No ratings yet
Reinforcement Learning Techniques Explained
15 pages
Introduction to Reinforcement Learning
100% (1)
Introduction to Reinforcement Learning
64 pages
Reinforcement Learning Overview
No ratings yet
Reinforcement Learning Overview
13 pages
Reinforcement Learning Overview and Q-Learning
No ratings yet
Reinforcement Learning Overview and Q-Learning
6 pages
Reinforcement Learning Fundamentals Guide
No ratings yet
Reinforcement Learning Fundamentals Guide
72 pages
Reinforcement Learning Overview
No ratings yet
Reinforcement Learning Overview
161 pages
Introduction to Reinforcement Learning
No ratings yet
Introduction to Reinforcement Learning
34 pages
Understanding Reinforcement Learning Concepts
No ratings yet
Understanding Reinforcement Learning Concepts
19 pages
Understanding Reinforcement Learning Basics
No ratings yet
Understanding Reinforcement Learning Basics
15 pages
TAIC Tutorial RL
No ratings yet
TAIC Tutorial RL
44 pages
Unit 5 - ML
No ratings yet
Unit 5 - ML
38 pages
Introduction to Reinforcement Learning
No ratings yet
Introduction to Reinforcement Learning
22 pages
Reinforcement Learning for Pathfinding
No ratings yet
Reinforcement Learning for Pathfinding
11 pages
Understanding Reinforcement Learning Basics
No ratings yet
Understanding Reinforcement Learning Basics
40 pages
Enhanced Q-Learning with Relative Rewards
No ratings yet
Enhanced Q-Learning with Relative Rewards
5 pages
Reinforcement Learning Lecture Notes
No ratings yet
Reinforcement Learning Lecture Notes
6 pages
DM Assignment Final
No ratings yet
DM Assignment Final
32 pages
Reinforcement Learning Overview and Applications
100% (1)
Reinforcement Learning Overview and Applications
25 pages
Reinforcement Learning Explained
No ratings yet
Reinforcement Learning Explained
22 pages
Overview of Reinforcement Learning
No ratings yet
Overview of Reinforcement Learning
26 pages
Understanding Reinforcement Learning Basics
No ratings yet
Understanding Reinforcement Learning Basics
33 pages
Decision Tree and Reinforcement Learning Guide
No ratings yet
Decision Tree and Reinforcement Learning Guide
60 pages
Introduction to Reinforcement Learning
No ratings yet
Introduction to Reinforcement Learning
24 pages
Reinforcement Learning Overview
No ratings yet
Reinforcement Learning Overview
14 pages
Types of Machine Learning: Reinforcement
No ratings yet
Types of Machine Learning: Reinforcement
8 pages
Understanding Reinforcement Learning Concepts
No ratings yet
Understanding Reinforcement Learning Concepts
34 pages
Reinforcement Learning Overview
No ratings yet
Reinforcement Learning Overview
8 pages
Reinforcement Learning Overview
100% (2)
Reinforcement Learning Overview
61 pages
Introduction to Reinforcement Learning
No ratings yet
Introduction to Reinforcement Learning
8 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
25 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
16 pages
Understanding Reinforcement Learning Concepts
No ratings yet
Understanding Reinforcement Learning Concepts
12 pages
Reinforcement Learning Fundamentals
No ratings yet
Reinforcement Learning Fundamentals
72 pages
Unit-Iv PP
No ratings yet
Unit-Iv PP
19 pages
Reinforcement Learning Explained
No ratings yet
Reinforcement Learning Explained
14 pages
Overview of Hadoop Ecosystem Components
No ratings yet
Overview of Hadoop Ecosystem Components
126 pages
Instance-Based Learning Overview
No ratings yet
Instance-Based Learning Overview
37 pages
Concept Learning in Machine Learning
100% (1)
Concept Learning in Machine Learning
16 pages
Candidate Elimination Algorithm Explained
No ratings yet
Candidate Elimination Algorithm Explained
10 pages
Sahel Drought and Burkina Faso's Revolution
No ratings yet
Sahel Drought and Burkina Faso's Revolution
28 pages
Forensic Spoken Portrait Techniques
No ratings yet
Forensic Spoken Portrait Techniques
6 pages
Cargo Manifest for TSS Pearl Voyage
No ratings yet
Cargo Manifest for TSS Pearl Voyage
1 page
Spatial Statistics Analysis in R
No ratings yet
Spatial Statistics Analysis in R
29 pages
Crane Beam Design Analysis
100% (2)
Crane Beam Design Analysis
7 pages
30-Day Obesity Diet Plan for Women
No ratings yet
30-Day Obesity Diet Plan for Women
5 pages
MOOC Aquaponics Datasheet
No ratings yet
MOOC Aquaponics Datasheet
2 pages
Factor Quadratic and Line Intersection
No ratings yet
Factor Quadratic and Line Intersection
4 pages
RMR Modular Enclosure Specifications
No ratings yet
RMR Modular Enclosure Specifications
8 pages
Health Facility Preparedness SOP
No ratings yet
Health Facility Preparedness SOP
9 pages
Understanding Bromatology and Food Value
No ratings yet
Understanding Bromatology and Food Value
26 pages
eBay's Path to Perfect Competition
No ratings yet
eBay's Path to Perfect Competition
6 pages
Thermal Recovery Methods in EOR Analysis
No ratings yet
Thermal Recovery Methods in EOR Analysis
35 pages
D1 Teaching Plan for Software Testing
No ratings yet
D1 Teaching Plan for Software Testing
3 pages
Packet Abis and A Over IP: Soc Classification Level 1 © Nokia Siemens Networks Presentation / Author / Date
No ratings yet
Packet Abis and A Over IP: Soc Classification Level 1 © Nokia Siemens Networks Presentation / Author / Date
86 pages
Cell Biology Study Guide Overview
No ratings yet
Cell Biology Study Guide Overview
75 pages
Web-Based Campus Event Management System
No ratings yet
Web-Based Campus Event Management System
6 pages
Test Bank for Communicating as Professionals
No ratings yet
Test Bank for Communicating as Professionals
14 pages
PHYS 3041 Homework Assignment 1
No ratings yet
PHYS 3041 Homework Assignment 1
3 pages
Hybrid Home Gym Optional Leg Press (SXT-LP) Owner's Manual
No ratings yet
Hybrid Home Gym Optional Leg Press (SXT-LP) Owner's Manual
36 pages
Grade 1 Mother Tongue Curriculum Standards
100% (1)
Grade 1 Mother Tongue Curriculum Standards
8 pages
NS-2 Simulation Tutorial Guide
No ratings yet
NS-2 Simulation Tutorial Guide
3 pages
AVH-P4400BH AVH-P3400BH AVH-P2400BT AVH-P1400DVD: Owner's Manual
No ratings yet
AVH-P4400BH AVH-P3400BH AVH-P2400BT AVH-P1400DVD: Owner's Manual
112 pages
Chest Physiotherapy Procedure Overview
100% (1)
Chest Physiotherapy Procedure Overview
10 pages
Norwegian Petroleum Well Control Risks
No ratings yet
Norwegian Petroleum Well Control Risks
47 pages
Understanding Unemployment Types and Causes
No ratings yet
Understanding Unemployment Types and Causes
17 pages
KNX USB Interface User Manual
No ratings yet
KNX USB Interface User Manual
12 pages
Overview of Polymer Additives and Applications
No ratings yet
Overview of Polymer Additives and Applications
37 pages
Types of Diodes and Their Applications
No ratings yet
Types of Diodes and Their Applications
26 pages
Steel Chimney Types and Design Guide
100% (3)
Steel Chimney Types and Design Guide
6 pages