The document discusses reinforcement learning and describes active reinforcement learning. It notes that with active reinforcement learning, the learner makes choices and must balance exploration versus exploitation. The learner actually takes actions in the world and learns from the outcomes rather than doing offline planning. One method described is Q-learning, where the learner uses sample experiences to iteratively update and improve its estimates of the Q-values as it interacts with the environment. The document emphasizes that with active reinforcement learning, the learner has to explore enough while eventually focusing more on exploitation, and that in the limit, provided exploration is sufficient and the learning rate is decayed appropriately, the exact method for choosing actions does not affect convergence.
• Jana Purukan
• Rico Suot

Outline
1 Supervised Learning and Unsupervised Learning
2 Passive Reinforcement Learning
3 Temporal Difference Learning
4 Active Reinforcement Learning

Active Reinforcement Learning
– You don’t know the transitions T(s,a,s’)
– You don’t know the rewards R(s,a,s’)
– You choose the actions now
– Goal: learn the optimal policy / values
– Learner makes choices!
– Fundamental tradeoff: exploration vs. exploitation
– This is NOT offline planning! You actually take actions in the world and find out what happens…
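One common way to handle the exploration vs. exploitation tradeoff above is ε-greedy action selection. A minimal sketch (not from the slides), assuming a tabular Q-table stored as a dict keyed by (state, action):

```python
import random

def epsilon_greedy(Q, state, actions, epsilon=0.1):
    """With probability epsilon, pick a random action (explore);
    otherwise pick the action with the highest Q-value (exploit).
    Unseen (state, action) pairs default to a Q-value of 0.0."""
    if random.random() < epsilon:
        return random.choice(actions)
    return max(actions, key=lambda a: Q.get((state, a), 0.0))
```

Decaying epsilon over time shifts the learner from mostly exploring early on to mostly exploiting later, matching the "explore enough, then exploit" idea.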
Q-Learning: sample-based Q-value iteration
Learn Q(s,a) values as you go
– Receive a sample (s,a,s’,r)
– Consider your old estimate: Q(s,a)
– Consider your new sample estimate: sample = r + γ max_a’ Q(s’,a’)
– Incorporate the new estimate into a running average: Q(s,a) ← (1 − α) Q(s,a) + α [sample]
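The running-average update above can be sketched in code. This is an illustration, not the slides' implementation; it assumes a tabular Q-table stored as a dict, with alpha (learning rate) and gamma (discount) as parameter names:

```python
def q_learning_update(Q, s, a, s_prime, r, actions, alpha=0.5, gamma=0.9):
    """One Q-learning step on a sample (s, a, s', r).

    sample estimate:  r + gamma * max_a' Q(s', a')
    running average:  Q(s,a) <- (1 - alpha) * Q(s,a) + alpha * sample
    Unseen (state, action) pairs default to a Q-value of 0.0.
    """
    sample = r + gamma * max(Q.get((s_prime, ap), 0.0) for ap in actions)
    Q[(s, a)] = (1 - alpha) * Q.get((s, a), 0.0) + alpha * sample
    return Q[(s, a)]
```

Repeating this update on samples gathered while acting is what makes Q-learning "learn as you go": no model of T or R is ever built.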
Q-learning is off-policy learning: it converges to the optimal Q-values even while actions are chosen by an exploratory policy. Caveats:
– You have to explore enough
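These convergence caveats can be illustrated numerically. A sketch, not from the slides: the standard (Robbins–Monro) conditions require a learning rate α_t with Σ α_t = ∞ (doesn't shrink too quickly) and Σ α_t² < ∞ (eventually becomes small), and α_t = 1/t satisfies both:

```python
# Compare partial sums of alpha_t and alpha_t**2 for alpha_t = 1/t.
# For the schedule to work, the first sum must diverge while the
# second stays bounded (here it is bounded by pi**2 / 6 ~ 1.645).
def partial_sums(alpha, n):
    """Return (sum of alpha(t), sum of alpha(t)**2) for t = 1..n."""
    s1 = sum(alpha(t) for t in range(1, n + 1))
    s2 = sum(alpha(t) ** 2 for t in range(1, n + 1))
    return s1, s2

s1, s2 = partial_sums(lambda t: 1.0 / t, 100_000)
# s1 keeps growing without bound; s2 stays below pi**2 / 6
```

A constant α never becomes small enough, while α_t = 1/t² shrinks so fast that even Σ α_t stays bounded, so learning stops too early.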
– You have to eventually make the learning rate small enough
– … but not decrease it too quickly
– Basically, in the limit, it doesn’t matter how you select actions (!)

THANK YOU! Any Questions?