Welcome to Scribd!

Reinforcement Learning

Uploaded by

0% found this document useful (0 votes)

67 views2 pages

Reinforcement learning is a machine learning method where an agent learns from interactions with an environment by receiving rewards or penalties. The agent learns to maximize rewards by trying different actions and seeing what rewards result. It differs from supervised learning which provides examples of correct outputs. Reinforcement learning is used in many fields and helps agents learn complex tasks like playing Pac-Man by trying actions and receiving numeric rewards to guide learning towards completing levels.

Original Description:

rein

Copyright

Available Formats

DOCX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as docx, pdf, or txt

0% found this document useful (0 votes)

67 views2 pages

Reinforcement Learning

Uploaded by

John Vincent D. Reyes

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as docx, pdf, or txt

Jump to Page

You are on page 1of 2

Search inside document

DEFINITION

reinforcement learning


Reinforcement learning is a training method based on rewarding desired behaviors

and/or punishing undesired ones. The learning method has been adopted in artificial
intelligence (AI) as a method of directing unsupervised machine learning through
rewards and penalties. Reinforcement learning is used in operations
research, information theory, game theory, control theory, simulation-based
optimization, multi-agent systems, swarm intelligence, statistics and
genetic algorithms.

Where supervised learning algorithms are typically trained with a body of known
correct answers, an agent learning by reinforcement is not. A reinforcement
learning agent learns from the environment where it performs its task. First, a method
of rewarding desired behaviors and punishing negative behaviors is devised. Positive
values are assigned to desired behaviors to provide positive reinforcement and
negative values to undesired behaviors for negative reinforcement.

The agent is programmed to seek long-term and maximum overall reward to achieve
an optimal solution. Long-term goals help prevent the agent from stalling on lesser
goals while avoiding risk. Also of note is the addition of mechanisms to encourage
exploration. Markov decision processes are sometimes used in exploration decisions
where an agent might ignore a reward in order to explore; to that end, developers
might add an effect, like curiosity, that aids in making discoveries.

A learning algorithm playing Pac Man might have the ability to move in one of four
possible directions, barring obstruction. From pixel data an agent might be given a
numeric reward for the result of a unit of travel: 0 for empty space, 1 for pellets, 2 for
fruit, 3 for a power pellet, 4 for a ghost post-power pellet, 5 for collecting all pellets
and completing a level but being deducted 5 points for collision with a ghost. The
agent starts from randomized play to sophisticated, learning the goal of getting all
pellets to complete the level. Given time, an agent might even learn tactics like
conserving power pellets till needed for self-defense.

Because it’s based on an understanding of biological systems, reinforcement learning

is a part of bio-inspired computing. As a psychological principle, reinforcement
learning hails from the school of behavioral psychology.

SCSA3015 Deep Learning Unit 1 Notes PDF
Document30 pages
SCSA3015 Deep Learning Unit 1 Notes PDF
pooja vikirthini
No ratings yet
Ai PPT New
Document14 pages
Ai PPT New
ks8408783
No ratings yet
Unit 1 Notes
Document29 pages
Unit 1 Notes
sakthiasphaltalpha
No ratings yet
Reinforcement 2
Document2 pages
Reinforcement 2
Kelechi
No ratings yet
Machine Learning Approachs (AI)
Document11 pages
Machine Learning Approachs (AI)
Abhishek Gupta
100% (1)
4 Through Reiterative Optimization
Document2 pages
4 Through Reiterative Optimization
Kevin Varela
No ratings yet
Learning
Document18 pages
Learning
Ankitha Singh
No ratings yet
Data Science Solutions IA 2
Document16 pages
Data Science Solutions IA 2
Monesh Rallapalli
No ratings yet
Intermediate AI Prompting – Reinforcement Learning
From Everand
Intermediate AI Prompting – Reinforcement Learning
Eric Centore
No ratings yet
Unit 4 Machine Learning Tools, Techniques and Applications
Document78 pages
Unit 4 Machine Learning Tools, Techniques and Applications
Jyothi Pulikanti
No ratings yet
What Are The Components of A Deep Learning Network?
Document2 pages
What Are The Components of A Deep Learning Network?
amruthabharga
No ratings yet
AIML Module - 03 21CS4
Document34 pages
AIML Module - 03 21CS4
sakshiam12
No ratings yet
Lect03-Unsupervised Learning
Document12 pages
Lect03-Unsupervised Learning
shehzad shafique51
No ratings yet
AI Research Paper
Document14 pages
AI Research Paper
fojoves160
No ratings yet
Learning Agent
Document6 pages
Learning Agent
DIMPAL KUMARI
No ratings yet
Ds d80 Diy Solution v1 7sn U5mfvwr PDF
Document4 pages
Ds d80 Diy Solution v1 7sn U5mfvwr PDF
Sudharshan Venkatesh
No ratings yet
Reinforcement Learning
Document23 pages
Reinforcement Learning
Rajachandra Voodiga
No ratings yet
Unit - 4 Theory Assignment
Document4 pages
Unit - 4 Theory Assignment
Abhinav Arora
No ratings yet
Some Machine Learning Methods
Document2 pages
Some Machine Learning Methods
Seva Javadzada
No ratings yet
AI Assignment 2
Document5 pages
AI Assignment 2
Abraham Onyedikachi Ogudu
No ratings yet
AIUnit II
Document47 pages
AIUnit II
omkar dhumal
No ratings yet
Unit V Reinforcement Learning and Genetic Algorithm
Document40 pages
Unit V Reinforcement Learning and Genetic Algorithm
anshikay2609
No ratings yet
Unit-1 (Introduction To Machine Learning)
Document4 pages
Unit-1 (Introduction To Machine Learning)
Rudraksh sah
No ratings yet
ML Assignment 1 PDF
Document6 pages
ML Assignment 1 PDF
Anubhav Monga
No ratings yet
ETE Ans
Document73 pages
ETE Ans
Vennapusa Narsamma
No ratings yet
Chapter 1 Introduction To Machine Learning
Document29 pages
Chapter 1 Introduction To Machine Learning
Kenneth Kibet Ngeno
No ratings yet
Unit 5 ML 3year
Document17 pages
Unit 5 ML 3year
ISHAN SRIVASTAVA
No ratings yet
Machine-Learning AI
Document8 pages
Machine-Learning AI
lukegr.dev
No ratings yet
UNIT-5 ML Part1-1
Document59 pages
UNIT-5 ML Part1-1
Shashank Sharma
No ratings yet
AI Module 4
Document38 pages
AI Module 4
FaReeD Hamza
No ratings yet
Machine Learning
Document103 pages
Machine Learning
sp1135220
No ratings yet
Machine Learning
Document29 pages
Machine Learning
Cubic Section
No ratings yet
4 Through Reiterative Optimisation
Document2 pages
4 Through Reiterative Optimisation
Kevin Varela
No ratings yet
Machine Learning - Data
Document11 pages
Machine Learning - Data
Adeeba Iram
No ratings yet
Introduction To Machine Learning: Methods, Applications, Etc
Document15 pages
Introduction To Machine Learning: Methods, Applications, Etc
Pamina Gorospe
No ratings yet
Chapter Five
Document10 pages
Chapter Five
junedijoasli
No ratings yet
Dimpal
Document6 pages
Dimpal
DIMPAL KUMARI
No ratings yet
Unit 5-1
Document8 pages
Unit 5-1
Neeraj Singh Bora
No ratings yet
6CS4 AI Unit-4
Document129 pages
6CS4 AI Unit-4
Nikhil Kumar
No ratings yet
Supervised Machine Learning
Document3 pages
Supervised Machine Learning
Benedict Rosales
No ratings yet
4 - Agents and Learning 31-10-2016
Document16 pages
4 - Agents and Learning 31-10-2016
abdulazizmoosa93
No ratings yet
Machine Learning With Python Programming: - Presentation by Uplatz - Contact Us: - Email: - Phone
Document36 pages
Machine Learning With Python Programming: - Presentation by Uplatz - Contact Us: - Email: - Phone
sdbfhvsdfhvsdhvds
No ratings yet
Introduction To Machine Learning: Methods, Applications, Etc
Document15 pages
Introduction To Machine Learning: Methods, Applications, Etc
Pamina Gorospe
No ratings yet
4 in The Numerical
Document2 pages
4 in The Numerical
kevincoccrr
No ratings yet
Prescriptive Analytics - Research Material: Data Analysis
Document12 pages
Prescriptive Analytics - Research Material: Data Analysis
api-592322284
No ratings yet
Answer No 04: Supervised Machine Learning
Document2 pages
Answer No 04: Supervised Machine Learning
Ayesha
No ratings yet
There Are Key Areas in The Process of Machine Learning, Like
Document45 pages
There Are Key Areas in The Process of Machine Learning, Like
Yashi
No ratings yet
Machine Learning (ML) Techniques
Document14 pages
Machine Learning (ML) Techniques
Mukund Tiwari
No ratings yet
Reinforcement Learning
Document5 pages
Reinforcement Learning
supriya
No ratings yet
Unit 01 Introduction REINFORCEMENT LEARNING
Document4 pages
Unit 01 Introduction REINFORCEMENT LEARNING
PratapePrasad
No ratings yet
Module 3 - AIML
Document134 pages
Module 3 - AIML
harinisk32
No ratings yet
Basics of Machine Learning
Document22 pages
Basics of Machine Learning
Nikhil Pandey
No ratings yet
Mod3 - Learning Theory
Document10 pages
Mod3 - Learning Theory
Knightfury Milan
No ratings yet
Icba04 Sun
Document12 pages
Icba04 Sun
atirina
No ratings yet
Ai RL
Document3 pages
Ai RL
ks.umashanker
No ratings yet
Unit 4
Document8 pages
Unit 4
vvvcxzzz3754
No ratings yet
Machine Learning
Document5 pages
Machine Learning
Malik Muhammad Arslan Alam Awan
No ratings yet
MLT Unit 1
Document15 pages
MLT Unit 1
sahil.utube2003
No ratings yet
Marzano's New Taxonomy: The Three Systems and Knowledge Self-System
Document5 pages
Marzano's New Taxonomy: The Three Systems and Knowledge Self-System
Paola Zerezas
No ratings yet
Types of Data:: Reference Website
Document15 pages
Types of Data:: Reference Website
Vidath Kuna
No ratings yet
Rubric For Cooking Presentation
Document3 pages
Rubric For Cooking Presentation
John Vincent D. Reyes
No ratings yet
All USB Drivers: Navigation
Document27 pages
All USB Drivers: Navigation
John Vincent D. Reyes
No ratings yet
Module. Phil History 3 5 Weeks
Document4 pages
Module. Phil History 3 5 Weeks
John Vincent D. Reyes
No ratings yet
Characteristics of Active Leadership (Trespeces, 2003)
Document3 pages
Characteristics of Active Leadership (Trespeces, 2003)
John Vincent D. Reyes
No ratings yet
Good Morning Sophomores!
Document50 pages
Good Morning Sophomores!
John Vincent D. Reyes
No ratings yet
Bert MAjor Final Pest of Crop
Document21 pages
Bert MAjor Final Pest of Crop
John Vincent D. Reyes
No ratings yet
Health7 - Q2 - Mod5 Layout v1.0
Document24 pages
Health7 - Q2 - Mod5 Layout v1.0
John Vincent D. Reyes
100% (1)
Department of Education: Randolph B. Tortola
Document1 page
Department of Education: Randolph B. Tortola
John Vincent D. Reyes
100% (1)
AGRI CROP 7&8 Module 4-1
Document38 pages
AGRI CROP 7&8 Module 4-1
John Vincent D. Reyes
100% (20)
Ref
Document4 pages
Ref
John Vincent D. Reyes
No ratings yet
Radio Waves and Our Environment 2009
Document12 pages
Radio Waves and Our Environment 2009
John Vincent D. Reyes
No ratings yet
Beneficial Effects of Volcanic Eruption
Document1 page
Beneficial Effects of Volcanic Eruption
John Vincent D. Reyes
No ratings yet
Questioning and Discussion Techniques
Document10 pages
Questioning and Discussion Techniques
John Vincent D. Reyes
No ratings yet
Blind Men
Document1 page
Blind Men
John Vincent D. Reyes
No ratings yet
Camping Activities
Document1 page
Camping Activities
John Vincent D. Reyes
No ratings yet
Returnlabel: To Lazada Warehouse: From
Document1 page
Returnlabel: To Lazada Warehouse: From
John Vincent D. Reyes
No ratings yet
PMMA Psychological Evaluation Announcemen1
Document2 pages
PMMA Psychological Evaluation Announcemen1
John Vincent D. Reyes
100% (1)