Aditya Kumar Singh - A2305220463 NTCC Term Paper
Term Paper
On
Automation with Reinforcement Learning
Submitted to
Amity School of Engineering and Technology
Guided By:
Dr. Achyut Shankar
Assistant Professor
Department of Computer Science and Engineering
Submitted By:
Aditya Kumar Singh
Enrollment No: A2305220463
B.TECH CSE
The author attests that permission has been obtained for the use
of any copyrighted material appearing in the project report other than
brief excerpts requiring only proper acknowledgment in
scholarly writing, and that all such use is acknowledged.
Signature:
Place: Noida
Date: 28/7/21
Name: Aditya Kumar Singh
Certificate by the Faculty Guide
S. No Topic
1. What is Machine Learning and Reinforcement Learning?
6. CONCLUSION
7. REFERENCE
What is Machine Learning:
Supervised learning:
Supervised learning uses labeled datasets to train an algorithm so that it can
classify data or predict an outcome accurately. The dataset consists of inputs
paired with their correct outputs, which allows the model to learn over time.
Supervised learning addresses two types of problems:
Classification: It is used to sort data accurately into specific categories.
Regression: It is used to predict a continuous numerical outcome from the input data.
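The two problem types can be illustrated with a minimal sketch in plain Python. The datasets, the function names (nearest_neighbor_classify, fit_line), and the choice of a nearest-neighbor classifier and a least-squares line are assumptions made for illustration; they are not methods prescribed by this paper.

```python
def nearest_neighbor_classify(train, query):
    """Classification: return the label of the training input closest to query."""
    closest = min(train, key=lambda pair: abs(pair[0] - query))
    return closest[1]


def fit_line(xs, ys):
    """Regression: least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    slope = cov / var
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Labeled data (made up): hours studied -> exam result.
labeled = [(1.0, "fail"), (2.0, "fail"), (6.0, "pass"), (8.0, "pass")]
predicted_label = nearest_neighbor_classify(labeled, 7.0)

# Labeled data (made up): points lying on the line y = 2x + 1.
slope, intercept = fit_line([0.0, 1.0, 2.0, 3.0], [1.0, 3.0, 5.0, 7.0])
predicted_value = slope * 4.0 + intercept
```

In both cases the model is given inputs together with their correct outputs; the classifier returns a category, while the fitted line returns a number.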
Unsupervised Learning:
Unsupervised learning uses an artificial intelligence (AI) algorithm to
identify patterns in a dataset whose data points are neither classified
nor labeled. The algorithm discovers hidden patterns in the data
without any human intervention.
It uses clustering to group unlabeled data based on similarities and
differences.
Some of the algorithms and approaches used are:
Exclusive Clustering: In this type of clustering, each data point
can belong to only one group.
Hierarchical Clustering: It is an unsupervised clustering
algorithm that can be either agglomerative (bottom-up) or divisive (top-down).
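As a rough illustration of exclusive clustering, the sketch below uses k-means, one common exclusive method. The paper does not name a specific algorithm, so this choice, the sample points, the starting centroids, and the helper names (sq_dist, kmeans) are all assumptions.

```python
def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, centroids, iterations=10):
    """Exclusive clustering: every point is assigned to exactly one centroid."""
    assignment = []
    for _ in range(iterations):
        # Assignment step: index of the nearest centroid for each point.
        assignment = [
            min(range(len(centroids)), key=lambda c: sq_dist(p, centroids[c]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for c in range(len(centroids)):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:
                new_centroids.append(
                    tuple(sum(axis) / len(members) for axis in zip(*members))
                )
            else:
                new_centroids.append(centroids[c])  # keep an empty cluster's centroid
        centroids = new_centroids
    return assignment, centroids


# Unlabeled points (made up) forming two visibly separate groups.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 0.0), (8.0, 8.0), (9.0, 9.5), (8.5, 9.0)]
assignment, centroids = kmeans(points, [points[0], points[3]])
```

No labels are supplied; the grouping comes only from the distances between points, and each point ends up in exactly one cluster.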
Reinforcement Learning:
In reinforcement learning, a machine learning model is trained to
make a series of decisions. The aim is to train the model to achieve
a certain goal in an uncertain, complex situation.
It differs from both supervised learning and unsupervised learning
because the model is not trained on a sample dataset.
Instead, it uses trial and error to reach the most favorable outcome.
This has made reinforcement learning one of the most effective ways to
hint at machine creativity.
An example of reinforcement learning is YouTube. After watching a
video on YouTube, the user sees similar titles that they may like.
Suppose the user starts a recommended video but ends it halfway. The
program then understands that the recommendation was not a good one and
will try another approach.
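The trial-and-error idea can be illustrated with a small epsilon-greedy sketch. It is a toy model only: the three video categories, their watch-through probabilities, and the function name run_recommender are assumptions made for illustration, and it does not describe how YouTube's recommender actually works.

```python
import random


def run_recommender(watch_prob, rounds=5000, epsilon=0.1, seed=0):
    """Learn which category to recommend from watch / no-watch feedback."""
    rng = random.Random(seed)
    n = len(watch_prob)
    counts = [0] * n      # how often each category was recommended
    values = [0.0] * n    # running average reward per category
    for _ in range(rounds):
        if rng.random() < epsilon:
            choice = rng.randrange(n)                         # explore: try something else
        else:
            choice = max(range(n), key=lambda i: values[i])   # exploit: best so far
        # Reward 1 if the simulated user watches to the end, 0 if they stop early.
        reward = 1.0 if rng.random() < watch_prob[choice] else 0.0
        counts[choice] += 1
        values[choice] += (reward - values[choice]) / counts[choice]
    return values, counts


# Hypothetical probabilities that a user finishes a video from each category.
values, counts = run_recommender([0.2, 0.5, 0.9])
best_category = max(range(len(values)), key=lambda i: values[i])
```

A video abandoned halfway yields a reward of 0, which lowers the estimate for that category and makes the program more likely to recommend something else next time.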
Reinforcement learning v/s Supervised learning
Q-Learning:
Q-Learning is a model-free reinforcement learning algorithm.
Its objective is to learn a policy that tells the system which
action to take in each state to obtain the maximum total reward.
It is based on the Bellman equation.
Consider a function Q(s,a), where "a" represents an action
and "s" represents a particular state.
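For reference, the usual textbook form of the Q-learning update is given below. The symbols other than Q, s and a are standard notation rather than notation from this paper: r is the reward received, s' is the next state, a' ranges over the actions available in s', alpha is the learning rate and gamma is the discount factor.

```latex
Q(s,a) \leftarrow Q(s,a) + \alpha \left[ r + \gamma \max_{a'} Q(s',a') - Q(s,a) \right]
```

The bracketed term is the difference between the Bellman target, r + gamma * max Q(s',a'), and the current estimate Q(s,a).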
Working of Q-Learning:
o Initialize the Q-table
o Select an action (a) to perform
o Perform the selected action
o Observe the outcome (the reward and the new state)
o Finally, update the Q-table
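The steps above can be sketched on a toy problem. Everything specific here is an assumption chosen for illustration: a five-state corridor with the goal at the right end, a reward of 1 for reaching it, epsilon-greedy action selection, and the parameter values alpha, gamma and epsilon.

```python
import random

N_STATES = 5            # states 0..4 on a line; state 4 is the goal
GOAL = N_STATES - 1
LEFT, RIGHT = 0, 1


def step(state, action):
    """Environment: move one cell; reward 1.0 only when the goal is reached."""
    next_state = max(0, state - 1) if action == LEFT else state + 1
    done = next_state == GOAL
    return next_state, (1.0 if done else 0.0), done


def choose_action(q_row, epsilon, rng):
    """Epsilon-greedy selection with random tie-breaking."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))
    best = max(q_row)
    return rng.choice([a for a, q in enumerate(q_row) if q == best])


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]              # initialize the Q-table
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            action = choose_action(q[state], epsilon, rng)  # select an action
            next_state, reward, done = step(state, action)  # perform it
            # Observe the outcome: best value reachable from the next state.
            best_next = 0.0 if done else max(q[next_state])
            # Update the Q-table toward reward + gamma * best_next.
            q[state][action] += alpha * (reward + gamma * best_next - q[state][action])
            state = next_state
    return q


q_table = train()
# Greedy policy read off the learned table for the non-goal states.
policy = [max((LEFT, RIGHT), key=lambda a: q_table[s][a]) for s in range(GOAL)]
```

After training, the greedy policy chooses RIGHT in every non-goal state, because moving toward the goal has the higher learned Q-value.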
State Action Reward State action (SARSA):
1. javatpoint.com
4. Wikipedia
5. synopsys.com