VI Sem Machine Learning CS 601 PDF

Lab Work Machine Learning CS VI Sem
1. Study of Decision Trees for Classification: A Machine Learning Algorithm
2. Demonstrate the FIND-S algorithm for finding the most specific hypothesis based on a given
set of training data samples.
3. Implementation of Single layer Perceptron Learning Algorithm.
4. Implementation of unsupervised learning algorithm – Hebbian Learning
5. Study of Backpropagation – Algorithm For Training A Neural Network
6. Study of Naïve Bayes Algorithm
Experiment No. 1
Study of Decision Trees for Classification: A Machine Learning Algorithm
Introduction
Decision Trees are a type of Supervised Machine Learning (that is, the training data states
what the input is and what the corresponding output is) where the data is repeatedly split
according to a certain parameter. The tree can be explained by two entities, namely decision
nodes and leaves. The decision nodes are where the data is split, and the leaves are the
decisions or the final outcomes.
A decision tree can be pictured as a binary tree. Let’s say you want to predict whether a
person is fit given information like age, eating habits, and physical activity. The decision
nodes here are questions like ‘What’s the age?’, ‘Does the person exercise?’, ‘Does the person
eat a lot of pizzas?’. The leaves are outcomes like ‘fit’ or ‘unfit’. In this case this is a
binary classification problem (a yes/no type problem).
There are two main types of Decision Trees:
1. Classification trees (Yes/No types)
What we’ve seen above is an example of classification tree, where the outcome
was a variable like ‘fit’ or ‘unfit’. Here the decision variable is Categorical.
2. Regression trees (Continuous data types)
Here the decision or the outcome variable is Continuous, e.g. a number like
123.
Working
Now that we know what a Decision Tree is, we’ll see how it works internally.
There are many algorithms that construct Decision Trees, but one of the best known is the
ID3 algorithm. ID3 stands for Iterative Dichotomiser 3.
Before discussing the ID3 algorithm, we’ll go through a few definitions.
Entropy
Entropy, also called Shannon entropy and denoted by H(S) for a finite set S, is the measure
of the amount of uncertainty or randomness in data.
Intuitively, it tells us about the predictability of a certain event. For example, consider
a coin toss whose probability of heads is 0.5 and probability of tails is 0.5. Here the
entropy is the highest possible, since there’s no way of determining what the outcome might
be. Alternatively, consider a coin which has heads on both sides. The outcome of such an
event can be predicted perfectly, since we know beforehand that it’ll always be heads. In
other words, this event has no randomness, hence its entropy is zero.
In particular, lower values imply less uncertainty while higher values imply more
uncertainty.
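The two coin examples can be checked numerically. The sketch below assumes the standard Shannon definition, H = -Σ p(x) log2 p(x), measured in bits; the function name `entropy` is chosen here for illustration and does not come from the manual.

```python
import math

def entropy(probabilities):
    """Shannon entropy, in bits, of a discrete distribution."""
    # Outcomes with probability 0 are skipped: they contribute nothing.
    return sum(-p * math.log2(p) for p in probabilities if p > 0)

fair_coin = entropy([0.5, 0.5])        # heads and tails equally likely
two_headed_coin = entropy([1.0, 0.0])  # always heads
print(fair_coin, two_headed_coin)
```

The fair coin gives the largest value possible for two outcomes, and the two-headed coin gives zero, matching the description above.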
Information Gain
Information gain, denoted by IG(S, A) for a set S, is the effective change in entropy after
deciding on a particular attribute A. It is closely related to the Kullback-Leibler
divergence. It measures the relative change in entropy with respect to the independent
variables.
$$IG(S, A) = H(S) - H(S, A)$$
Alternatively,
$$IG(S, A) = H(S) - \sum_{x} P(x) \, H(S_x)$$
where IG(S, A) is the information gain by applying feature A. H(S) is the entropy of the
entire set, while the second term calculates the entropy after applying the feature A, where
P(x) is the probability of event x (the fraction of S in which A takes the value x) and S_x
is the corresponding subset of S.
Let’s understand this with the help of an example.
Consider a piece of data collected over the course of 14 days where the features
are Outlook, Temperature, Humidity, Wind and the outcome variable is whether
Golf was played on the day. Our job is to build a predictive model which
takes in the above 4 parameters and predicts whether Golf will be played on the day.
We’ll build a decision tree to do that using the ID3 algorithm.
Day   Outlook    Temperature   Humidity   Wind     Play Golf
D1    Sunny      Hot           High       Weak     No
D2    Sunny      Hot           High       Strong   No
D3    Overcast   Hot           High       Weak     Yes
D4    Rain       Mild          High       Weak     Yes
D5    Rain       Cool          Normal     Weak     Yes
D6    Rain       Cool          Normal     Strong   No
D7    Overcast   Cool          Normal     Strong   Yes
D8    Sunny      Mild          High       Weak     No
D9    Sunny      Cool          Normal     Weak     Yes
D10   Rain       Mild          Normal     Weak     Yes
D11   Sunny      Mild          Normal     Strong   Yes
D12   Overcast   Mild          High       Strong   Yes
D13   Overcast   Hot           Normal     Weak     Yes
D14   Rain       Mild          High       Strong   No
The ID3 algorithm performs the following tasks recursively:
1. Create a root node for the tree.
2. If all examples are positive, return the leaf node ‘positive’.
3. Else if all examples are negative, return the leaf node ‘negative’.
4. Calculate the entropy of the current state, H(S).
5. For each attribute, calculate the entropy with respect to the attribute ‘x’,
denoted by H(S, x).
6. Select the attribute which has the maximum value of IG(S, x).
7. Remove the attribute that offers the highest IG from the set of attributes.
8. Repeat until we run out of attributes, or the decision tree has all leaf
nodes.
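The steps above can be sketched in plain Python on the 14-day golf table. This is a minimal illustration rather than a definitive ID3 implementation: the names `ROWS`, `information_gain` and `id3` are chosen here, the tree is returned as a nested dictionary, and a majority vote is assumed when the attributes run out before the labels become pure.

```python
import math
from collections import Counter

ATTRIBUTES = ["Outlook", "Temperature", "Humidity", "Wind"]
# Rows D1..D14 from the table; the last field is the "Play Golf" label.
ROWS = [
    ("Sunny", "Hot", "High", "Weak", "No"),
    ("Sunny", "Hot", "High", "Strong", "No"),
    ("Overcast", "Hot", "High", "Weak", "Yes"),
    ("Rain", "Mild", "High", "Weak", "Yes"),
    ("Rain", "Cool", "Normal", "Weak", "Yes"),
    ("Rain", "Cool", "Normal", "Strong", "No"),
    ("Overcast", "Cool", "Normal", "Strong", "Yes"),
    ("Sunny", "Mild", "High", "Weak", "No"),
    ("Sunny", "Cool", "Normal", "Weak", "Yes"),
    ("Rain", "Mild", "Normal", "Weak", "Yes"),
    ("Sunny", "Mild", "Normal", "Strong", "Yes"),
    ("Overcast", "Mild", "High", "Strong", "Yes"),
    ("Overcast", "Hot", "Normal", "Weak", "Yes"),
    ("Rain", "Mild", "High", "Strong", "No"),
]

def entropy(labels):
    """Entropy H(S), in bits, of a list of class labels."""
    total = len(labels)
    return sum(-(n / total) * math.log2(n / total) for n in Counter(labels).values())

def information_gain(rows, col):
    """IG(S, A) = H(S) minus the weighted entropy of the subsets split on column col."""
    remainder = 0.0
    for value in set(r[col] for r in rows):
        subset = [r[-1] for r in rows if r[col] == value]
        remainder += len(subset) / len(rows) * entropy(subset)
    return entropy([r[-1] for r in rows]) - remainder

def id3(rows, cols):
    """Build the tree recursively; leaves are labels, inner nodes are dicts."""
    labels = [r[-1] for r in rows]
    if len(set(labels)) == 1:      # all examples agree: return a leaf
        return labels[0]
    if not cols:                   # assumption: majority vote when attributes run out
        return Counter(labels).most_common(1)[0][0]
    best = max(cols, key=lambda c: information_gain(rows, c))
    remaining = [c for c in cols if c != best]
    branches = {}
    for value in sorted(set(r[best] for r in rows)):
        branches[value] = id3([r for r in rows if r[best] == value], remaining)
    return {ATTRIBUTES[best]: branches}

gains = {ATTRIBUTES[c]: round(information_gain(ROWS, c), 3) for c in range(4)}
tree = id3(ROWS, [0, 1, 2, 3])
print(gains)
print(tree)
```

On this table Outlook has the largest information gain, so it becomes the root; the Overcast branch is already pure and turns into a leaf straight away.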
Experiment No. 2
Demonstrate the FIND-S algorithm for finding the most specific hypothesis based on a given set
of training data samples.
What is Find-S Algorithm in Machine Learning?

In order to understand Find-S algorithm, you need to have a basic idea of the following
concepts as well:
1. Concept Learning
2. General Hypothesis
3. Specific Hypothesis
1. Concept Learning
Let’s try to understand concept learning with a real-life example. Most human learning is
based on past instances or experiences. For example, we are able to identify any type of
vehicle based on a certain set of features like make, model, etc., that are defined over a
large set of features.
These special features differentiate the set of cars, trucks, etc. from the larger set of vehicles.
These features that define the set of cars, trucks, etc. are known as concepts.
Similarly, machines can also learn from concepts to identify whether an object belongs
to a specific category or not. Any algorithm that supports concept learning requires the
following:
• Training Data
• Target Concept
• Actual Data Objects
2. General Hypothesis
A hypothesis, in general, is an explanation for something. The general hypothesis states
the general relationship between the major variables. For example, a general
hypothesis for ordering food would be: I want a burger.
G = {‘?’, ‘?’, ‘?’, …, ‘?’}
Here ‘?’ means that any value of the attribute is acceptable.
3. Specific Hypothesis
The specific hypothesis fills in all the important details about the variables given in the
general hypothesis. A more specific version of the example given above would be: I want
a cheeseburger with a chicken pepperoni filling with a lot of lettuce.
S = {‘Φ’, ‘Φ’, ‘Φ’, …, ‘Φ’}
Here ‘Φ’ means that no value of the attribute is acceptable.
Now, let’s talk about the Find-S algorithm in Machine Learning.
The Find-S algorithm follows the steps written below:
1. Initialize ‘h’ to the most specific hypothesis.
2. The Find-S algorithm only considers the positive examples and ignores negative
examples. For each positive example, the algorithm checks each attribute in the
example. If the attribute value is the same as the hypothesis value, the algorithm
moves on without any changes. But if the attribute value is different from the
hypothesis value, the algorithm changes it to ‘?’.
Now that we are done with the basic explanation of the Find-S algorithm, let us take a look at
how it works.
How Does It Work?
1. The process starts with initializing ‘h’ with the most specific hypothesis; generally, it is
the first positive example in the data set.
2. We check each example. If the example is negative, we move on to
the next example, but if it is a positive example we consider it for the next step.
3. We check if each attribute in the example is equal to the hypothesis value.
4. If the value matches, then no changes are made.
5. If the value does not match, the value is changed to ‘?’.
6. We do this until we reach the last positive example in the data set.
Limitations of Find-S Algorithm

There are a few limitations of the Find-S algorithm listed below:
1. There is no way to determine if the hypothesis is consistent throughout the data.
2. Inconsistent training sets can actually mislead the Find-S algorithm, since it ignores
the negative examples.
3. The Find-S algorithm does not provide a backtracking technique to determine the best
possible changes that could be done to improve the resulting hypothesis.
Now that we are aware of the limitations of the Find-S algorithm, let us take a look at a
practical implementation of the Find-S Algorithm.
Implementation of Find-S Algorithm

To understand the implementation, let us apply it to a small data set with a
few examples to decide whether a person wants to go for a walk.
The concept in this particular problem is: on what days does a person like to go for a
walk?
Time      Weather   Temperature   Company   Humidity   Wind     Goes
Morning   Sunny     Warm          Yes       Mild       Strong   Yes
Evening   Rainy     Cold          No        Mild       Normal   No
Morning   Sunny     Moderate      Yes       Normal     Normal   Yes
Evening   Sunny     Cold          Yes       High       Strong   Yes
Looking at the data set, we have six attributes and a final attribute that defines the positive or
negative example. In this case, ‘Yes’ is a positive example, which means the person will go for
a walk.
So now, the initial hypothesis, taken from the first positive example, is:
h0 = {‘Morning’, ‘Sunny’, ‘Warm’, ‘Yes’, ‘Mild’, ‘Strong’}
This is our most specific hypothesis, and now we will consider each example one by one, but only
the positive examples.
h1 = {‘Morning’, ‘Sunny’, ‘?’, ‘Yes’, ‘?’, ‘?’}
h2 = {‘?’, ‘Sunny’, ‘?’, ‘Yes’, ‘?’, ‘?’}
We replaced every value that differed between the hypothesis and a positive example with ‘?’
to get the resultant hypothesis.
Now that we know how the Find-S algorithm works, let us take a look at an implementation
using Python.
Use Case
Let’s try to implement the above example using Python. The code to implement the Find-S
algorithm using the above data is given below.
import pandas as pd
import numpy as np

# read the data from the csv file
data = pd.read_csv("data.csv")
print(data, "\n")

# make an array of all the attributes (every column except the last)
d = np.array(data)[:, :-1]
print("\n The attributes are: ", d)

# separate the target column that marks positive and negative examples
target = np.array(data)[:, -1]
print("\n The target is: ", target)

# training function to implement the Find-S algorithm
def train(c, t):
    # start from the first positive example as the most specific hypothesis
    for i, val in enumerate(t):
        if val == "Yes":
            specific_hypothesis = c[i].copy()
            break

    # generalize every attribute that disagrees with a positive example
    for i, val in enumerate(c):
        if t[i] == "Yes":
            for x in range(len(specific_hypothesis)):
                if val[x] != specific_hypothesis[x]:
                    specific_hypothesis[x] = '?'

    return specific_hypothesis

# obtain the final hypothesis
print("\n The final hypothesis is:", train(d, target))
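Output: assuming data.csv holds the four examples from the table above, the final hypothesis
printed is ['?' 'Sunny' '?' 'Yes' '?' '?'], which matches h2.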
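As a cross-check that does not depend on data.csv, the h0, h1, h2 trace from the walk example can be reproduced with plain Python lists. This is only a sketch: `EXAMPLES` and `find_s_trace` are names chosen here, and the rows are copied by hand from the table.

```python
# Rows copied from the walk table; the last field is the target "Goes".
EXAMPLES = [
    ("Morning", "Sunny", "Warm", "Yes", "Mild", "Strong", "Yes"),
    ("Evening", "Rainy", "Cold", "No", "Mild", "Normal", "No"),
    ("Morning", "Sunny", "Moderate", "Yes", "Normal", "Normal", "Yes"),
    ("Evening", "Sunny", "Cold", "Yes", "High", "Strong", "Yes"),
]

def find_s_trace(examples):
    """Return the hypotheses h0, h1, ... produced after each positive example."""
    trace = []
    hypothesis = None
    for *attributes, label in examples:
        if label != "Yes":
            continue  # negative examples are ignored by Find-S
        if hypothesis is None:
            hypothesis = list(attributes)  # first positive example
        else:
            # keep matching values, generalize mismatches to '?'
            hypothesis = [h if h == a else "?" for h, a in zip(hypothesis, attributes)]
        trace.append(list(hypothesis))
    return trace

for step, h in enumerate(find_s_trace(EXAMPLES)):
    print(f"h{step} = {h}")
```

Keeping every intermediate hypothesis makes it easy to compare the run against the hand-worked h0, h1 and h2.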
This brings us to the end of this study of the Find-S algorithm in Machine Learning, with
its implementation and use case.
Experiment No. 3
1. Aim: Implementation of Single layer Perceptron Learning Algorithm.
2. Objectives:
• To become familiar with neural network learning algorithms from available
examples.
• To provide knowledge of learning algorithms in neural networks.
3. Outcomes: The student will be able to,
• Have an understanding of the concepts and techniques of neural networks
through the study of the most important neural network models.
• Discuss the main factors involved in achieving good learning and
generalization performance in neural network systems.
• Use the current techniques and tools required for computing practice.
4. Software Required: JAVA / MATLAB
5. Theory:
Neural networks are a branch of “Artificial Intelligence”. An Artificial Neural Network is a
system loosely modelled on the human brain. Neural networks are a powerful
technique to solve many real world problems. They have the ability to learn from
experience in order to improve their performance and to adapt themselves to changes in the
environment. In addition, they are able to deal with incomplete information or noisy
data and can be very effective, especially in situations where it is not possible to define the
rules or steps that lead to the solution of a problem. In a nutshell, a neural network can be
considered as a black box that is able to predict an output pattern when it recognizes a given
input pattern. Once trained, the neural network is able to recognize similarities when
presented with a new input pattern, resulting in a predicted output pattern.
In the late 1950s, Frank Rosenblatt introduced a network composed of units that were an
enhanced version of the McCulloch-Pitts Threshold Logic Unit (TLU) model. Rosenblatt's
model of a neuron, the perceptron, was the result of a merger between two concepts from the
1940s: the McCulloch-Pitts model of an artificial neuron and the Hebbian learning rule for
adjusting weights. In addition to the variable weight values, the perceptron model added an
extra input that represents bias. Thus, the modified equation is now as follows:
$$y = f\left(\sum_{j=1}^{n} w_j x_j + b\right)$$
where x_j are the inputs, w_j are the connection weights, f is the threshold activation
function, and b represents the bias value.
6. Algorithm:
Perceptron Learning Algorithm:
The perceptron learning rule was originally developed by Frank Rosenblatt in the late 1950s.
Training patterns are presented to the network's inputs; the output is computed. Then the
connection weights w_j are modified by an amount that is proportional to the product of
• the difference between the actual output, y, and the desired output, d, and
• the input pattern, x.
The algorithm is as follows:
1. Initialize the weights and threshold to small random numbers.
2. Present a vector x to the neuron inputs and calculate the output.
3. Update the weights according to:
$$w_j(t+1) = w_j(t) + \eta \, (d - y) \, x_j$$
where
• d is the desired output,
• t is the iteration number, and
• η (eta) is the gain or step size, where 0.0 < η < 1.0
4. Repeat steps 2 and 3 until:
1. the iteration error is less than a user-specified error threshold or
2. a predetermined number of iterations have been completed.
Learning only occurs when an error is made; otherwise the weights are left unchanged.
[Figure: Multilayer Perceptron. Input signals (external stimuli) enter the input layer, pass
through adjustable weights, and the output layer produces the output values.]
Problem Statement: Implement AND function using perceptron model
Truth table for AND function is:
X1 X2 Y
0 0 0
0 1 0
1 0 0
1 1 1
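The problem statement can be sketched in Python as follows (the manual lists JAVA / MATLAB as the required software, so the language is a substitution made here). It follows the update rule "weight change = eta × (d − y) × x", with two assumptions that are not in the manual: the weights and bias start at zero instead of small random numbers so that the run is reproducible, and eta = 0.5 is an arbitrary choice inside the stated range.

```python
# Truth table for AND: ((x1, x2), desired output d)
AND_TABLE = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]

def predict(weights, bias, x):
    """Threshold unit: output 1 when the weighted sum plus bias is positive."""
    net = sum(w * xi for w, xi in zip(weights, x)) + bias
    return 1 if net > 0 else 0

def train_perceptron(samples, eta=0.5, max_epochs=100):
    # Assumption: zero initial values (the manual suggests small random numbers).
    weights = [0.0, 0.0]
    bias = 0.0
    for _ in range(max_epochs):
        errors = 0
        for x, d in samples:
            error = d - predict(weights, bias, x)
            if error != 0:
                # Learning only occurs when an error is made.
                weights = [w + eta * error * xi for w, xi in zip(weights, x)]
                bias += eta * error
                errors += 1
        if errors == 0:  # every pattern classified correctly: stop early
            break
    return weights, bias

weights, bias = train_perceptron(AND_TABLE)
outputs = [predict(weights, bias, x) for x, _ in AND_TABLE]
print(weights, bias, outputs)
```

Because AND is linearly separable, the loop stops after a handful of epochs, well before `max_epochs` is reached.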
7. Conclusion:
The single layer perceptron learning algorithm is implemented for the AND function. It is
used to train the neural network over successive iterations. A neural network mimics the
human brain, and the perceptron learning algorithm trains the neural network according to
the input given.
8. Viva Questions:
• What is a feed forward network?
• Write the logistic sigmoid function.
• Why use Artificial Neural Networks? What are their advantages?
• List some commercial practical applications of Artificial Neural Networks.
• What are the disadvantages of Artificial Neural Networks?
Experiment No. 4
1. Aim: Implementation of unsupervised learning algorithm – Hebbian Learning
2. Objectives:
• To become familiar with neural network learning algorithms from available
examples.
• To give design methodologies for artificial neural networks.
• To provide knowledge of unsupervised learning in neural networks.
3. Outcomes: The student will be able to,
• Explain the differences between networks for supervised and unsupervised
learning.
• Understand how ANNs can be designed and trained.
• Define the terms: unit, weight, activation function, threshold and architecture as
they relate to ANNs.
• Apply the knowledge of computing to the engineering discipline to solve the
problems of the neural network domains.
4. Software Required: C/ C++/JAVA/ MATLAB
5. Theory:
Unsupervised Learning Algorithm:
These types of models are not provided with the correct results during training.
They can be used to cluster the input data into classes on the basis of their statistical
properties only.
The labelling can be carried out even if the labels are only available for a small
number of objects representative of the desired classes. All similar input patterns are
grouped together as clusters. If a matching pattern is not found, a new cluster is formed.
In contrast to supervised learning, unsupervised or self-organized learning does
not require an external teacher. During the training session, the neural network receives a
number of different patterns and learns how to classify input data into appropriate
categories. Unsupervised learning tends to follow the neuro-biological organization of the
brain. It aims to learn rapidly and can be used in real time.
Hebbian Learning:
In 1949, Donald Hebb proposed one of the key ideas in biological learning,
commonly known as Hebb’s Law. Hebb’s Law states that if neuron i is near enough to
excite neuron j and repeatedly participates in its activation, the synaptic
connection between these two neurons is strengthened and neuron j becomes more
sensitive to stimuli from neuron i.
Hebb’s Law can be represented in the form of two rules:
1. If two neurons on either side of a connection are activated synchronously, then the
weight of that connection is increased.
2. If two neurons on either side of a connection are activated asynchronously, then the
weight of that connection is decreased.
Hebb’s Law provides the basis for learning without a teacher. Learning here is a local
phenomenon occurring without feedback from the environment.
• Using Hebb’s Law we can express the adjustment applied to the weight w_ij at iteration
p in the following form:
$$\Delta w_{ij}(p) = F\left[\, y_j(p),\; x_i(p) \,\right]$$
• As a special case, we can represent Hebb’s Law as follows:
$$\Delta w_{ij}(p) = \alpha \, y_j(p) \, x_i(p)$$
where α is the learning rate parameter.
• Hebbian learning implies that weights can only increase. To resolve this problem,
we might impose a limit on the growth of synaptic weights. It can be done by
introducing a non-linear forgetting factor into Hebb’s Law:
$$\Delta w_{ij}(p) = \alpha \, y_j(p) \, x_i(p) - \varphi \, y_j(p) \, w_{ij}(p)$$
where φ is the forgetting factor.
6. Hebbian learning algorithm
Step 1: Initialization
Set initial synaptic weights and thresholds to small random values, say in an interval [0, 1].
Step 2: Activation
Compute the neuron output at iteration p:
$$y_j(p) = \sum_{i=1}^{n} x_i(p) \, w_{ij}(p) - \theta_j$$
where n is the number of neuron inputs, and θ_j is the threshold value of neuron j.
Step 3: Learning
Update the weights in the network:
$$w_{ij}(p+1) = w_{ij}(p) + \Delta w_{ij}(p)$$
where Δw_ij(p) is the weight correction at iteration p.
Step 4: Iteration
Increase iteration p by one, go back to step 2.
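The four steps can be sketched in Python as below. Several choices here are assumptions made for illustration and are not taken from the manual: the neuron output is passed through a 0/1 step function, the starting weights and thresholds are fixed values rather than random ones so the run is reproducible, and `hebbian_epoch`, `alpha` and `phi` are names chosen for this sketch.

```python
def step(net):
    """Assumed activation: the neuron fires (1) when its net input is non-negative."""
    return 1 if net >= 0 else 0

def hebbian_epoch(patterns, weights, thresholds, alpha=0.5, phi=0.0):
    """Present each pattern once and apply Hebb's rule with forgetting factor phi.

    weights[i][j] connects input i to neuron j; the list is updated in place.
    """
    n_inputs = len(weights)
    n_outputs = len(thresholds)
    for x in patterns:
        # Step 2: activation of every neuron j for this input pattern
        y = [
            step(sum(x[i] * weights[i][j] for i in range(n_inputs)) - thresholds[j])
            for j in range(n_outputs)
        ]
        # Step 3: learning, delta_w = alpha * y_j * x_i - phi * y_j * w_ij
        for i in range(n_inputs):
            for j in range(n_outputs):
                weights[i][j] += alpha * y[j] * x[i] - phi * y[j] * weights[i][j]
    return weights

# Hypothetical two-input, two-neuron network and two training patterns.
weights = [[0.5, 0.0], [0.0, 0.5]]
thresholds = [0.25, 0.25]
patterns = [[1, 0], [0, 1]]
hebbian_epoch(patterns, weights, thresholds)
print(weights)
```

Only connections whose input and output are active together are strengthened; the cross connections stay at zero, which is the local, teacher-free behaviour described above.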
7. Conclusion:
The unsupervised Hebbian learning algorithm is implemented, which does not require a
supervisor. It updates the weights according to the activity of the connected neurons and so
trains the network.
8. Viva Questions:
• What is unsupervised training?
• How do artificial neurons learn?
• What is the difference between a neural network and fuzzy logic?
Experiment No. 5
Study of Backpropagation – Algorithm For Training A Neural Network

Backpropagation is a supervised learning algorithm for training Multi-layer Perceptrons (Artificial
Neural Networks).
Why We Need Backpropagation?
While designing a Neural Network, in the beginning, we initialize the weights with some random values.
Now obviously, we are not superhuman. So, there is no guarantee that whatever weight values we have
selected will be correct, or that they fit our model the best.
Okay, fine, we have selected some weight values in the beginning, but our model output is way different from
our actual output, i.e. the error value is huge.
Now, how will you reduce the error?
Basically, we need to somehow tell the model to change the parameters (weights),
such that the error becomes minimum.
Let’s put it another way: we need to train our model.
One way to train our model is called Backpropagation.
[Figure: diagram of the training loop.]
Let me summarize the steps for you:
• Calculate the error – How far is your model output from the actual output.
• Minimum Error – Check whether the error is minimized or not.
• Update the parameters – If the error is huge, then update the parameters (weights and biases). After
that, again check the error. Repeat the process until the error becomes minimum.
• Model is ready to make a prediction – Once the error becomes minimum, you can feed some inputs
to your model and it will produce the output.
I am pretty sure you now know why we need Backpropagation and what training a model means.
Now is the correct time to understand what Backpropagation is.
What is Backpropagation?
The Backpropagation algorithm looks for the minimum value of the error function in weight space using a
technique called the delta rule or gradient descent. The weights that minimize the error function are then
considered to be a solution to the learning problem.
Let’s understand how it works with an example:
You have a dataset, which has labels.
Consider the below table:
Input   Desired Output
0       0
1       2
2       4
Now the output of your model when the ‘W’ value is 3:
Input   Desired Output   Model output (W=3)
0       0                0
1       2                3
2       4                6
Notice the difference between the model output and the desired output:
Input   Desired Output   Model output (W=3)   Absolute Error   Square Error
0       0                0                    0                0
1       2                3                    1                1
2       4                6                    2                4
Let’s change the value of ‘W’. Notice the error when ‘W’ = 4:
Input   Desired Output   Model output (W=3)   Absolute Error   Square Error   Model output (W=4)   Square Error
0       0                0                    0                0              0                    0
1       2                3                    1                1              4                    4
2       4                6                    2                4              8                    16
Now if you notice, when we increase the value of ‘W’ the error has increased. So, obviously there is no point in
increasing the value of ‘W’ further. But what happens if I decrease the value of ‘W’? Consider the table below:
Input   Desired Output   Model output (W=3)   Absolute Error   Square Error   Model output (W=2)   Square Error
0       0                0                    0                0              0                    0
1       2                3                    1                1              2                    0
2       4                6                    2                4              4                    0
Now, what we did here:
• We first initialized some random value to ‘W’ and propagated forward.
• Then, we noticed that there is some error. To reduce that error, we propagated backwards and
increased the value of ‘W’.
• After that, we noticed that the error has increased. We came to know that we can’t increase the
‘W’ value.
• So, we again propagated backwards and decreased the ‘W’ value.
• Now, we noticed that the error has reduced.
So, we are trying to get the value of the weight such that the error becomes minimum. Basically, we need to figure
out whether we need to increase or decrease the weight value. Once we know that, we keep on updating the
weight value in that direction until the error becomes minimum. You might reach a point where, if you further update
the weight, the error will increase. At that time you need to stop, and that is your final weight value.
[Figure: graph of the loss against the weight value, with the ‘Global Loss Minimum’ marked.]
We need to reach the ‘Global Loss Minimum’.
This is nothing but Backpropagation.
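The trial-and-error search over ‘W’ can be replaced by following the slope of the error. The sketch below uses the three-row table from above with the model output W × input; the learning rate of 0.05 and the 50 update steps are arbitrary values assumed here, and the function names are chosen for illustration.

```python
INPUTS = [0, 1, 2]
DESIRED = [0, 2, 4]

def square_error(w):
    """Sum of squared errors of the model output w * x over the table."""
    return sum((w * x - d) ** 2 for x, d in zip(INPUTS, DESIRED))

def gradient(w):
    """Derivative of square_error with respect to w."""
    return sum(2 * (w * x - d) * x for x, d in zip(INPUTS, DESIRED))

def fit(w=3.0, learning_rate=0.05, steps=50):
    """Start from W = 3, as in the tables, and repeatedly step against the gradient."""
    for _ in range(steps):
        w -= learning_rate * gradient(w)
    return w

print(square_error(3), square_error(4), square_error(2))
print(fit())
```

The sign of the gradient says whether to increase or decrease ‘W’, so the weight moves from 3 towards 2, where the error in the last table is zero.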
Let’s now understand the math behind Backpropagation.
How Backpropagation Works?
Consider the below Neural Network:
[Figure: neural network with two inputs, two hidden neurons, two output neurons and two biases.]
The above network contains the following:
• two inputs
• two hidden neurons
• two output neurons
• two biases
Below are the steps involved in Backpropagation:
• Step – 1: Forward Propagation
• Step – 2: Backward Propagation
• Step – 3: Putting all the values together and calculating the updated weight value
Step – 1: Forward Propagation
We will start by propagating forward.
We will repeat this process for the output layer neurons, using the output from the hidden layer neurons as
inputs.
Now, let’s see what is the value of the error:
Step – 2: Backward Propagation
Now, we will propagate backwards. This way we will try to reduce the error by changing the values of weights
and biases.
Consider W5; we will calculate the rate of change of error w.r.t. change in weight W5.
Since we are propagating backwards, the first thing we need to do is calculate the change in total error w.r.t. the
outputs O1 and O2.
Now, we will propagate further backwards and calculate the change in output O1 w.r.t. its total net input.
Let’s see now how much the total net input of O1 changes w.r.t. W5.
Step – 3: Putting all the values together and calculating the updated weight value
Now, let’s put all the values together:
Let’s calculate the updated value of W5:
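As a hedged reconstruction of the chain of derivatives described in Steps 2 and 3, the relations can be written symbolically as follows. The symbols E_total (total error), out_O1 and net_O1 (output and total net input of neuron O1) and the learning rate η are notation assumed here; no numeric values are implied.

```latex
\frac{\partial E_{total}}{\partial W_5}
  = \frac{\partial E_{total}}{\partial out_{O1}}
    \cdot \frac{\partial out_{O1}}{\partial net_{O1}}
    \cdot \frac{\partial net_{O1}}{\partial W_5},
\qquad
W_5^{new} = W_5 - \eta \, \frac{\partial E_{total}}{\partial W_5}
```

The three factors correspond, in order, to the change in total error w.r.t. the output O1, the change in output O1 w.r.t. its total net input, and the change in that net input w.r.t. W5.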
• Similarly, we can calculate the other weight values as well.
• After that we will again propagate forward and calculate the output. Again, we will calculate the error.
• If the error is minimum we will stop right there, else we will again propagate backwards and update the
weight values.
• This process will keep on repeating until the error becomes minimum.
Conclusion:
Well, if I have to conclude Backpropagation, the best option is to write pseudo code for the same.
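In place of pseudo code, here is a minimal runnable sketch of the loop for a network with two inputs, two hidden neurons, two output neurons and two biases. It assumes sigmoid activations and a squared-error loss, which the text does not state, and it holds the two biases fixed for brevity. All starting numbers are hypothetical values chosen for illustration.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def forward(x, w_hidden, b_hidden, w_output, b_output):
    """Return (hidden activations, output activations) for one input vector."""
    hidden = [sigmoid(sum(w * xi for w, xi in zip(row, x)) + b_hidden) for row in w_hidden]
    output = [sigmoid(sum(w * h for w, h in zip(row, hidden)) + b_output) for row in w_output]
    return hidden, output

def total_error(output, target):
    return sum(0.5 * (t - o) ** 2 for t, o in zip(target, output))

def train_step(x, target, w_hidden, b_hidden, w_output, b_output, eta=0.5):
    """One forward pass and one backward pass; the weight lists are updated in place."""
    hidden, output = forward(x, w_hidden, b_hidden, w_output, b_output)
    # dE/dnet for each output neuron: (out - target) * out * (1 - out)
    delta_out = [(o - t) * o * (1 - o) for o, t in zip(output, target)]
    # dE/dnet for each hidden neuron, computed before the output weights change
    delta_hid = [
        sum(delta_out[k] * w_output[k][j] for k in range(len(w_output))) * h * (1 - h)
        for j, h in enumerate(hidden)
    ]
    for k, row in enumerate(w_output):
        for j in range(len(row)):
            row[j] -= eta * delta_out[k] * hidden[j]
    for j, row in enumerate(w_hidden):
        for i in range(len(row)):
            row[i] -= eta * delta_hid[j] * x[i]
    return total_error(output, target)  # error measured before this update

# Hypothetical inputs, targets and starting weights.
x, target = [0.05, 0.10], [0.01, 0.99]
w_hidden = [[0.15, 0.20], [0.25, 0.30]]
w_output = [[0.40, 0.45], [0.50, 0.55]]
b_hidden, b_output = 0.35, 0.60  # biases are held fixed in this sketch

errors = [train_step(x, target, w_hidden, b_hidden, w_output, b_output) for _ in range(1000)]
print(errors[0], errors[-1])
```

Each call repeats the cycle described above: propagate forward, measure the error, propagate backwards, update the weights.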
Experiment No. 6
Study of Naïve Bayes Algorithm
What is Naive Bayes algorithm?
It is a classification technique based on Bayes’ Theorem with an assumption of independence among predictors. In
simple terms, a Naive Bayes classifier assumes that the presence of a particular feature in a class is unrelated to the
presence of any other feature.
For example, a fruit may be considered to be an apple if it is red, round, and about 3 inches in diameter. Even if
these features depend on each other or upon the existence of the other features, all of these properties
independently contribute to the probability that this fruit is an apple, and that is why it is known as ‘Naive’.
A Naive Bayes model is easy to build and particularly useful for very large data sets. Despite its simplicity, Naive
Bayes can sometimes perform as well as, or better than, more sophisticated classification methods.
Bayes’ theorem provides a way of calculating the posterior probability P(c|x) from P(c), P(x) and P(x|c). Look at the
equation below:
$$P(c \mid x) = \frac{P(x \mid c) \, P(c)}{P(x)}$$
Above,
• P(c|x) is the posterior probability of class (c, target) given predictor (x, attributes).
• P(c) is the prior probability of class.
• P(x|c) is the likelihood, which is the probability of predictor given class.
• P(x) is the prior probability of predictor.
How Naive Bayes algorithm works?
Let’s understand it using an example. Below is a training data set of weather and the corresponding target
variable ‘Play’ (suggesting possibilities of playing). Now, we need to classify whether players will play or not
based on the weather condition. Let’s follow the below steps to perform it.
Step 1: Convert the data set into a frequency table.
Step 2: Create a likelihood table by finding the probabilities, e.g. Overcast probability = 0.29 and probability of
playing = 0.64.
Step 3: Now, use the Naive Bayesian equation to calculate the posterior probability for each class. The class with
the highest posterior probability is the outcome of prediction.
Problem: Players will play if the weather is sunny. Is this statement correct?
We can solve it using the above discussed method of posterior probability.
P(Yes | Sunny) = P(Sunny | Yes) * P(Yes) / P(Sunny)
Here we have P(Sunny | Yes) = 3/9 = 0.33, P(Sunny) = 5/14 = 0.36, P(Yes) = 9/14 = 0.64
Now, P(Yes | Sunny) = 0.33 * 0.64 / 0.36 = 0.60, which is higher than P(No | Sunny) = 0.40, so the statement is
correct.
Naive Bayes uses a similar method to predict the probability of different classes based on various attributes. This
algorithm is mostly used in text classification and with problems having multiple classes.
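The arithmetic for the Sunny example can be reproduced exactly with fractions, which avoids the rounding in 0.33, 0.64 and 0.36. The variable names below are chosen for this sketch; the counts are the ones quoted in the text.

```python
from fractions import Fraction

# Probabilities quoted in the worked example
p_sunny_given_yes = Fraction(3, 9)
p_yes = Fraction(9, 14)
p_sunny = Fraction(5, 14)

# Bayes' theorem: P(Yes | Sunny) = P(Sunny | Yes) * P(Yes) / P(Sunny)
p_yes_given_sunny = p_sunny_given_yes * p_yes / p_sunny
p_no_given_sunny = 1 - p_yes_given_sunny

print(float(p_yes_given_sunny), float(p_no_given_sunny))
```

Since ‘Yes’ has the larger posterior, the predicted class for a sunny day is that the players will play.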
What are the Pros and Cons of Naive Bayes?
Pros:
• It is easy and fast to predict the class of a test data set. It also performs well in multi-class prediction.
• When the assumption of independence holds, a Naive Bayes classifier performs better compared to other models
like logistic regression, and you need less training data.
• It performs well in the case of categorical input variables compared to numerical variable(s). For numerical
variables, a normal distribution is assumed (bell curve, which is a strong assumption).
Cons:
• If a categorical variable has a category (in the test data set) which was not observed in the training data set, then
the model will assign a 0 (zero) probability and will be unable to make a prediction. This is often known as “Zero
Frequency”. To solve this, we can use a smoothing technique. One of the simplest smoothing techniques is
called Laplace estimation.
• On the other side, Naive Bayes is also known as a bad estimator, so the probability outputs from predict_proba
(the probability-estimate method of scikit-learn classifiers) are not to be taken too seriously.
• Another limitation of Naive Bayes is the assumption of independent predictors. In real life, it is almost
impossible that we get a set of predictors which are completely independent.
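The “Zero Frequency” problem and Laplace estimation can be illustrated with a small sketch. The counts are hypothetical: a category that never occurs among 5 training examples of a class, for an attribute with 3 possible categories. The function name `likelihood` and the parameter `alpha` are chosen here for illustration.

```python
from fractions import Fraction

def likelihood(count, class_total, n_categories, alpha=0):
    """P(category | class) with additive smoothing; alpha=0 means no smoothing."""
    return Fraction(count + alpha, class_total + alpha * n_categories)

# Hypothetical counts: the category was seen 0 times in a class of 5 examples,
# and the attribute has 3 possible categories.
unsmoothed = likelihood(0, 5, 3)         # zero wipes out the whole product
smoothed = likelihood(0, 5, 3, alpha=1)  # Laplace estimation adds one to each count
print(unsmoothed, smoothed)
```

With alpha = 1 the unseen category gets a small non-zero likelihood, so the posterior for that class is no longer forced to zero.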
4 Applications of Naive Bayes Algorithms
• Real time Prediction: Naive Bayes is an eager learning classifier and it is fast. Thus, it could be used for
making predictions in real time.
• Multi class Prediction: This algorithm is also well known for its multi-class prediction feature. Here we can
predict the probability of multiple classes of the target variable.
• Text classification/ Spam Filtering/ Sentiment Analysis: Naive Bayes classifiers, mostly used in text
classification (due to better results in multi-class problems and the independence rule), have a higher success rate
compared to other algorithms. As a result, they are widely used in spam filtering (identifying spam e-mail) and
sentiment analysis (in social media analysis, to identify positive and negative customer sentiments).
• Recommendation System: A Naive Bayes classifier and Collaborative Filtering together build a
Recommendation System that uses machine learning and data mining techniques to filter unseen information
and predict whether a user would like a given resource or not.
