Professional Documents
Culture Documents
Machine Learning
Machine Learning
Machine Learning
If shape of object is rounded and depression at top having color Red then it will
be labelled as –Apple.
If shape of object is long curving cylinder having color Green-Yellow then it will
be labelled as –Banana.
Now suppose after training the data, you have given a new separate fruit say
Banana from basket and asked to identify it.
Since the machine has already learned the things from previous data and this time
have to use it wisely. It will first classify the fruit with its shape and color and would
confirm the fruit name as BANANA and put it in Banana category. Thus the machine
learns the things from training data(basket containing fruits) and then apply the
knowledge to test data(new fruit).There is a basket which is filled with some fresh
fruits, the task is to arrange the same type of fruits at one place.
Also, suppose that the fruits are apple, banana, cherry, grape.
Suppose one already knows from their previous work (or experience) that, the shape
of each and every fruit present in the basket so, it is easy for them to arrange the
same type of fruits in one place.
Here, the previous work is called as training data in Data Mining terminology. So, it
learns the things from the training data. This is because it has a response variable
which says y that if some fruit has so and so features then it is grape, and similarly
for each and every fruit.
This type of information is deciphered from the data that is used to train the model.
This type of learning is called Supervised Learning.
Such problems are listed under classical Classification Tasks
Unsupervised learning:
Unsupervised learning is the training of machine using information that is neither
classified nor labeled and allowing the algorithm to act on that information without
guidance. Here the task of machine is to group unsorted information according to
similarities, patterns and differences without any prior training of data.
Unlike supervised learning, no teacher is provided that means no training will be
given to the machine. Therefore machine is restricted to find the hidden structure in
un labeled data by our-self.
For instance, suppose it is given an image having both dogs and cats which have
not seen ever.
Thus the machine has no idea about the features of dogs and cat so we can’t
categorize it in dogs and cats. But it can categorize them according to their
similarities, patterns, and differences i.e., we can easily categorize the above picture
into two parts. First first may contain all pics having dogs in it and second part may
contain all pics having cats in it. Here you didn’t learn anything before, means no
training data or examples.
Unsupervised learning classified into two categories of algorithms:
Clustering: A clustering problem is where you want to discover the inherent
groupings in the data, such as grouping customers by purchasing behavior.
Association: An association rule learning problem is where you want to
discover rules that describe large portions of your data, such as people that buy
X also tend to buy Y.
Unsupervised learning is where only the input data (say, X) is present and no
corresponding output variable is there.
Why Unsupervised Learning?
The main aim of Unsupervised learning is to model the distribution in the data
in order to learn more about the data.
It is called so, because there is no correct answer and there is no such
teacher(unlike supervised learning). Algorithms are left to their own devises to
discover and present the interesting structure in the data.
Here is an example as above mentioned:
As like same there is a basket and it is filled with some fresh fruits. The task is to
arrange the same type of fruits at one place.
This time there is no information about those fruits beforehand, its the first time that
the fruits are being seen or discovered
So how to group similar fruits without any prior knowledge about those.
First, any physical characteristic of a particular fruit is selected. Suppose color.
Then the fruits are arranged on the basis of the color. The groups will be something
as shown below:
RED COLOR GROUP: apples & cherry fruits.
GREEN COLOR GROUP: bananas & grapes.
So now, take another physical character say, size, so now the groups will be
something like this.
RED COLOR AND BIG SIZE: apple.
RED COLOR AND SMALL SIZE: cherry fruits.
GREEN COLOR AND BIG SIZE: bananas.
GREEN COLOR AND SMALL SIZE: grapes.
Here, there is no need to know or learn anything beforehand. That means, no train
data and no response variable. This type of learning is known as Unsupervised
Learning.
Difference b/w Supervised and Unsupervised Learning :
SUPERVISED
Uses Known
and Labeled
Computational
Uses off-line
Number of
Accurate and