Professional Documents
Culture Documents
Data Mining & Classifiers
Data Mining & Classifiers
AMIT PAUL
BY TAMAGHNO CHAUDHURI, CSE, 3RD YEAR
In the computer and Internet arena, a data set is a group of numbers, or bytes, often
displayed in a table with the columns categorizing the data into subsets.
1.Sequential
2.Partitioned
3.Visual Storage Access Method (VSAM)
Data sets are organized according to their quantity and the frequency and method by
which they will be accessed. The format of the individual data sets also depends on
the intended use of the information Bijou Solutions, Inc. | 2020
WHAT
IS
DATA MINING ??
KNOWLEDGE DISCOVERY FROM DATA (KDD)
Natural evolution of Information Technology. We're are living in the data age.
Data mining turns a large collection of data into knowledge, based on which a
machine can recognize a given data set based on predefined data analytics.
ASSOCIATION
CLUSTERING
CLASSIFICATION
Working on a data set, the data analysis task is classification, where a model
or classifier is constructed to predict class (categorical) labels based on one
or more numerical and/or categorical variables (predictors, attributes,
features).
LEARNING
STEP
Place the best attribute of the data-set at the root of the tree.
Repeat step 1 and step 2 on each subset until you find leaf
nodes in all the branches of the tree.
For example:
We're actually studying a maths example sum
with the formulae in hand and then solving an
exercise problem.
Based on class level , the decision is taken,
the formula for this is derived from the
information table.
TRAINING
How:
While the decision is taken, the decision tree
is constructed. Every node/level is one
decision.
How will we attend
our objective?
As soon as the training data-set changes,
the model changes, but the algorithm never
changes.