Professional Documents
Culture Documents
DATA Presentation
DATA Presentation
DATA Presentation
PREPARATION
Divesh Dubey
AGENDA
Data Reduction:
Sampling
Feature selection
Principal Component analysis
Data discretization
3
INTRODUCTION
DATA REDUCTION
Whenever there is a larger dataset available it is also appropriate to
reduce its size, in order to make learning algorithms more efficient,
without sacrificing the quality of the results obtained.
DATA REDUCTION
SAMPLING
EXAMPLE:
1. Bamboo (10)
2. Palm (7)
3. Mango (8)
4. Araucaria (20)
5. Coconut (10)
FEATURE SELECTION
Data reduction process.
Process where you automatically / manually select important
features.
Irrelevant features
Decreases the accuracy
Data presentation 9
Feature selection
methods
Embedded
Filter method Wrapper method
method
Presentation title 10