Download as pdf or txt
Download as pdf or txt
You are on page 1of 4

Data Mining: Confluence of Multiple Disciplines

Statistics Machine
Learning Pattern
Recognition
Database
Systems
Visualization
Data warehouse Data Mining
Algorithm
Information
Retrieval High
Applications Performance
computing

Dr Senthilkumar N C, Asso Prof, SITE


Classification of Data Mining Systems
 according to the kinds of databases mined

- data models (relational, transactional, DW, object-relational)

- types of data (spatial, text, time series, multimedia, www)

- application

 according to the kinds of knowledge mined

-based on data mining functionalities

-based on the granularity or levels of abstraction of the


knowledge mined (generalized, primitive level, multiple levels)

-mine data regularities(commonly occurring patterns) versus


those that mine data irregularities (such as exceptions, or outliers)
Dr Senthilkumar N C, Asso Prof, SITE
 according to the kinds of techniques utilized

-degree of user interaction involved (e.g., autonomous systems,


interactive exploratory systems, query-driven systems)

-the methods of data analysis employed (e.g., database-oriented


or data warehouse–oriented techniques, machine learning, statistics,
visualization, pattern recognition, neural networks, and so on)

 according to the applications adapted

finance, telecommunications, DNA, stock markets, e-mail, and so on

Dr Senthilkumar N C, Asso Prof, SITE


Integration of a Data Mining with DB/DW Systems

 No coupling—flat file processing, not recommended


 Loose coupling
 Fetching data from DB/DW. it is difficult to achieve high
scalability and good performance with large data sets
 Semi-tight coupling—enhanced DM performance
 Provide efficient implement a few data mining primitives in a
DB/DW system, e.g., sorting, indexing, aggregation, histogram
analysis, multiway join, precomputation of some stat functions
 Tight coupling—A uniform information processing environment
 DM is smoothly integrated into a DB/DW system, mining query
is optimized based on mining query, indexing, query processing
methods, etc.

Dr Senthilkumar N C, Asso Prof, SITE

You might also like