Data Mining and Warehousing
DATA WAREHOUSING AND DATA MINING
PRESENTED BY
N.ASHOK
S.R.ARUN KUMAR
THIRU RAMAKRISHNA NALLAMMAI POLYTECHNIC COLLEGE, DHARAPURAM.
DATA WAREHOUSING
Data warehousing is a method of collecting
information from various sources and storing it
under a unified model at a single site, together
with a process of centralized data management
and retrieval.
Data warehousing represents an ideal vision of
maintaining a central storage area for all
organizational data.
Centralization of data is needed to maximize
user access and analysis.
CHARACTERISTICS
Subject oriented
Integrated
Time variant
Non-volatile
FUNCTIONS
1. Design
2. Prototype
3. Deploy
4. Operate
5. Enhance
BUILDING A DATA WAREHOUSE
Extract data from multiple sources.
Format the data for consistency within the
warehouse.
Clean the data to ensure validity.
Convert the schema of the data to a
common integrated schema.
Load the cleaned data into the warehouse.
Back flushing: return the cleaned data to
the source systems.
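The steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the two source systems, their field names, the date formats, and the validity rule (a non-empty customer name and a non-negative amount) are all hypothetical, and the "warehouse" is a plain list rather than a real database.

```python
from datetime import datetime

# Hypothetical source records; field names and values are illustrative only.
SALES_SYSTEM = [
    {"cust": " Alice ", "amt": "120.50", "date": "2023-01-05"},
    {"cust": "Bob", "amt": "-5", "date": "2023-01-06"},  # invalid amount
]
BILLING_SYSTEM = [
    {"customer_name": "carol", "total": 80.0, "day": "06/01/2023"},
]


def extract():
    """Extract: pull raw rows from each source, tagging where they came from."""
    rows = [("sales", r) for r in SALES_SYSTEM]
    rows += [("billing", r) for r in BILLING_SYSTEM]
    return rows


def to_common_schema(source, row):
    """Format and convert: map each source's fields onto one shared schema."""
    if source == "sales":
        name, amount = row["cust"], float(row["amt"])
        date = datetime.strptime(row["date"], "%Y-%m-%d").date()
    else:
        name, amount = row["customer_name"], float(row["total"])
        date = datetime.strptime(row["day"], "%d/%m/%Y").date()
    return {
        "customer": name.strip().title(),
        "amount": amount,
        "date": date.isoformat(),
    }


def is_valid(row):
    """Clean: reject rows that fail a simple (assumed) validity rule."""
    return bool(row["customer"]) and row["amount"] >= 0


def load(warehouse, rows):
    """Load: append the cleaned rows to the warehouse (a list in this sketch)."""
    warehouse.extend(rows)


def build_warehouse():
    warehouse = []
    converted = [to_common_schema(src, row) for src, row in extract()]
    load(warehouse, [r for r in converted if is_valid(r)])
    return warehouse


if __name__ == "__main__":
    for row in build_warehouse():
        print(row)
```

In this sketch the row with a negative amount is dropped during cleaning, and the two remaining rows end up with the same field names and the same date format regardless of which source they came from.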
OLAP
Online Analytical Processing:
Analyzes data in a multidimensional
format.
Transforms warehouse data into
specific, meaningful information.
Provides a user-friendly environment for
interactive data analysis.
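As a rough illustration of multidimensional analysis, the sketch below treats a small set of sales facts as a cube with three dimensions and summarizes it along chosen dimensions. The dimension names, the data, and the helper names `roll_up` and `slice_cube` are assumptions made for this example; a real OLAP product would provide its own operations.

```python
from collections import defaultdict

# Hypothetical fact rows: (region, product, quarter, sales). Values are made up.
FACTS = [
    ("North", "Pens", "Q1", 100),
    ("North", "Pens", "Q2", 150),
    ("North", "Books", "Q1", 200),
    ("South", "Pens", "Q1", 50),
    ("South", "Books", "Q2", 300),
]
DIMENSIONS = ("region", "product", "quarter")


def roll_up(facts, keep):
    """Sum the sales measure, grouping only by the dimensions named in `keep`."""
    idx = [DIMENSIONS.index(d) for d in keep]
    totals = defaultdict(int)
    for row in facts:
        key = tuple(row[i] for i in idx)
        totals[key] += row[-1]
    return dict(totals)


def slice_cube(facts, dimension, value):
    """Keep only the facts where one dimension is fixed to a single value."""
    i = DIMENSIONS.index(dimension)
    return [row for row in facts if row[i] == value]


if __name__ == "__main__":
    print(roll_up(FACTS, ["region"]))
    print(roll_up(slice_cube(FACTS, "quarter", "Q1"), ["product"]))
```

Grouping by fewer dimensions gives a coarser summary, while slicing fixes one dimension and leaves the others free for further analysis.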
Structure of OLAP
USER <-> OLAP SERVER <-> DATA WAREHOUSE
(the OLAP server sends SQL to the data warehouse and receives the result)
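The structure above can be mimicked with Python's built-in `sqlite3` module: a function standing in for the OLAP server turns a user's request into SQL, runs it against the warehouse, and hands back the result. The table name `sales`, its columns, the data, and the function names are hypothetical, and an in-memory SQLite database is only a stand-in for a real warehouse.

```python
import sqlite3


def make_warehouse():
    """Create an in-memory stand-in for the data warehouse (illustrative data)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE sales (region TEXT, quarter TEXT, amount REAL)")
    conn.executemany(
        "INSERT INTO sales VALUES (?, ?, ?)",
        [("North", "Q1", 100.0), ("North", "Q2", 150.0), ("South", "Q1", 50.0)],
    )
    return conn


def olap_server(conn, group_by):
    """Translate a user's request into SQL, query the warehouse, return the result."""
    # Column names cannot be passed as SQL parameters, so check them against
    # a fixed set before building the query text.
    allowed = {"region", "quarter"}
    if group_by not in allowed:
        raise ValueError(f"unknown dimension: {group_by}")
    sql = (
        f"SELECT {group_by}, SUM(amount) FROM sales "
        f"GROUP BY {group_by} ORDER BY {group_by}"
    )
    return conn.execute(sql).fetchall()


if __name__ == "__main__":
    warehouse = make_warehouse()
    # The "user" asks for totals by region; the server does the SQL work.
    print(olap_server(warehouse, "region"))
```

The user never writes SQL in this sketch; they only name the dimension they want totals for, which reflects the role of the OLAP server as an intermediary.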
DATA MINING
Data mining is the process of extracting
relevant information, or finding hidden
knowledge (new information), from large
databases.
It has been described as "the science of
extracting useful information from large
data sets or databases."
Data mining software is one of a number of
analytical tools for analyzing data. It allows users
to analyze data from many different dimensions
or angles, categorize it, and summarize the
relationships identified.
Technically, data mining is the process of finding
correlations or patterns among dozens of fields
in large relational databases.
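To make "finding correlations among fields" concrete, the sketch below computes the Pearson correlation coefficient for every pair of numeric fields in a tiny table and reports the pairs that exceed a threshold. Pearson correlation is just one simple measure chosen for illustration; the field names, the values, and the 0.8 threshold are assumptions, and real data mining tools use many other techniques.

```python
from itertools import combinations
from math import sqrt

# Hypothetical table: each field maps to a column of values (made-up numbers).
TABLE = {
    "ad_spend": [1.0, 2.0, 3.0, 4.0, 5.0],
    "units_sold": [2.0, 4.0, 6.0, 8.0, 10.0],
    "shelf_number": [3.0, 1.0, 5.0, 2.0, 4.0],
}


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length columns."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    if sx == 0 or sy == 0:
        return 0.0  # a constant column has no defined correlation; treat as none
    return cov / (sx * sy)


def correlated_pairs(table, threshold=0.8):
    """Return (field_a, field_b, r) for every pair with |r| >= threshold."""
    pairs = []
    for a, b in combinations(sorted(table), 2):
        r = pearson(table[a], table[b])
        if abs(r) >= threshold:
            pairs.append((a, b, round(r, 3)))
    return pairs


if __name__ == "__main__":
    print(correlated_pairs(TABLE))
```

With dozens of fields the number of pairs grows quickly, which is one reason this kind of search is automated rather than done by hand.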
[Figure: Data Mining, followed by Pattern Evaluation and Knowledge Presentation]