Professional Documents
Culture Documents
Data Warehousing An Overview1821
Data Warehousing An Overview1821
Challenges
Users with different needs Performance and speed Multiple data sources Diverse platforms and security systems Global deployment, multilingual support Many technologies deployed
Comparison
Database
For operational activities. Volatile 2-D Structure SQL query is required
Data Warehouse
To store historical data Non Volatile (Permanent) Multi Dimensional Advanced SQL query is required
Data Warehouse
Definition
A data warehouse is a centralized repository containing comprehensive detailed and summary data that provides a complete view of customers, suppliers, business processes, and transactions, from a historical perspective with little volatility.
Process
Transform Source EXTRACT Source
Extraction Integration Data Cleansing Data Scrubbing Transformation
LOAD
TARGET
Advantages
High query performance: queries are answered directly from DW Does not interfere with local processing at sources Data is available in the DW
12/19/2012
Disadvantages
DW contains possibly outdated data lacks latest data
Depends on refresh rate
12/19/2012
Data Mart
Data Mart
Contains a subset of the data stored in the data warehouse that is of interest to a specific business community, department, or set of users (for example: marketing promotions, finance, or account collections) The data warehouse serves as a single source for multiple data marts
THANK YOU