Professional Documents
Culture Documents
Introduction To Data Warehousing Concepts
Introduction To Data Warehousing Concepts
Concepts
Topics Covered
December 9, 2021
What is a data warehouse?
– Data warehouse is a database designed in such a way that it is optimized for querying
and data analysis.
December 9, 2021
Definition of a data warehouse
A data warehouse is a :
– Subject-oriented
– Integrated
– Non-volatile
– Time variant
– Accessible
Store of data obtained from variety of sources and made available to end users in a way
that they can understand and use in a business context.
December 9, 2021
Definition of a data warehouse
Subject-oriented
– The data in the data warehouse is organized so that all the data elements relating to
the same real-world event or object are linked together.
Time-variant
– The changes to the data in the data warehouse are tracked and recorded so that
reports can be produced showing changes over time.
Non-volatile
– Data in the data warehouse is never over-written or deleted - once committed, the data
is static, read-only, and retained for future reporting.
Integrated
– The data warehouse contains data from most or all of an organization's operational
systems and this data is made consistent.
December 9, 2021
Why organizations use data warehousing?
Competitive business environment creates need for complex analysis of ever increasing
volume of business data.
Hence data warehousing is used:
– to turn vast volumes of business data into meaningful management information
– Give users online access to this information
December 9, 2021
OLTP Vs. OLAP
The main aim of OLTP is reliable and efficient processing of a large number of
transactions and ensuring data consistency.
The main aim of OLAP is efficient multidimensional processing of large data volumes.
December 9, 2021
OLTP Vs. OLAP
OLTP OLAP
December 9, 2021
OLTP Vs. OLAP
OLTP OLAP
December 9, 2021
Dimensional Modeling
December 9, 2021
Dimensions ( Who, what, when, where )
December 9, 2021
For Example
Loc_Code Varchar(4)
Name Varchar(50)
State_Name Varchar(20)
County_Name Varchar(20)
December 9, 2021
For Example
December 9, 2021
Measures ( metrics and measurements )
Measures are summarized numeric data regarding the actual business process.
Features of Measures:
– Usually measures are additive ( like total sales ). However they can be semi-additive
( like balances ) or non-additive ( like unit price ).
– Measures are aggregated/rolled up on the basis of the dimensions.
– Facts are an overall summary of the measures related to a business area i.e. fact
tables contain measures.
December 9, 2021
For Example
PR_Dim_Id Integer(4)
LOC_Dim_Id Integer(4)
Sales Integer(4)
Tax Integer(4)
December 9, 2021
For Example
December 9, 2021
Types of data warehouses…
Data warehouse without staging area
Operational
system Analysis
Metadata
repository
December 9, 2021
Types of data warehouses…
Data warehouse with staging area
Operational
system Analysis
Metadata
repository
December 9, 2021
Types of data warehouses…
Data warehouse with staging area and data marts
Operational
system Purchasing Analysis
Metadata
repository
Inventory
Flat files Data mining
December 9, 2021
Data warehouse schemas and other basics
– Star Schema : A single object (fact table) in the middle connected to a number of
dimension tables
December 9, 2021
Data warehouse schemas and other basics
Star Schema
Sales Fact
Product Dimension
Dimension 1
Store Dimension
December 9, 2021
Data warehouse schemas and other basics
Star Schema
Date ID Product ID
Date ID
Month Prod Name
Year Product ID
Prod Desc
Store ID Category
Store QOH
Customer ID
Store ID
City Unit Sales Customer
State
Dollar Sales Customer ID
Country
Region Cust Name
Cust City
December 9, 2021
Data warehouse schemas and other basics
Snowflake Schema
Year
Quarter
Customer Dimension
Time
Dimension 1
Sub Cat
Store City
Category
State
December 9, 2021
Data warehouse schemas and other basics
Snowflake Schema
Sub Cat
December 9, 2021
Data warehouse schemas and other basics
Fact Constellation
Sales Fact
Forecast Fact
December 9, 2021
Data warehouse schemas and other basics
Fact Constellation Sales Fact Table
Store Date ID
Product ID Product
Store ID
City Store ID
Product ID
State Customer ID Prod Name
Country Unit Sales Prod Desc
Region
Dollar Sales Category
QOH
Forecast Fact Table
Date
Date ID Customer
Date ID
Month ID
Month Customer ID
Year Product ID
Cust Name
Customer ID Cust City
Fcst_Weight_net Cust Country
Measurements
Fcst_Turnover
December 9, 2021
Thank You
December 9, 2021