Professional Documents
Culture Documents
Data Warehousing Design Transparencies
Data Warehousing Design Transparencies
2
Chapter Objectives
The concepts associated with dimensionality
modeling, which is a core technique of
Kimball’s Business Dimensional Lifecycle.
The Dimensional Modeling stage of
Kimball’s Business Dimensional Lifecycle.
The step-by-step creation of a dimensional
model (DM) using the DreamHome case
study.
The issues associated with the development of
a data warehouse.
3
Designing Data Warehouses
To begin a data warehouse project, we
need to find answers for questions such as:
– Which user requirements are most
important and which data should be
considered first?
– Which data should be considered first?
– Should the project be scaled down into
something more manageable?
– Should the infrastructure for a scaled
down project be capable of ultimately
delivering a full-scale enterprise-wide
data warehouse? 4
Designing Data Warehouses
For many enterprises the way to avoid the
complexities associated with designing a
data warehouse is to start by building one
or more data marts.
5
Designing Data Warehouses
6
Designing Data Warehouses
7
Designing Data Warehouses
8
Designing Data Warehouses
9
Data Warehouse Development Methodologies
There are two main methodologies that
incorporate the development of an enterprise
data warehouse (EDW) and these are
proposed by the two key players in the data
warehouse arena.
– Kimball’s Business Dimensional Lifecycle
(Kimball, 2008)
– Inmon’s Corporate Information Factory
(CIF) methodology (Inmon, 2001).
10
Data Warehouse Development Methodologies
11
Kimballs’ Business Dimensional Lifecycle
12
Kimballs’ Business Dimensional Lifecycle
13
Kimballs’ Business Dimensional Lifecycle
14
Kimballs’ Business Dimensional Lifecycle
15
Kimballs’ Business Dimensional Lifecycle
18
Dimensionality modeling
A logical design technique that aims to
present the data in a standard, intuitive
form that allows for high-performance
access
19
Dimensionality modeling
20
Dimensionality modeling
21
Star schema (dimensional model) for property
sales of DreamHome
22
Dimensionality modeling
26
Property sales with normalized version of
Branch dimension table
27
Dimensionality modeling
Predictable and standard form of the
underlying dimensional model offers
important advantages:
– Efficiency
– Ability to handle changing requirements
– Extensibility
– Ability to model common business
situations
– Predictable query processing
28
Comparison of DM and ER models
A single ER model normally decomposes
into multiple DMs.
29
Dimensional Modeling Stage of Kimball’s
Business Dimensional Lifecycle
30
Dimensional Modeling Stage of Kimball’s
Business Dimensional Lifecycle
31
Step 1: Select business process
32
ER model of an extended version of DreamHome
33
ER model of property sales business process of
DreamHome
34
Step 2: Declare grain
Decide what a record of the fact table is to
represents.
37
Step 4: Identify facts
The grain of the fact table determines
which facts can be used in the data mart.
39
Dimensional Modeling Stage of Kimball’s
Business Dimensional Lifecycle
42
Additional design issues
There are three basic types of slowly
changing dimensions:
– Type 1, where a changed dimension
attribute is overwritten
– Type 2, where a changed dimension
attribute causes a new dimension record to
be created
– Type 3, where a changed dimension
attribute causes an alternate attribute to
be created so that both the old and new
values of the attribute are simultaneously
accessible in the same dimension record 43
Kimball’s Business Dimensional lifecycle
45
Data Warehouse Development Issues
46
Data Warehouse Development Issues