Professional Documents
Culture Documents
Data Warehouse
Data Warehouse
Definition
“A data warehouse is a collection of computerized
data that is organized to most optimally support
reporting and analysis activity”.
A data warehouse is a repository of an
organization's data, where the informational
assets of the organization are stored and managed,
to support various activities such as reporting,
analysis, decision-making, as well as other
activities such as support for optimization of
organizational operational processes.
Evolution Stages Of Data Warehouse
Offline Operational Databases - Data
warehouses in this initial stage are developed
by simply copying the database of an
operational system to an off-line server where
the processing load of reporting does not
impact on the operational system's
performance.
Offline Data Warehouse - Data warehouses in
this stage of evolution are updated on a regular
time cycle (usually daily, weekly or monthly)
from the operational systems and the data is
stored in an integrated reporting-oriented data
structure
Evolution Stages Of Data Warehouse
Cont.
Real Time Data Warehouse - Data
warehouses at this stage are updated on a
transaction or event basis, every time an
operational system performs a transaction
(e.g. an order or a delivery or a booking
etc.)
Integrated Data Warehouse - Data
warehouses at this stage are used to
generate activity or transactions that are
passed back into the operational systems
for use in the daily activity of the
organization.
Components of a data warehouse
Data Sources: Refers to any electronic repository of
information that contains data of interest for
management use or analytics.
Data Transformation: This layer receives data from
the data sources, cleans and standardizes it, and loads
it into the data repository. This is often called
"staging" data as data often passes through a
temporary database whilst it is being transformed.
This activity of transforming data can be performed
either by manually created code or a specific type of
software could be used called an ETL (Extract,
Transform & Load ) tool.
Data Transformation
Activities occurring during data transformation
Components of a data warehouse
Data Warehouse: The data warehouse need not to
be a relational database, as it must be organized
to hold information in a structure that best
supports not only query and reporting, but also
advanced analysis techniques, like data mining.
Most data warehouses hold information for at
least 1 year and sometimes can reach half
century, depending on the business/operations
data retention requirement. As a result these
databases can become very large.
Components of a data warehouse
Product DescriptionCost
Code
P001 Pants 800
P002 Shirts 600 Product Quantity
Code
P003 T-Shirts 550 P001 200
P001 100
P003 120
Advantages of using data warehouse