Download as pdf or txt
Download as pdf or txt
You are on page 1of 73

Department of Computer Science and Engineering (CSE)

DATA WAREHOUSING

Prepared By: Vasudha (E7382)

University Institute of Engineering (UIE) 1


Department of Computer Science and Engineering (CSE)

What is a Data Warehouse?


“A data warehouse is simply a single, complete, and
consistent store of data obtained from a variety of
sources and made available to end users in a way
they can understand and use it in a business
context.”

University Institute of Engineering (UIE) 2


Department of Computer Science and Engineering (CSE)

Definition
➢Bill Inmon, considered to be the father of Data Warehousing
provides the following definition:
“A Data Warehouse is a subject oriented, integrated, non-volatile,
and time variant collection of data in support of management’s
decisions.”
➢Defining Features are
▪Subject Oriented
▪Integrated
▪Non Volatile
▪Time Variant
▪Data Granularity
➢ Data Warehousing
The process of constructing and using data warehouses.
University Institute of Engineering (UIE) 3
Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 4


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 5


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 6


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 7


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 8


Department of Computer Science and Engineering (CSE)

Information Systems: Relational DB vs. DW

University Institute of Engineering (UIE) 9


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 10


Department of Computer Science and Engineering (CSE)

ETL : Extract, Transform and Load

University Institute of Engineering (UIE) 11


Department of Computer Science and Engineering (CSE)

Data Mart

Data Mart 3

University Institute of Engineering (UIE) 12


Department of Computer Science and Engineering (CSE)

Types of Data Mart

University Institute of Engineering (UIE) 13


Department of Computer Science and Engineering (CSE)

Metadata

University Institute of Engineering (UIE) 14


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 15


Department of Computer Science and Engineering (CSE)

Data Warehouse: Building Blocks

University Institute of Engineering (UIE) 16


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 17


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 18


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 19


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 20


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 21


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 22


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 23


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 24


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 25


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 26


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 27


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 28


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 29


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 30


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 31


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 32


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 33


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 34


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 35


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 36


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 37


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 38


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 39


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 40


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 41


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 42


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 43


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 44


Department of Computer Science and Engineering (CSE)

Data Warehousing - OLAP


➢Online Analytical Processing Server (OLAP) is
based on the multidimensional data model.
➢It allows managers, and analysts to get an
insight of the information through fast,
consistent, and interactive access to information.

University Institute of Engineering (UIE) 45


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 46


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 47


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 48


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 49


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 50


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 51


Department of Computer Science and Engineering (CSE)

OLAP Characteristics

▪They use multidimensional data analysis techniques.

▪They provide advanced database support.

▪They provide easy-to-use end-user interfaces.

▪They support the client/server architecture.

University Institute of Engineering (UIE) 52


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 53


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 54


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 55


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 56


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 57


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 58


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 59


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 60


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 61


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 62


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 63


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 64


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 65


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 66


Department of Computer Science and Engineering (CSE)

Relational OLAP (ROLAP)


•Relational On-Line Analytical Processing (ROLAP) work mainly for
the data that resides in a relational database, where the base data
and dimension tables are stored as relational tables.
•ROLAP servers are placed between the relational back-end server
and client front-end tools. ROLAP servers use RDBMS to store and
manage warehouse data, and OLAP middleware to support missing
pieces.

Advantages of ROLAP
1 ROLAP can handle large amounts of data.
2 Can be used with data warehouse and OLTP systems.
Disadvantages of ROLAP
1 Limited by SQL functionalities.
2 Hard to maintain aggregate tables.
University Institute of Engineering (UIE) 67
Department of Computer Science and Engineering (CSE)

Multidimensional OLAP (MOLAP)


•Multidimensional On-Line Analytical Processing (MOLAP) support
multidimensional views of data through array-based
multidimensional storage engines.
•With multidimensional data stores, the storage utilization may be
low if the data set is sparse.

Advantages of MOLAP
1 Optimal for slice and dice operations.
2 Performs better than ROLAP when data is dense.
3 Can perform complex calculations.
Disadvantages of MOLAP
1 Difficult to change dimension without re-aggregation.
2 MOLAP can handle limited amount of data.
University Institute of Engineering (UIE) 68
Department of Computer Science and Engineering (CSE)

Multidimensional OLAP (MOLAP)


•Hybrid On-Line Analytical Processing (HOLAP) is a combination of
ROLAP and MOLAP.
•HOLAP provide greater scalability of ROLAP and the faster
computation of MOLAP.

Advantages of HOLAP
1 HOLAP provide advantages of both MOLAP and ROLAP.
2 Provide fast access at all levels of aggregation.

Disadvantages of HOLAP
1 HOLAP architecture is very complex because it support both
MOLAP and ROLAP servers.

University Institute of Engineering (UIE) 69


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 70


Department of Computer Science and Engineering (CSE)

University Institute of Engineering (UIE) 71


Department of Computer Science and Engineering (CSE)

Conclusion
• A Data Warehouse is a collection of integrated subject-
oriented databases designed to support a DSS.
– Each unit of data is non-volatile and relevant to some moment in
time.

• An operational data store (ODS) stores data for a specific


application. It feeds the data warehouse a stream of desired
raw data.

• A data mart is a lower-cost, scaled-down version of a data


warehouse, usually designed to support a small group of
users (rather than the entire firm).

• The metadata is information that is kept about the


warehouse.

University Institute of Engineering (UIE)


Department of Computer Science and Engineering (CSE)

Thank You…

University Institute of Engineering (UIE)

You might also like