Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 12

Presented By :-

Jitendra Kumar Sahu


Guided By- M.Sc.(c.s.) IV Sem.
Dr. S.Pavani C.M.D.P.G.
(HOD) College Bilaspur (C.G.)
 A data warehouse is an appliance for storing and
analyzing data, and reporting.
 Central database that includes information from
several different sources.
 Keep current as well as historical data.

“Data Warehouse is a subject oriented, integrated,


time-variant and non-volatile collection of data in
support of management’s decision making process.”
Subject
Oriented

Non- Data
Warehouse Integrated
volatile

Time
variant
 Organized around major subjects, such as
customer, product, sales.
 Focusing on the modeling and analysis of
data for decision makers, not on daily
operations or transaction processing.
 Provide a simple and concise view around
particular subject issues by excluding data
that are not useful in the decision support
process.
 Constructed by integrating multiple, heterogeneous data
sources
- relational databases, flat files, on-line transaction
records
 Data cleaning and data integration techniques are
applied.
-Ensure consistency in naming conventions,
encoding structure, attribute measures, etc. among
different data sources
 E.g., Hotel price: currency, tax, breakfast covered, etc.
-When data is moved to the warehouse, it is converted.
 The time horizon for the data warehouse is
significantly longer then that of operational
systems.
- Operational database: current value data
-Data warehouse data: provide information from a
historical perspective(e.g., past 5-10 year)
 Every key structure in the data warehouse
-Contains an element of time, explicitly or
implicitly
-But the key of operational data may or may not
contain “time element”
 A physically separate store of data transformed from
the operational environment.
 Operational update of data dose not occur in the
data warehouse environment
- Dose not require transaction processing, recovery,
and concurrency control mechanisms
- Requires only two operations in data accessing:
 initial loading of data and access of data
Operational
Data
Detail Summary
User
Meta
Data

External
Data
Data Warehouse
Manager

Archive/Backup Information
Data
Industry Application
Finance Credit card Analysis
Insurance Claims, Fraud Analysis
Telecommunication Call record Analysis
Transport Logistics management
Consumer goods Promotion Analysis
 Enhances end-access to a wide variety of
data.
 Increases data consistency.
 Increases productivity and decreases
computing costs.
 Is able to combine data from different
sources, in one place.
 It provide an infrastructure that could
support changes to data and replication of
the changed data back into operational
systems.
 Extracting, cleaning and loading data could
be time consuming.
 problem with compatibility with systems
already in place e.g. transaction processing
system.
 Providing training to end- users, who end
up not using the data warehouse.
 security could developed into a serious
issue, especially if the if the data warehouse
is web accessible.

You might also like