University of Petroleum & Energy Studies: 2021-22 Batch

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 2

UNIVERSITY OF PETROLEUM & ENERGY STUDIES

2021-22 Batch
L T P C
Data Warehousing & Multidimensional Modeling
Version 1.0 3 0 0 3

Pre-requisites/Exposure Fundamental knowledge of DBMS and data structure

Co-requisites --

Unit 1. Introduction to Data Warehousing


An introduction to data warehousing, Data warehouse architectures, Two-layer architecture, Three-layer
architecture, Data staging and ETL, Decision support applications, Data warehouse functional definitions.

Unit 2. Data Warehousing and Modeling


Data warehouse & modeling, Data warehouse modeling, Types of data warehouse models, Data
warehouse implementation, Normal system for metadata sharing, Data warehouse environment, Data
warehouse and data mart, Operational data store, Data warehouse data modeling styles, OLTP data model
considerations, Data warehouse usage, Data mart, Warehouse modeling approaches, Global data
warehouse architecture, Independent data mart architecture, Approaches to implement the architecture,
Data warehouse delivery process, Data warehouse modelling: Techniques and guidelines, ER modeling,
The ER model, Normalization: First normal form, Normalization: Second normal form, Normalization:
Third normal form, OLAP (Online Analytical Processing), Multi-dimensional data analysis, Drill-down and
roll-up.

Unit 3. Multi - Dimensional Modeling - Methodology


Requirements analysis: Base activities, Representation of the query as a cube, Requirements modeling,
Measures, Candidate measures, Dimensions, The grain of a dimension, Granularity of a measure, Facts,
Terminologies in a multi-dimension model, Advantage of multi-dimensional modelling, Multi-dimensional
queries, Dimension entities and attributes, Dimension hierarchies, Aggregation levels, MD model
structures, Star schema, Snowflake schema, Example for snowflake schema, Considerations star and
snowflake, Hints, tips, and guidelines, Business-related facts, What is fact constellation schema?
Applications for data warehouse, Fact identifiers, dimension keys and uniqueness, Identifiers in an MD
model, Dimension roles, Requirements analysis, Solution validation, Base activities, Detailed dimension
modeling, Why needed? Detailed dimension modeling approach, Detailed dimension modeling
considerations.

Unit 4. Non-Temporal Design - R-OLAP


R-OLAP, OLAP query against RDBMS (ROLAP), R-OLAP design techniques, OLAP, Fundamental OLAP
analytical practices, Types of OLAP systems, Architecture of ROLAP, Star schema and star join queries, Star
B.TECH (CSE) with Specialization in Data Science Page 125 of 138

This document is the Intellectual Property of University of Petroleum & Energy Studies and its contents
are .
UNIVERSITY OF PETROLEUM & ENERGY STUDIES
2021-22 Batch
join support, Efficient star join, Dynamic bitmap index ANDing DB2 UDB for distributed platforms, Star
join: DB2 UDB for distributed platforms, Star join: Snowflake schema context, Collapse dimensions, Effects
of dimension collapse, Dimension split, Redundant attributes in facts, Fact-oriented design techniques,
Fact aggregation, Consolidated facts, Utilize cubing services to improve R-OLAP and M-OLAP performance,
Cubing services architecture, Cubing services performance and scalability, Scalability, Cubing services
security, Benefits of cubing services: Improved data mart performance, Data marts, Data marts:
Materialized query tables, Materialized query table example, Performance with MQT refresh options,
Implementation example: MQT, Multi-dimension cluster tables, MDC table example, Terminology-
dimension, Terminology -slice, Terminology-cell, Block based index: Dynamic bitmap index ANDing, Query
processing examples, MDC performance example, Example: Object size comparisons, Power of block
prefetch coupled, Performance results for a query, Performance results from a query, Example: Index
oring of block and RID indexes, Example: Joins with block index.

Unit 5. Non-Temporal Design - MOLAP


Introduction to MOLAP, MOLAP framework, M-OLAP architecture, OLAP schemes types, Cognos
functional architecture, Cognos Transformer: Functions, Storage method, Hyperion, Hyperion data load,
Hyperion: data load rules, Hyperion calculation storage types, MOLAP characteristics, ROLAP
characteristics, Online data analysis.

Unit 6. Temporal Modeling and Design


Keeping history about dimensions, Customer attrition: Business requirements, Customer attrition:
Exercise, Customer attrition: Base analysis, Customer attrition: Solution, Primary key change, The
historical customer, Customer address: Business requirement, Customer address: Exercise, Customer
address: Solution, Issues, Time normalization, Customer address: Time normalized, Neighborhoods:
Business requirements, Neighborhoods: Exercise, Neighborhoods: Base solution, Neighborhoods: Review,
Neighborhoods: HTK solution, Customer classes: Business requirement, Customer classes: Exercise,
Customer classes: Proposed solution.

Unit 7. Process Architecture: Data Warehouse


Process architecture: Data warehouse, Contrasting OLTP and data warehousing environments, Common
data warehouse tasks, Architectures of the data warehouse, Centralized process architecture, Distributed
process architecture, Peer-to-Peer architecture, Data warehouse tools, 25 BEST data mining tools in 2020,
18 BEST reporting tools in 2020, IBM data warehouse case study: American Airlines.

B.TECH (CSE) with Specialization in Data Science Page 126 of 138

This document is the Intellectual Property of University of Petroleum & Energy Studies and its contents
are .

You might also like