BIS 05 1 Foundation of BI

You might also like

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 16

Foundations of Business

Intelligence
Databases and Information
System

Introduction
Redundant, inconsistent and
fragmented data across organization
Challenge in querying and reporting
Presence of data across various
systems and integrating them is a
challenge

Target
Getting timely, accurate and relevant
information
Better data management techniques
with support of good data model

Terminology

Minimum measurement bit


Character is group of bits Byte
Group of characters Field
Group of fields Record
Group of records of same type File
Group of related files/records
Database
Entity and attribute

Traditional File System


Presence of multiple master files
created, maintained by separate
division
Problems
Data Redundancy
Data Inconsistency
Inflexibility
Poor security
Lack of data sharing capabilities

Terminology
Data Redundancy-data stored in
multiple places
Data Inconsistency Same attribute
has different values
Lack of Flexibility Support for adhoc
reports
Poor Security Lack of control on data
Lack of data sharing and availability
Sharing data even if available in
timely manner is challenge.

Database Management
System
Collection of data to server multiple
applications by centralizing data and
eliminating data redundancy &
incosistency
DBMS is a software that permits an
organization to centralize data,
manage them efficiently and provide
access to stored data

Relational DBMS
Stores data in 2 dimensional table
Each row represents a record/tuple
Each row can be uniquely identified
and retrieved by key field
One of the key is designated as
primary key
Relationship with other table
maintained by foreign key

Object-Oriented DBMS
Treat data as objects
Better suited for structured and
unstructured data like voice, video,
images etc
Relatively slow as compared to DBMS
Hybrid ORDBMS is a good solution

DBMS Capabilities
Data Definition capability to specify
structure of the content of database
Data Dictionary stores definition of data
elements and their characteristics
Benefits
Querying and reporting via SQL
CUD operations via DML

Normalization and referential integrity rules


ER model used for designing

Data Warehouse & Data


Mart
Data Warehouse
Database that stores current and historical
data of potential interest to decision makers

Data Mart
Subset of data warehouse in which a
summarized or highly focused portion of the
organizations data is placed in a separate
database

ETL
Extract, Transform and Load

BI Dashboard

BI Tools
Online Analytical Processing(OLAP)
Multidimensional data analysis to view
same data using multiple dimensions
Dimension may be
product,pricing,region,data etc

Data Mining
Discovery-driven, finding patterns and
relationship
Output includes association ,
sequences ,clusters ,
classifications and forecasts

Data Mining O/P


Associations linked to single event,
purchase of chips with cola
Sequences Events are linked over
time, purchase of refrigerator, oven,
tv etc
Classifications-patterns describing
group, credit card to IT employees
/sales managers
Clustering- discovering groups within
data

BI Tools
Text Mining
Analyzing unstructured data like call
center transcripts , survey responses,
email, memo etc.

Web Mining
Finding patterns from contents of web
pages which includes text, image,audio
etc

ETL

You might also like