Professional Documents
Culture Documents
Bisness Intelligence
Bisness Intelligence
• BI Meaning
• Components of BI
• BI process
• BI Providers
• Functions of BI server
• BI Capabilities
• BI Users
• BI Infrastructure
• BI Tools
• Others
07/24/23 1
Business Intelligence
• IS generates enormous amounts of operational data that contain patterns,
relationships, clusters, and other information that can facilitate management,
especially planning and forecasting.
07/24/23 2
Components of Business Intelligence (BI) Systems
07/24/23 3
Three Primary Activities in the BI Process
07/24/23 4
Acquire Data: Extracted Order Data
• Query
Sales (CustomerName, Contact, Title, Bill Year, Number Orders, Units, Revenue, Source,
PartNumber)
Part (PartNumber, Shipping Weight, Vendor)
07/24/23 5
Sample Extracted Data: Part Data Table
07/24/23 6
Analyze Data
07/24/23 7
Qualifying Parts Query Design
07/24/23 8
Publish Results: Qualifying Parts Query Results
07/24/23 9
What Are the Two Functions of a BI
Server?
Management and delivery
07/24/23 10
BI Providers
07/24/23 11
BI Users
07/24/23 12
BUSINESS INTELLIGENCE USERS
07/24/23 1-13
BUSINESS INTELLIGENCE AND ANALYTICS CAPABILITIES
07/24/23 1-14
BUSINESS INTELLIGENCE AND ANALYTICS CAPABILITIES
07/24/23 1-15
BI Capabilities Example
07/24/23 16
BI Infrastructure
• an array of tools for obtaining useful information from all the
different types of data used by businesses today, including
semi-structured and unstructured big data in vast quantities.
• These capabilities include
– Data warehouse and data mart
– Hadoop,
– in-memory computing, and
– analytical platforms.
07/24/23 17
07/24/23 18
Hadoop
• For handling unstructured and semi-structured data
in vast quantities, as well as structured data, organizations are
using Hadoop.
07/24/23 19
Key services:
• Hadoop consists of several key services:
– the Hadoop Distributed File System (HDFS) for data
storage and
– MapReduce for high-performance parallel data
processing.
– HDFS links together the file systems on the numerous
nodes in a Hadoop cluster to turn them into one big file
system.
– Hadoop’s MapReduce was inspired by Google’s
MapReduce system for breaking down processing of huge
datasets and assigning work to the various nodes in a
cluster.
07/24/23 20
• HBase, Hadoop’s non-relational database, provides rapid
access to the data stored on HDFS and a
transactional platform for running high-scale real-time
applications.
• Hadoop can process large quantities of any kind of data,
including structured transactional data, loosely structured
data such as Facebook and Twitter feeds, complex data such
as Web server log files, and unstructured audio and video
data.
• Hadoop runs on a cluster of inexpensive servers, and
processors can be added or removed as needed. Companies
use Hadoop for analyzing very large
21
In-Memory Computing
• Another way of facilitating big data analysis is to use in-memory
computing, which relies primarily on a computer’s main memory (RAM) for
data storage.
(Conventional DBMS use disk storage systems.)
07/24/23 22
• Leading commercial products for in-memory computing
include SAP’s High Performance Analytics Appliance (HANA)
and Oracle Exalytics.
07/24/23 23
• Centrica, a gas and electric utility, uses HANA to quickly
capture and analyze the vast amounts of data generated by
smart meters.
07/24/23 24
Analytic platforms
• Commercial database vendors have developed specialized
high-speed analytic platforms using both relational and non-
relational technology that are optimized for analyzing large
datasets.
• These analytic platforms such as
– IBMNetezza and Oracle Exadata, feature preconfigured hardware-
software systems that are specifically designed for query processing
and analytics
07/24/23 25
Analytic platforms contd…
• For example,IBM Netezza features tightly integrated
database, server, and storage components that
handle complex analytic queries 10 to 100 times
faster than traditional systems.
07/24/23 26
BI Tools
1. Reporting Tools
• Integrate data from multiple systems
2. Data-mining Tools
• Used to discover hidden patterns and relationships
• Use sophisticated statistical techniques, regression analysis, and decision
07/24/23 27
Reporting Tools
• Reporting tools produce information from data using
five basic operations:
• Sorting
• Grouping
• Calculating
• Filtering
• Formatting
07/24/23 1-28
RFM Analysis
07/24/23 29
RFM Analysis ….
1-30
07/24/23
Interpreting RFM Score result ….
07/24/23 1-31
Online Analytical processing (OLAP)
07/24/23 1-32
Online Analytical processing (OLAP)
07/24/23 1-33
OLAP Features….
• Dynamic
• User can change report structure
• View online
• Dimension
• Characteristic of measure—purchase date, customer
type, location, sales region
07/24/23 1-34
OLAP operation
i. Drill Down
ii. Consolidation
07/24/23 1-35
OLAP Consolidation Data
07/24/23 1-36
OLAP Drill Down
07/24/23 1-37
Data Mining
07/24/23 1-38
Data Mining Tools
07/24/23 1-39
Data Mining Tools/Techniques
07/24/23 1-40
Supervised data mining …
•Examples:
..Regression analysis—measures impact of set of variables
on one another
..Used for making predictions
07/24/23 1-41
Supervised data mining… Neural Network
07/24/23 42
Unsupervised data mining….
• Analysts do not create model before running analysis. i.e.
07/24/23 1-43
Unsupervised data mining…Decision tree
07/24/23 44
Decision tree
07/24/23 1-45
Create Set of If/Then Decision Rules
07/24/23 1-46
Market Basket Analysis (MBA)
07/24/23 1-47
Hypothetical sales data of 1000 items at a Dive shop
07/24/23 1-48
Market-Basket terminologies
Support
..Probability that two items will be bought together
..Fins and masks purchased together 150 times,
thus support for fins and a mask is 150/1,000, or
15 percent
..Support for fins and weights is 60/1,000, or 6
percent
..Support for fins along with a second pair of fins is
10/1,000, or 1 percent
07/24/23 49
Market-Basket terminologies
Confidence
..What proportion of the customers who bought a mask also
bought fins?
..Conditional probability estimate
• Example:
» Probability of buying fins = 28%
» Probability of buying swim mask = 27%
• After buying fins,
» Probability of buying mask = 150/270 or 55.56%
• ..Likelihood that a customer will also buy fins almost doubles, from
28% to 55.56%.
• Thus, all sales personnel should try to sell fins to anyone buying a
mask.
07/24/23 1-50
Market-Basket terminologies…
Lift
..Ratio of confidence to base probability of buying
item
..Shows how much base probability increases or
decreases when other products are purchased
•Example:
..Lift of fins and a mask is confidence of fins given
a mask, divided by the base probability of fins.
..Lift of fins and a mask is .5556/.28 = 1.98
07/24/23 1-51