Professional Documents
Culture Documents
Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
1
How to leverage all the data that
organizations collect and store?
Answer
• Data warehousing
• Data acquisition (access)
• Data mining
• Online analytical processing (OLAP)
or Business Analytics
• Data Visualization
2
Data, Information, Knowledge
• Data
– Items that are the most elementary descriptions
of things, events, activities, and transactions
– May be internal or external
• Information
– Organized data that convey meaning and value
• Knowledge
– Processed data or information that conveys
understanding, experience, accumulated
learning and expertise applicable to a problem
or activity
3
What kinds of Data Issues
organizations deal with?
• Multiple sources
• Wide time frame
• Data reduction (aggregation)
• Various levels of detail
• Various amounts of data
• Varying degrees of accuracy
• Provide random access to database
• Security and private databases
• End user interface (i.e., ability to interface two or
more databases at a time)
4
5
Preparing data for Warehousing
• Cleanse data
– When populating warehouse
– Data quality action plan
– Best practices for data quality
– Measure results
• Data integrity issues
– Uniformity
– Version
– Completeness check
– Conformity check
– Genealogy or drill-down
6
Preparing data for Warehousing
• Data Integration
• Access needed to multiple sources
– Often enterprise-wide
– Disparate and heterogeneous databases
– XML becoming language standard
• Web (external data source)
– Intelligent agents
– Document management systems
– Content management systems
• External Commercial databases
– Sell access to specialized databases
7
Database Models
• Hierarchical
– Top down, like inverted tree
– Fields have only one “parent”, each “parent” can have multiple
“children”
– Fast
• Network
– Relationships created through linked lists, using pointers
– “Children” can have multiple “parents”
– Greater flexibility, substantial overhead
• Relational
– Flat, two-dimensional tables with multiple access queries
– Examines relations between multiple tables
– Flexible, quick, and extendable with data independence
• Object oriented
– Data analyzed at conceptual level
– Inheritance, abstraction, encapsulation
8
Database Models
• Multimedia Based
– Multiple data formats
• JPEG, GIF, bitmap, PNG, sound, video, virtual reality
– Requires specific hardware for full feature
availability
• Document Based
– Document storage and management
• Intelligent
– Intelligent agents and ANN
• Inference engines
9
What is Data Warehousing?
10
11
Business Intelligence and
Analytics
• Business intelligence
– Acquisition of data and information for
use in decision-making activities
• Business analytics
– Models and solution methods
• Data mining
– Applying models and methods to data to
identify patterns and trends
12
What is Multidimensionality?
13
What is OLAP?
• A database-oriented DSS which uses data warehouse and a
set of tools usually with multidimensional capabilities to aid
in reporting, querying and data analysis.
• Activities performed by end users in OLAP systems
– Specific, open-ended query generation
• SQL
– Requesting Ad hoc reports
– Conducting Statistical and other (e.g. data mining) analyses
– Building DSS applications
• Modeling and visualization capabilities
• OLAP tools fall into four product groups:
– Multidimensional spreadsheets
– Multidimensional query & report writing tools for standard
RDMS (e.g., Business Objects)
– Fully multidimensional DBMS
– Visual information access systems
14
What is Data Mining?
16
What is an Intelligent Database?
18
Analytic systems
20
21
Web Analytics/Intelligence
• Web analytics
– Application of business analytics to Web
sites
• Web intelligence
– Application of business intelligence
techniques to Web sites
22