Download as ppt, pdf, or txt
Download as ppt, pdf, or txt
You are on page 1of 11

External schema-

logical data independence

physical data independence

1
THE HIERARCHY OF DATA – bits, characters, fields, records, files
Character – basic building block of information consisting of letters,
Digits and special symbols
Field – typically a name, number or combination of characters that
Describes an aspect of business object or activity
Record – a collection of related data fields
File – a collection of related records
Entity – generalized class of people, places or things for which data is
Collected, stored and maintained
Attribute – characteristic feature of the entity
Data item – specific value of the attribute
Key – a field or set of fields in a records used to identify the record
Primary key - a field or set of fields in a records used to
uniquely identify the record

2
TRADITIONAL APPROACH TO DATA MANAGEMENT
Data redundancy – repetitive data
Data integrity – uniform modifications affected
Program data dependence – incompatibility between diff. applns

DATABASE APPROACH TO DATA MANAGEMENT


DBMS – a group of programs that manipulate the database and provide
an interface between the database, user and other application programs

3
ADVANTAGES OF DB APPROACH
• Accurate data available at fingertips
• Reduced data redundancy
• Better data integrity
• Easy modification
• Data and program independence
• Easy access to data and information
• Standardization of data access
• Modular framework for application development
• Better overall protection of data
• Shared and centralized data and information

DISADVANTAGES OF DB APPROACH
• High cost of purchase and operation in mainframes
• Specialized staff required, minimal training insufficient
• Data is more vulnerable to trespassers
4
DATA MODELLING
Hierarchical – data organized in a top-down or
Inverted tree structure
Network – expansion of Hierarchical to express owner-member
Relationship, one member can have multiple owners
Relational – data described so that all data elements are placed in
2 dimensional tables, called relations (files)
Domain – allowable values for data items
Data normalization – process of converting a complex set of data into
A set of simple 2 dimensional tables
Selection – eliminating rows
Projection – eliminating columns
Joining – combines 2 or more tables
Linking – combines 2 or more tables using common attributes to form
a 3rd table with only unique data attributes
Data cleanup – getting rid of inconsistencies to have accurate data
5
DBMS
Schema – description of the entire database
Sub-schema – subset of schema
DDL – a collection of instructions and commands used to define and
describe data and data relationships in a specific database
DML – commands used to manipulate data in the DB
Data dictionary – detailed description of data used in the database

USES OF DATA DICTIONARY


•Provides standard definitions for data description
•Assists programmers to define data elements
•Simplifies DB modifications
•Reduced unnecessary data
•More reliability
•Faster application development

6
THE DATABASE ADMINISTRATOR (DBA)
•Highly skilled and trained systems professional who directs or performs
all activities related to maintaining a successful database environment
• designs, implements, maintains the DB system
•Establishes policies and procedures pertaining to mgmt, security,
Maintenance and use of DBMS
•Imparts training to DBMS users

SELECT A DBMS BASED ON


•Cost
•Vendor
•Database size
•Number of concurrent users
•Performance
•Integration
Ex: Oracle, Ingress, Sybase, MS-Access, Paradox, Lotus
7
Application server – Software package most often written in Java for
use on computers with Windows NT OS, connects end users to the
databases and monitors system performance
Data warehouse – it is a database that collects business information from
many sources in the enterprise about the processes, products, customers
and vendors

ALL DATABASES

DATA EXTRACTION

DATA CLEANUP

QUERIES,
DATA WAREHOUSE ANALYSIS TOOLS END USER
8
Characteristics of a data warehouse :
•Organization
•Consistency
•Time variant
•Nonvolatile
•Relational
•Client-server
•Web-based

Data Mart – It is a subset of a data warehouse – section wise,


Department wise . Eg. Finance, HR, inventory, production, vendor
Development – Replicated/Stand alone

9
Data Mining is an information analysis tool that involves the automated
discovery of patterns and relationships in a data warehouse
•Extracts patterns, trends and rules from DW to evaluate business
strategies for profits and enhanced business performance
Predictive analysis
Business Intelligence – The process of collecting the right information
at the right place and at the right time and analyzing it to have a positive
Impact on the business
Competitive Intelligence - A continuous process that involves legal
and ethical collection of information and analysis for the right decision
Makers,Counterintelligence - Action taken by an organization to protect
Information gathered by ‘hostile’ info collectors
Knowledge management – Process of capturing a company’s collective
Expertise to make a profit (computers, on paper, in human minds, etc)
Knowledge management is a concept in which an enterprise consciously
and comprehensively gathers, organizes, shares, and analyzes its
knowledge in terms of resources, documents, and people skills. 10
Distributed database – A DB in which data can be spread across several
smaller databases connected via telecommunication systems
Replicated DB
OLAP – Online analytical processing – graphs, sales performance
ODBC standards for data to have compatibility across several databases
OORDBMS – ODBC and additional data types like audio can be created,
Hypermedia, Spatial Data Technology

11

You might also like