Download as ppt, pdf, or txt
Download as ppt, pdf, or txt
You are on page 1of 23

Distributed Database

Management Systems
Evolution of DDBMS
Decentralized database management systems (DDBMS)
Interconnected computer systems
Data/processing functions reside on multiple sites
1970s: Centralized DBMS
1980s: Social and Technical Changes
Ad hoc capability required
Decentralized management structure common
1990s: New forces
Internet and the World Wide Web used for data access and
distribution
Data analysis through data mining and data warehousing
2
DDBMS Advantages
Data located near site with greatest demand
Faster data access
Faster data processing
Growth facilitation
Improved communications
Reduced operating costs
User-friendly interface
Less danger of single-point failure
Processor independence

3
DDBMS Disadvantages
Complexity of management and control
Security
Lack of standards
Increased storage requirements
Greater difficulty in managing data environment
Increased training costs

4
Distributed Processing
Shares databases logical processing among
physically, networked independent sites

5
Distributed Database
Stores logically related database over physically
independent sites

6
Distributed Database
vs. Distributed Processing
Distributed processing
Does not require distributed database
May be based on a single database on single
computer
Copies or parts of database processing functions
must be distributed to all data storage sites
Distributed database
Requires distributed processing
Both
Require a network to connect components
7
Functions of DDBMS
Application/end user interface
Validation to analyze data requests
Transformation to determine request components
Query optimization to find the best access
strategy
Mapping to determine the data location
I/O interface to read or write data
Formatting to prepare the data for presentation
Security to provide data privacy
Backup and recovery
DB Administration
Concurrency Control
Transaction Management
8
Fully Distributed Database
Management System

9
DDBMS Components
Computer workstations
Network hardware and software components
Communications media
Transaction processor (TP)
Also called application manager (AP) or
transaction manager (TM)
Data processor (DP)
Also called data manager (DM)

10
Distributed Database Components

Figure 10.5
11
DDBMS Protocols
Interface with network to transport data and
commands between DPs and TPs
Synchronize data received from DPs and route to
appropriate TPs
Ensure common database functions
Security
Concurrency control
Backup and recovery

12
Levels of Data and Process
Distribution
Database systems can be classified based on
process distribution and data distribution

Table 10.1

13
Single-Site Processing, Single-Site
Data (SPSD)

All processing on single CPU or host computer


All data are stored on host computer disk
DBMS located on the host computer
DBMS accessed by dumb terminals
Typical of mainframe and minicomputer DBMSs
Typical of 1st generation of single-user
microcomputer database

14
Multiple-Site Processing, Single-Site
Data (MPSD)
Requires network file server
Applications accessed through LAN
Variation known as client/server architecture

15
Multiple-Site Processing,
Multiple-Site Data (MPMD)
Fully distributed DDBMS with support for multiple
DPs and TPs at multiple sites
Homogeneous I
Integrate one type of centralized DBMS over the
network
Heterogeneous
Integrate different types of centralized DBMSs over a
network

16
Distributed DB Transparency
Allows end users to feel like only database user
Hides complexities of distributed database
Transparency features
Distribution
Transaction
Failure
Performance
Heterogeneity

17
Distributed Concurrency Control
Multisite, multiple-process operations more likely
to create data inconsistencies and deadlocked
transactions
Problems
Transaction committed by local DP
One DP could not commit transactions result
Yields inconsistent database

18
Two-Phase Commit Protocol
DO-UNDO-REDO protocol
Write-ahead protocol
Two kinds of nodes
Coordinator
Subordinates
Phases
Preparation
Coordinator sends message to all subordinates
Confirms all are ready to commit or abort
Final Commit
Ensures all subordinates have committed or aborted
19
Performance Transparency
and Query Optimization
Objective: Minimize total cost associated with
execution of request
Main costs
Access time
Communication
CPU time
Basis for query optimization algorithms
Optimum execution order
Sites accessed to minimize communication costs
Dynamic or static optimization
Statistically based vs. rule-based query
optimization algorithms
20
Distributed Database Design
Partition database into fragments
Horizontal
Vertical
Mixed
Fragments to replicate
Storage of data copies at multiple sites
Fully, partially, unreplicated databases
Data allocation
Where to locate data
Centralized, partitioned, replicated
21
Client/Server Advantages Over DDBMS
Client/server less expensive
Client/server solutions allow use of
microcomputers GUI
More people with PC skills than mainframe skills
PC is well established in workplace
Numerous data analysis and query tools exist
Considerable cost advantages to off-loading
application development

22
Client/Server Disadvantages
Creates more complex environment with different
platforms
Increased number of users and sites creates
security problems
Training issues become more complex and
expensive

23

You might also like