Professional Documents
Culture Documents
Distributed Catalog Management
Distributed Catalog Management
Distributed Catalog Management
EDU.)
3RD SEMESTER
M.TECH. 2023 24
ADVANCE DATBASE MANAGEMENT
SYSTEM
Faculty: Dr. Savita Patil
Seminar Topic
DISTRIBUTED CATALOG MANAGEMENT
Presented By: Sandesh Mathpati
USN: SG22ADS014
TOPICS COVERED
• Fragmentation and Replication Intro
• What is DCM
• Why DCM is Used
• Naming Objects
• Catalog Structure
• Distributed data Independency
Fragmentation and replication
FRAGMENTATION
The process of dividing the database into smaller multiple parts
or sub−tables is called fragmentation. The smaller parts or
sub−tables are called fragments and are stored at different
locations.
REPLICATION
Data replication means a replica is made i. e. data is copied at
multiple locations to improve the availability of data. It is used
to remove inconsistency between the same data which result in a
distributed database so that users can do their task without
interrupting the work of other users.
What is distributed
catalog system?
Catalogs are referred to as database systems that contain information about
objects present in the database itself or the database itself that contains
metadata of a distributed database.
LOAD BALANCING
Q U E RY O P T I M I Z AT I O N
Distributed catalog management is essential for load
The catalog plays a role in query optimization by
balancing. It helps in distributing the workload
providing information about the structure of the
across different nodes by providing information
database and the distribution of data.
about the current state and usage of each node.
IMPORTANT FACTORS
• These two fields identify a relation uniquely; we call the combination a global relation
name.
• we take the global relation name and add a replica-id field; we call the combination a
global replica name.
CATALOG Fully Replicated Catalogs Each location site in this plan has identical copies of
the whole catalog. Questions may be answered locally under this system and
STRUCTURE reading can go more quickly. All changes must be distributed across all websites.
Let us see how users can be enabled to access relations without considering how
the relations are distributed
DATA When a user writes a program or SQL statement that refers to a relation, he or she simply uses
the relation name
INDEPENDENCE
Global relation name =
local name + user’s site-id ( birth site )
By looking up the global relation name—in the local catalog if it is cached there or in the
catalog at the birth site—the DBMS can locate replicas of the relation.