0% found this document useful (0 votes)
12 views12 pages

Autonomy in Distributed Database Systems

The document discusses distributed databases and distributed database management systems. It defines key concepts like distribution, autonomy, and heterogeneity. It also describes local and global transaction steps in distributed DBMS architectures. Advantages include organizational structure, availability, reliability and performance. Disadvantages include increased complexity, cost, security and integrity challenges.

Uploaded by

Shazia wasim
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
12 views12 pages

Autonomy in Distributed Database Systems

The document discusses distributed databases and distributed database management systems. It defines key concepts like distribution, autonomy, and heterogeneity. It also describes local and global transaction steps in distributed DBMS architectures. Advantages include organizational structure, availability, reliability and performance. Disadvantages include increased complexity, cost, security and integrity challenges.

Uploaded by

Shazia wasim
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd

Distributed DB & DBMS

Concepts

Distributed Database
A logically interrelated collection of shared data, physically distributed
over a computer network.

Distributed DBMS
Software system that permits the management of the distributed
database and makes the distribution transparent to users.
Main Characteristics

• Data at multiple sites


• DM at each site.
• Local requirements
• Global perspective
Distributed Processing
Concepts
• Distribution
– Refers to the distribution of data. i.e. physical distribution of data over
multiple sites that looks as one logical pool to the user.
– Client/Server distribution
– Peer-to-peer system: Each machine has full DBMS functionality
• Autonomy
– Autonomy is the distribution of control, not data.
– It is the level to which individual DBMSs can operate independently.
– The local operations of the individual DBMSs are not affected by their
participation in the multidatabase system.
– The queries process by individual DBMSs should not be affected by the
execution of global queries
Concepts
• Heterogeneity
– Heterogeneity may occur in various form in distributed system, ranging from
hardware heterogeneity and differences in networking protocols, data model,
query language and transaction management protocols

• Homogeneous DDBMS
– All sites use same DBMS product.
– Much easier to design and manage.
Distributed DBMS Architecture
showing Local Transaction Steps

3 5

Local transaction – all data stored locally


Local Transaction Steps

• Application makes request to distributed DBMS


• Distributed DBMS checks distributed data repository for
location of data. Finds that it is local
• Distributed DBMS sends request to local DBMS
• Local DBMS processes request
• Local DBMS sends results to application
Distributed DBMS architecture
showing global transaction steps

2
3
1
7 6
8
4

Global transaction – some data is at remote site(s)


Global Transaction Steps

• Application makes request to distributed DBMS


• Distributed DBMS checks distributed data repository for location of data. Finds
that it is remote
• Distributed DBMS routes request to remote site
• Distributed DBMS at remote site translates request for its local DBMS if
necessary, and sends request to local DBMS
• Local DBMS at remote site processes request
• Local DBMS at remote site sends results to distributed DBMS at remote site
• Remote distributed DBMS sends results back to originating site
• Distributed DBMS at originating site sends results to application
Advantages of DDBMS

• Organizational Structure : Organizations naturally distributed


• local autonomy :Data can be placed at the site close to the user
• Improved availability : Continue function despite failure
• Improved Reliability : Data replication
• Improved Performance: Data near the site (access speed factor)
• Economics :Data transmission cost (centralized vs Distributed)
• Modular Growth :Adding new sites is easy without effecting the operation of
other site
Disadvantages of DDBMS

• Complexity : Hides distributed nature from user


• Cost :Maintenance cost increase with complexity
• Security: Multiple locations
• Integrity: Communication and processing costs
• Lack of Standards: No tools to convert centralized DBMS to distributed DBMS
• Lack of Experience: Not much general purpose DDBMS
• Database Design more complex : Replication

Common questions

Powered by AI

The costs of deploying and maintaining a distributed DBMS can exceed those of a centralized system due to the complexity and need for additional resources to manage multiple locations. Each site typically requires separate hardware, software, and network infrastructure. Furthermore, ensuring data consistency, integrity, and security across distributed sites can incur higher development and operational overheads. The lack of standardization tools for converting centralized to distributed systems further elevates costs .

A homogeneous distributed database management system offers the advantage of using the same DBMS product across all sites, which greatly simplifies design and management processes. As all sites operate using the same software, compatibility issues due to differences in data models, query languages, or transaction management protocols are reduced. This uniformity reduces the complexity often associated with data distribution and query processing in heterogeneous environments, enhancing the ease of implementation and maintenance .

Distributed systems experience heterogeneity in forms such as differences in hardware, networking protocols, data models, query languages, and transaction management protocols. These variations necessitate additional layers to mediate interactions, increasing complexity in design and implementation. The heterogeneity complicates the integration and coordination of various components, requiring specialized solutions to ensure interoperability and consistent performance in a distributed DBMS environment .

In a distributed DBMS, local transactions are processed entirely at the site where the data is stored. The system checks if the data is local, processes the request via the local DBMS, and sends the results to the application. Global transactions, however, involve data at remote sites. The distributed DBMS must route requests to these remote sites, process them there, and then send results back to the original site. This additional communication and coordination in global transactions can introduce latency and overhead, potentially impacting performance compared to local transactions, which are more straightforward and quicker .

A distributed database system improves availability and reliability by distributing data across multiple sites, allowing the system to continue functioning even if one or several sites fail. Data replication ensures that each piece of data is stored at multiple locations, so if a site goes offline, the system can access data from another site, maintaining service continuity. This capability contrasts with centralized systems, where a single point of failure can halt operations .

Autonomy in a distributed database system allows each DBMS to operate independently, which enables local transactions to be processed without being affected by global queries. This independence ensures that the local operations can continue efficiently even when there are broader system-wide queries. Consequently, this separation minimizes the complexity and interdependence, aiding effective global query processing as the distributed DBMS can coordinate among various DBMSs for global data requests without interfering with local processes .

Modular growth in distributed database systems allows companies to scale their database infrastructure more flexibly and cost-effectively. Businesses can add new sites as their data or transactional needs increase without interrupting the current system, thus accommodating business expansion seamlessly. This contrast with traditional centralized databases, where scaling often involves substantial reconfiguration and can disrupt existing operations. The modular nature also supports local autonomy, ensuring that expansions are aligned with geographic or departmental needs .

A DDBMS has several main characteristics: multiple sites storing data, each site having data management capabilities, accommodating local requirements, and offering a global perspective. This supports organizational needs by aligning the database structure with the natural decentralized structure of organizations. It allows for local autonomy where data can be managed at sites close to users, thus improving reliability and performance by reducing latency and enhancing data availability through replication. Additionally, the modular growth feature supports scalability as new sites can be added without disrupting existing operations .

Security in distributed DBMS is a significant concern due to the multiple locations involved, introducing vulnerability to unauthorized access and data breaches at numerous entry points. Unlike centralized systems where data security can be managed at a single point, in distributed systems, securing data requires comprehensive strategies across all sites. This dispersal increases the complexity of implementing consistent security policies and controls, and managing authentication and authorization across a distributed network can be challenging .

The absence of standardized tools for converting centralized DBMS to distributed DBMS presents significant challenges due to the heterogeneity and complexity involved in distributed environments. Without standardization, each transition needs custom solutions, which requires additional time, expertise, and resources. This lack of standards implies that consistency across distributed sites can be difficult to achieve, increasing the potential for errors or discrepancies in data processing and management, thus complicating the transition process .

You might also like