Unit 4_Concept of Distributed DBMS
Unit 4_Concept of Distributed DBMS
Introduction
What is a distributed DBMS
Problems
Current state-of-affairs
Background
Distributed DBMS Architecture
Distributed Database Design
Semantic Data Control
Distributed Query Processing
Distributed Transaction Management
Parallel Database Systems
Distributed Object DBMS
Database Interoperability
Current Issues
Distributed DBMS Page 1.1
File Systems
program 1
File 1
data description 1
program 2
data description 2 File 2
program 3
data description 3 File 3
description
Application
program 2 manipulation
(with data database
semantics) control
Application
program 3
(with data
semantics)
Database Computer
Technology Networks
integration distribution
Distributed
Database
Systems
integration
integration ≠ centralization
Distributed DBMS Page 1.4
Distributed Computing
Synonymous terms
distributed function
distributed data processing
multiprocessors/multicomputers
satellite processing
backend processing
dedicated/special purpose computers
timeshared systems
functionally modular systems
Processing logic
Functions
Data
Control
Site 1
Site 2
Site 5
Communication
Network
Site 4 Site 3
Site 1
Site 2
Site 5
Communication
Network
Site 4 Site 3
P1 Pn M
D
P1 Pn
D
M1 Mn
P1 Pn
D1 Dn
M1 Mn
Improved performance
Replication transparency
Fragmentation transparency
horizontal fragmentation: selection
vertical fragmentation: projection
hybrid
PROJ PAY
PNO PNAME BUDGET TITLE SAL
P1 Instrumentation 150000 Elect. Eng. 40000
P2 Database Develop. 135000 Syst. Anal. 34000
P3 CAD/CAM 250000 Mech. Eng. 27000
P4 Maintenance 310000 Programmer 24000
SELECT ENAME,SAL
Tokyo
FROM EMP,ASG,PAY
WHERE DUR > 12
Boston Paris
AND EMP.ENO = ASG.ENO
AND PAY.TITLE = EMP.TITLE Paris projects
Paris employees
Communication Paris assignments
Network Boston employees
Boston projects
Boston employees
Boston assignments
Montreal
New
Montreal projects
York
Paris projects
Boston projects New York projects
New York employees with budget > 200000
New York projects Montreal employees
New York assignments Montreal assignments
Distributed DBMS Page 1.20
Distributed Database - User View
Distributed Database
DBMS Communication
Software Subsystem
User
DBMS User Application
Software Query DBMS
Software
User
Query
Distributed DBMS Page 1.22
Potentially Improved Performance
Parallelism in execution
Inter-query parallelism
Intra-query parallelism
Query Processing
convert user transactions to data manipulation
instructions
optimization problem
min{cost = data transmission + local processing}
general formulation is NP-hard
Concurrency Control
synchronization of concurrent accesses
consistency and isolation of transactions' effects
deadlock management
Reliability
how to make the system resilient to failures
atomicity and durability
Query Distribution
Reliability
Processing Design
Concurrency
Control
Deadlock
Management
Distributed DBMS Page 1.28
Related Issues