0% found this document useful (0 votes)

11 views

DDBMS Design

Distributed Database design is discussed here.

Uploaded by

debjit7864

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views

DDBMS Design

Distributed Database design is discussed here.

Uploaded by

debjit7864

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

You are on page 1/ 5

DDBMS Design Principle

Design of centrally located Database (CD) involves following:

 Design the conceptual (all the data which are used by the database applications)
schema
 Design the physical database (mapping the conceptual schema to storage areas
and determining appropriate access methods)

Design of distributed database (DD):

Two abovementioned steps are done by

 Designing of global schema
 Design of the local physical databases at each site
Additional steps for distribution of database
 Design fragmentation (logical concept)
 Design allocation (physical concept) of fragments including replication decision

Points to remember
 Although the design of application programs is made after schema design, the
knowledge of application requirements influences schema design, since schemata
must be able to support applications efficiently.

 The site from which the application is issued is called site of origin of the
application.
Objectives of data distribution design

 Processing locality – Place data as close as possible to the applications which use
them.
 Availability and reliability – A high degree of availability for read-only
applications is achieved by storing multiple copies of the same information.
Reliability is also achieved by storing multiple copies of the same information,
since it is possible to recover from crashes of one of the copies by using the other.

 Workload distribution is done for the following:

o Taking advantage of specialized powers/utility of computers at each

site
o Maximizing the degree of parallelism
As workload distribution and processing locality are two conflicting requirements,
there must be a trade-off.
 Storage cost and availability—Database distribution should reflect the cost and
availability of storage at the different sites. The cost of data storage is not relevant
w.r.t. the CPU, I/O and transmission cost of applications but the limitation of
available storage at each site must be considered.
Satisfying all these conditions leads to complex optimization models.
Designer’s options:

 Option 1 -- Consider some of the above features as constraints rather than

objectives.
 Option 2 -- Consider the most important criterion in the initial design and to
introduce other criteria in the post-optimization.

Design of data distribution: Top-Down vs. Bottom-up approaches

Top-down design:
 Design the global schema
 Fragment the database
 Allocate the fragments to the sites
 Create the physical images

This approach is suitable for systems which are developed from scratch since it allows
performing the design rationally.

When the distributed database is developed as the aggregation of the existing databases,
bottom-up approach is normally followed.

Bottom-up design:
 Select a common database model for describing the global schema of the database
 Translate each local schema into the common data model
 Integrate all the local schema into a common global schema

By integration it means the merging of common data definition and the resolution of
conflicts among different representations given to the same data.

Example:
Consider a distributed database for a company in West Bengal having 3 sites at North
Bengal (site 1), Kolkata (site 2) and South Bengal (site 3). Kolkata is located about
halfway between NB and SB. There are 30 depts physically grouped as follows: the first
10 are close to NB, depts. between 11 and 20 are close to Kolkata, and depts. over 20 are
close to SB.

Suppliers of the company are all either in the city of NB or in the city of SB. Moreover
NB is in area ‘North’ and SB is in ‘South’. Kolkata falls on the border with some depts.
in North and some are in South.

Let us design the fragmentation of the following two tables:

SUPPLIER (snum,name,city)
DEPT (deptnum,name,area,mgrnum)

Consider the following application:

 Retrieve name of suppliers with a given number snum. The pseudocode is as
follows:
select name
from SUPPLIER
where snum= $X;

It is more likely that it references the supplier whose city

Query issued at site 1, city = NB
Query issued at site 2, city = NB or SB
Query issued at site 3, city = SB

Possible predicates in this application domain:

P1: city = “NB”
P2: city = “SB”
Consider the following applications:

 Administrative information about departments in area = North are issued at site 1

and in area=South are issued at site 3.
 Regular information about work conducted at each department may be issued at
any department.
Possible predicates in this application domain:
P1: deptnum  10
P2: 10 < deptnum  20
P3: deptnum > 20
P4: area = “North”
P5: area = “South”
There are a number of combinations between the elements of the two sets {P1,P2,P3}
and {P4,P5}which are not valid :
Example: P3 AND P4 is invalid; only four combinations are valid as shown below:

Y1: deptnum  10
Y2: (10 < deptnum  20) AND (area = “North”)
Y3: (10 < deptnum  20) AND (area = “South”)
Y4: deptnum > 20

P4: area = “North” P5: area = “South”

P1 Y1 False
P2 Y2 Y3
P3 False Y4

Table: Fragmentation of relation DEPT

Allocation of fragments:
 Fragments corresponding to Y1 and Y4 can easily be allocated at sites 1 and 3.
 The allocation of fragments Y2 and Y3 needs a trade-off between two conflicting
requirements as follows:
o Administrative applications which would like fragments to be allocated at
site 1 and 3 respectively.
o Regular application would like fragments to be allocated at site 2.

NB: In this example fragments Y2 and Y3 are appropriate units for the allocation
Problem.

A distributed join is a join between horizontally fragmented relations. R X S means all

the tuples of R and S need to be compared; that in turn means to compare all the
fragments Ri with all the fragments Sj. For some applications, if it is found that some of
partial joins Ri JN Sj are intrinsically empty (values of the join attribute in R i and Sj are
disjoint.

R1
R1 S1
S1 R2
R2
S2
R3 R3
S3 S2
R4 R4
S3
R5
(a) Join graph (b) Partitioned join graph (c) Simple join graph

A distributed join can be represented by join graphs. It is defined as a graph (N,E) where
nodes N represent fragments of R and S and non-directed edges represent joins between
fragments which are not intrinsically empty.

 Total join graph–Graph contains all possible edges between fragments of R and S

 Reduced join graph – Some of the edges between fragments of R and fragments
of S are missing

o Partitioned – graph is composed of two or more subgraphs without edges

between them (Fig. b)
o Simple – it is partitioned and each subgraph has just one edge (Fig. c)

Allocation of fragments
 Decide whether we would go for non-redundant or redundant allocation.
 Non-redundant – The best-fit approach
o A measure is associated with each possible allocation
o The site with the highest measure is selected

 Redundant – All beneficial site approach

o Determine the set of all sites where the benefit of allocating one copy of
the fragment is higher than the update cost
o Allocate a copy of the fragment to each element of this set

 Redundant – Progressive introduction of replication approach

o Determine the solution of the non-replicated problem
o Progressively introduce replication starting from the most beneficial site
o Terminate replication when no additional replication is beneficial

 Both the approaches have some disadvantages

o In the all beneficial site approach quantifying

Vrrenewal PDF
0% (1)
Vrrenewal PDF
1 page
RD314 Service Manual
No ratings yet
RD314 Service Manual
108 pages
BPM Cbok 4.0 PDF - pdf-201-250
100% (1)
BPM Cbok 4.0 PDF - pdf-201-250
50 pages
Asus Laptop
No ratings yet
Asus Laptop
2 pages
Ddbms Architecture Modified
No ratings yet
Ddbms Architecture Modified
13 pages
DDBMS Architecture
No ratings yet
DDBMS Architecture
7 pages
Distributed Databases: CS347 May 30, 2001
No ratings yet
Distributed Databases: CS347 May 30, 2001
48 pages
Distributed Database Design
No ratings yet
Distributed Database Design
15 pages
Distributed Database Design
No ratings yet
Distributed Database Design
52 pages
Distributed Database: Source
No ratings yet
Distributed Database: Source
19 pages
Network Analysis: Sharique Najam Muzaffar
100% (1)
Network Analysis: Sharique Najam Muzaffar
32 pages
Ee4001 - Quiz-2008
No ratings yet
Ee4001 - Quiz-2008
5 pages
4th Assignment
No ratings yet
4th Assignment
8 pages
Lec - 4 Final
No ratings yet
Lec - 4 Final
52 pages
ECE3073 P8 Compilation Answers PDF
No ratings yet
ECE3073 P8 Compilation Answers PDF
7 pages
Ans Assi1
No ratings yet
Ans Assi1
8 pages
M0302 Computer Science - E
No ratings yet
M0302 Computer Science - E
14 pages
Ch10 Axiomatic
No ratings yet
Ch10 Axiomatic
81 pages
os QB
No ratings yet
os QB
5 pages
Distributed Database Management Systems: Week-4
No ratings yet
Distributed Database Management Systems: Week-4
24 pages
Distributed Database Design
No ratings yet
Distributed Database Design
51 pages
Lecture22 dr2
No ratings yet
Lecture22 dr2
37 pages
Lecture 4db
No ratings yet
Lecture 4db
14 pages
Advanced Data Types and New Applications: Solutions To Practice Exercises
No ratings yet
Advanced Data Types and New Applications: Solutions To Practice Exercises
4 pages
Descriptive Statistics: Tabular and Graphical Methods: Summarizing Qualitative Data Summarizing Quantitative Data
No ratings yet
Descriptive Statistics: Tabular and Graphical Methods: Summarizing Qualitative Data Summarizing Quantitative Data
32 pages
Unit-Iv Syllabus What Is Greedy Approach?: Greedy: Interval Scheduling, Minimum Cost
No ratings yet
Unit-Iv Syllabus What Is Greedy Approach?: Greedy: Interval Scheduling, Minimum Cost
15 pages
Distributed Database Design Concept
No ratings yet
Distributed Database Design Concept
5 pages
Dynamic Programming
0% (1)
Dynamic Programming
51 pages
MASTAN2 v3
100% (1)
MASTAN2 v3
4 pages
Gmail - ? System Design Interview Questions For You!
No ratings yet
Gmail - ? System Design Interview Questions For You!
5 pages
CS239 Operating System (End - SP23)
No ratings yet
CS239 Operating System (End - SP23)
2 pages
Data Structure 4
No ratings yet
Data Structure 4
7 pages
Chapter 5 Distributed Database Design
No ratings yet
Chapter 5 Distributed Database Design
12 pages
JUNE 2023P2-Solution
No ratings yet
JUNE 2023P2-Solution
6 pages
Data Structure4
No ratings yet
Data Structure4
6 pages
LECTURE 11
No ratings yet
LECTURE 11
103 pages
Assignment 4: Data Structure
No ratings yet
Assignment 4: Data Structure
9 pages
Abstract: in This Paper Indoor Localization Problem Is Addressed From Theoretical and
No ratings yet
Abstract: in This Paper Indoor Localization Problem Is Addressed From Theoretical and
4 pages
Data Structure Assignment - 4
100% (1)
Data Structure Assignment - 4
9 pages
8.CS G553-Compre Answer Key-I Sem 2018-2019
No ratings yet
8.CS G553-Compre Answer Key-I Sem 2018-2019
7 pages
O.S Assignment
No ratings yet
O.S Assignment
7 pages
Unit 2
No ratings yet
Unit 2
73 pages
Disjunktni Skupovi
No ratings yet
Disjunktni Skupovi
44 pages
RD TH
No ratings yet
RD TH
4 pages
Unit-V: Database Management System
No ratings yet
Unit-V: Database Management System
5 pages
COSC 3101A - Design and Analysis of Algorithms 7
No ratings yet
COSC 3101A - Design and Analysis of Algorithms 7
50 pages
344 Answers 2 ND Mid
No ratings yet
344 Answers 2 ND Mid
4 pages
Ad Server Diagram
No ratings yet
Ad Server Diagram
54 pages
Maximum Mark: 90: University of Cambridge International Examinations General Certificate of Education Advanced Level
No ratings yet
Maximum Mark: 90: University of Cambridge International Examinations General Certificate of Education Advanced Level
6 pages
Distributed Database Chapter 3 Modified
No ratings yet
Distributed Database Chapter 3 Modified
40 pages
Hint and Sample Question For CW1 Q1
No ratings yet
Hint and Sample Question For CW1 Q1
5 pages
The Solution of Assignment I For Distributed Databases
No ratings yet
The Solution of Assignment I For Distributed Databases
7 pages
Chapter 3 Distributed Database Design
No ratings yet
Chapter 3 Distributed Database Design
34 pages
CPT212-Test2-2023 Solution
No ratings yet
CPT212-Test2-2023 Solution
6 pages
IJERT_Efficient_Fragmentation_and_Alloca
No ratings yet
IJERT_Efficient_Fragmentation_and_Alloca
7 pages
Relational Database Design
No ratings yet
Relational Database Design
9 pages
Sparse Matrix
No ratings yet
Sparse Matrix
6 pages
PIER Input Guide
No ratings yet
PIER Input Guide
21 pages
Adb CH 4
No ratings yet
Adb CH 4
14 pages
Lec - 15a Disjoint - Sets
No ratings yet
Lec - 15a Disjoint - Sets
54 pages
DDB 05 PDF
No ratings yet
DDB 05 PDF
19 pages
Module 4 AOA
No ratings yet
Module 4 AOA
97 pages
CS 704a
No ratings yet
CS 704a
3 pages
Geometric functions in computer aided geometric design
From Everand
Geometric functions in computer aided geometric design
Oscar Ruiz
No ratings yet
End To End Testing
100% (1)
End To End Testing
6 pages
Danh Sách PH Tùng CLG890H1
No ratings yet
Danh Sách PH Tùng CLG890H1
1,900 pages
Mix Design and Calculation of Cement For Different Grades of Concrete
No ratings yet
Mix Design and Calculation of Cement For Different Grades of Concrete
12 pages
Photographic Superimpositions
No ratings yet
Photographic Superimpositions
10 pages
SikaGrout-214 MY
No ratings yet
SikaGrout-214 MY
4 pages
Codigo 1688 Cummins
No ratings yet
Codigo 1688 Cummins
3 pages
HLP23E-Manual Washing Machine
No ratings yet
HLP23E-Manual Washing Machine
21 pages
You Incorporated - 22.08.18
No ratings yet
You Incorporated - 22.08.18
41 pages
8086 Assembler Tutorial Part 5
No ratings yet
8086 Assembler Tutorial Part 5
17 pages
2021 Amaes Obe Ab Pol Sci - V7
No ratings yet
2021 Amaes Obe Ab Pol Sci - V7
28 pages
PDF NCP Pada Pasien Osteo1 DD
No ratings yet
PDF NCP Pada Pasien Osteo1 DD
32 pages
NeRF FOR HERITAGE 3D RECONSTRUCTION - Odf
No ratings yet
NeRF FOR HERITAGE 3D RECONSTRUCTION - Odf
8 pages
Freebitco
No ratings yet
Freebitco
3 pages
Maker Line Datasheet
No ratings yet
Maker Line Datasheet
7 pages
Chapter 3 Review Quiz
No ratings yet
Chapter 3 Review Quiz
1 page
Scholz v. Goudreau
No ratings yet
Scholz v. Goudreau
35 pages
FY 23 Ansonia Mayor's Presentation
No ratings yet
FY 23 Ansonia Mayor's Presentation
64 pages
Mail Server
0% (1)
Mail Server
78 pages
HARSH's Resume
No ratings yet
HARSH's Resume
1 page
Epp301 LCD
No ratings yet
Epp301 LCD
2 pages
Tendeal Presentation EN
No ratings yet
Tendeal Presentation EN
14 pages
Expoling OEEin Ind MFG Plant
No ratings yet
Expoling OEEin Ind MFG Plant
12 pages
ME Web Module
No ratings yet
ME Web Module
95 pages
Log Com - Roblox.client
No ratings yet
Log Com - Roblox.client
262 pages
Lecture 2
No ratings yet
Lecture 2
35 pages
Lab MGT Prelim
No ratings yet
Lab MGT Prelim
7 pages

DDBMS Design

Uploaded by

DDBMS Design

Uploaded by

DDBMS Design Principle

Design of centrally located Database (CD) involves following:

Design of distributed database (DD):

Two abovementioned steps are done by

 Workload distribution is done for the following:

o Taking advantage of specialized powers/utility of computers at each

 Option 1 -- Consider some of the above features as constraints rather than

Design of data distribution: Top-Down vs. Bottom-up approaches

Let us design the fragmentation of the following two tables:

Consider the following application:

It is more likely that it references the supplier whose city

Possible predicates in this application domain:

 Administrative information about departments in area = North are issued at site 1

P4: area = “North” P5: area = “South”

Table: Fragmentation of relation DEPT

A distributed join is a join between horizontally fragmented relations. R X S means all

o Partitioned – graph is composed of two or more subgraphs without edges

 Redundant – All beneficial site approach

 Redundant – Progressive introduction of replication approach

 Both the approaches have some disadvantages

You might also like