Parallel Databases

Parallel databases are increasingly common as the cost of hardware has decreased. Large databases require parallelism for storage, queries, and throughput. There are different types of parallelism including interquery, intraquery, interoperation, and intraoperation parallelism. Data can be partitioned horizontally or vertically across multiple disks for parallel input/output and queries can utilize various parallelization techniques. Issues in parallel database design include parallel data loading, resilience to failures, and redundancy.

Uploaded by

Madara Uchiha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

38 views

Parallel Databases

Uploaded by

Madara Uchiha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 11

Parallel Databases

Introduction
 Parallel machines are becoming quite common and affordable
 Prices of microprocessors, memory and disks have dropped
sharply
 Recent desktop computers feature multiple processors and this
trend is projected to accelerate
 Databases are growing increasingly large
 large volumes of transaction data are collected and stored for later
analysis.
 multimedia objects like images are increasingly stored in
databases
 Large-scale parallel database systems increasingly used for:
 storing large volumes of data
 processing time-consuming decision-support queries
 providing high throughput for transaction processing
Parallelism in Databases
 Data can be partitioned across multiple disks for parallel I/O.
 Individual relational operations (e.g., sort, join, aggregation) can be
executed in parallel
 Queries are expressed in high level language (SQL, translated to
relational algebra)
 makes parallelization easier.
 Different queries can be run in parallel with each other.
Concurrency control takes care of conflicts.
Partitioning

 Types of partitioning

Horizontal partitioning – tuples of a relation are divided among many

disks such that each tuple resides on one disk.

Vertical partitioning-Schema of relation is divided among many disks

such that data fields of each tuple are split and stored on various
multiple disks.
Partitioning
 Partitioning techniques (number of disks = n):
Round-robin:
Send the I th tuple inserted in the relation to disk i mod n.
Hash partitioning:
 Choose one or more attributes as the partitioning attributes.
 Choose hash function h with range 0…n - 1
 Let i denote result of hash function h applied to the partitioning
attribute value of a tuple. Send tuple to disk i.
 Range partitioning:
 Choose an attribute as the partitioning attribute.
 A partitioning vector [vo, v1, ..., vn-2] is chosen.
 Let v be the partitioning attribute value of a tuple. Tuples such that vi  vi+1 go to
disk I + 1. Tuples with v < v0 go to disk 0 and tuples with v  vn-2 go to disk n-1.
Interquery Parallelism
 Queries/transactions execute in parallel with one another.
 Increases transaction throughput; used primarily to scale up a transaction
processing system to support a larger number of transactions per second.
 Easiest form of parallelism to support, particularly in a shared-memory
parallel database, because even sequential database systems support
concurrent processing.
Intraquery Parallelism

 Execution of a single query in parallel on multiple processors/disks;

important for speeding up long-running queries.
 Two complementary forms of intraquery parallelism:
 Intraoperation Parallelism – parallelize the execution of each individual
operation in the query.
 Interoperation Parallelism – execute the different operations in a query
expression in parallel.
the first form scales better with increasing parallelism because
the number of tuples processed by each operation is typically more than the
number of operations in a query.
Interoperator Parallelism

 Pipelined parallelism
 Consider a join of four relations
 r1 r2 r3 r4
 Set up a pipeline that computes the three joins in parallel
 Let P1 be assigned the computation of
temp1 = r1 r2
 And P2 be assigned the computation of temp2 = temp1
r3
 And P3 be assigned the computation of temp2 r4
 Each of these operations can execute in parallel, sending result
tuples it computes to the next operation even as it is computing
further results
Independent Parallelism

 Independent parallelism
 Consider a join of four relations
r1 r2 r3 r4
 Let P1 be assigned the computation of
temp1 = r1 r2
 And P2 be assigned the computation of temp2 = r 3 r4
 And P3 be assigned the computation of temp1 temp 2
 P1 and P2 can work independently in parallel
 P3 has to wait for input from P1 and P2
 Can pipeline output of P1 and P2 to P3, combining
independent parallelism and pipelined parallelism
 Does not provide a high degree of parallelism
 useful with a lower degree of parallelism.
 less useful in a highly parallel system.
Design of Parallel Systems

Some issues in the design of parallel systems:

 Parallel loading of data from external sources is needed in order
to handle large volumes of incoming data.
 Resilience to failure of some processors or disks.
 Probability of some disk or processor failing is higher in a parallel
system.
 Operation (perhaps with degraded performance) should be possible
in spite of failure.
 Redundancy achieved by storing extra copy of every data item at
another processor.
End of Chapter

Datacom DM1200 - Command - Reference
No ratings yet
Datacom DM1200 - Command - Reference
954 pages
OSCP Notes NagendranGS
No ratings yet
OSCP Notes NagendranGS
58 pages
DDCS Expert Install File Description
No ratings yet
DDCS Expert Install File Description
2 pages
ADBMS Parallel and Distributed Databases
No ratings yet
ADBMS Parallel and Distributed Databases
98 pages
TDD: Topics in Distributed Databases: Parallel Database Management Systems
No ratings yet
TDD: Topics in Distributed Databases: Parallel Database Management Systems
38 pages
Unit No.4 Parallel Database
No ratings yet
Unit No.4 Parallel Database
32 pages
Lecture 1 Parallel Databases
No ratings yet
Lecture 1 Parallel Databases
30 pages
Parallel Database: Architecture For Parallel Databases. Parallel Query Evaluation Parallelizing Individual Operations
No ratings yet
Parallel Database: Architecture For Parallel Databases. Parallel Query Evaluation Parallelizing Individual Operations
27 pages
Parallel & Distributed Databases: C S 5 6 1 - S P R I N G 2 0 1 2 Wpi, Mohamed Eltabakh
No ratings yet
Parallel & Distributed Databases: C S 5 6 1 - S P R I N G 2 0 1 2 Wpi, Mohamed Eltabakh
23 pages
ParallelDBs PDF
No ratings yet
ParallelDBs PDF
23 pages
M.C.a. (Sem - IV) Paper - IV - Adavanced Database Techniques
No ratings yet
M.C.a. (Sem - IV) Paper - IV - Adavanced Database Techniques
114 pages
Dbms
No ratings yet
Dbms
14 pages
2 Parallel Databases
No ratings yet
2 Parallel Databases
44 pages
Parallel and Distributed Databases in DBMS
No ratings yet
Parallel and Distributed Databases in DBMS
31 pages
Parallel Database System
No ratings yet
Parallel Database System
55 pages
Elective-I Advanced Database Management Systems: Unit Ii
100% (1)
Elective-I Advanced Database Management Systems: Unit Ii
141 pages
LN 2
No ratings yet
LN 2
33 pages
SAYAN_GHOSH_26900123054_DISTRIBUTED_DATABASE_SYSTEM_CSE_6TH_SEM
No ratings yet
SAYAN_GHOSH_26900123054_DISTRIBUTED_DATABASE_SYSTEM_CSE_6TH_SEM
11 pages
Introduction To DBMS
No ratings yet
Introduction To DBMS
37 pages
Fundamentals of Database Systems: (Parallel and Distributed Databases)
No ratings yet
Fundamentals of Database Systems: (Parallel and Distributed Databases)
46 pages
9.CSI2004-ADBMS_Module2__part1
No ratings yet
9.CSI2004-ADBMS_Module2__part1
54 pages
Unit 5 Parallel and Distributed Databases
No ratings yet
Unit 5 Parallel and Distributed Databases
22 pages
ADTHEORY1
No ratings yet
ADTHEORY1
15 pages
Adv DBMS-Unit 2
No ratings yet
Adv DBMS-Unit 2
15 pages
adbms-unit4
No ratings yet
adbms-unit4
24 pages
Module1 ADBMS
No ratings yet
Module1 ADBMS
99 pages
Ads unit 3
No ratings yet
Ads unit 3
8 pages
Parallel DB /D.S.Jagli 1 5/4/2012 1 1. Parallel DB /D.S.Jagli
No ratings yet
Parallel DB /D.S.Jagli 1 5/4/2012 1 1. Parallel DB /D.S.Jagli
70 pages
Sayan Ghosh 26900123054 Distributed Database System Cse 6th Sem
No ratings yet
Sayan Ghosh 26900123054 Distributed Database System Cse 6th Sem
11 pages
Cs6005 - Advanced Database Systems (Unit-1)
No ratings yet
Cs6005 - Advanced Database Systems (Unit-1)
136 pages
Query Parallelism
No ratings yet
Query Parallelism
8 pages
Module III
No ratings yet
Module III
132 pages
Parallel Database
No ratings yet
Parallel Database
22 pages
Unit I
No ratings yet
Unit I
43 pages
Intraquery Parallelism Intraoperation Parallelism Interoperation Parallelism Design of Parallel Systems
No ratings yet
Intraquery Parallelism Intraoperation Parallelism Interoperation Parallelism Design of Parallel Systems
29 pages
8-Parallel Nhom5
No ratings yet
8-Parallel Nhom5
59 pages
Parallel Dbms
No ratings yet
Parallel Dbms
5 pages
UNIT-3: Introduction To Parallel Database and I/O Parallelism
No ratings yet
UNIT-3: Introduction To Parallel Database and I/O Parallelism
52 pages
CH14
No ratings yet
CH14
43 pages
Inter and Intra Query Parallelism
No ratings yet
Inter and Intra Query Parallelism
1 page
Parallel Database Systems and Their Architecture
No ratings yet
Parallel Database Systems and Their Architecture
17 pages
Parallel Database
No ratings yet
Parallel Database
27 pages
Database Management Systems: Unit 4 - Parallel DBMS
No ratings yet
Database Management Systems: Unit 4 - Parallel DBMS
14 pages
Lesson2 Parallel Database
No ratings yet
Lesson2 Parallel Database
58 pages
CH 2
No ratings yet
CH 2
51 pages
Ptimimation of F Ulti-Join Ri
No ratings yet
Ptimimation of F Ulti-Join Ri
14 pages
Introduction To Parallel Databases
No ratings yet
Introduction To Parallel Databases
24 pages
Query Processing in Distributed Database
No ratings yet
Query Processing in Distributed Database
20 pages
Second Unit ADBMS
No ratings yet
Second Unit ADBMS
53 pages
Parallelisation Comment
No ratings yet
Parallelisation Comment
3 pages
Parallel_Database_QA_Detailed
No ratings yet
Parallel_Database_QA_Detailed
2 pages
Parallel Database
No ratings yet
Parallel Database
8 pages
Third Year Engineering: 21BTCS604 - Advanced DBMS
No ratings yet
Third Year Engineering: 21BTCS604 - Advanced DBMS
51 pages
databace1
No ratings yet
databace1
7 pages
26 Distributed Dbms Nosql
No ratings yet
26 Distributed Dbms Nosql
45 pages
Parallel DBMS: Chapter 22, Sections 22.1-22.6
No ratings yet
Parallel DBMS: Chapter 22, Sections 22.1-22.6
23 pages
Parallel-Databases
No ratings yet
Parallel-Databases
10 pages
Adbms
No ratings yet
Adbms
70 pages
17 DatabaseArchitectures
No ratings yet
17 DatabaseArchitectures
41 pages
Lecture 10: Parallel Query Evaluation: CS 838: Foundations of Data Management Spring 2016
No ratings yet
Lecture 10: Parallel Query Evaluation: CS 838: Foundations of Data Management Spring 2016
4 pages
Parallel Database Systems an Overview
No ratings yet
Parallel Database Systems an Overview
10 pages
14-queryexecution2
No ratings yet
14-queryexecution2
47 pages
DRBD-Cookbook: How to create your own cluster solution, without SAN or NAS!
From Everand
DRBD-Cookbook: How to create your own cluster solution, without SAN or NAS!
Joerg Christian Seubert
No ratings yet
CCNP Interview Questions
No ratings yet
CCNP Interview Questions
4 pages
Manual HD330 External Hard Drive
No ratings yet
Manual HD330 External Hard Drive
2 pages
Sujatha Hadoop Admin
No ratings yet
Sujatha Hadoop Admin
5 pages
FIFA Mod Manager Log20250203
No ratings yet
FIFA Mod Manager Log20250203
5 pages
Guide Techview User S Guide en 132476
No ratings yet
Guide Techview User S Guide en 132476
130 pages
RL78 IDE/Compiler Migration Guide: White Paper
No ratings yet
RL78 IDE/Compiler Migration Guide: White Paper
18 pages
Research Proposal: Objective
No ratings yet
Research Proposal: Objective
4 pages
Assignment
No ratings yet
Assignment
8 pages
Logcat 1715318065559
No ratings yet
Logcat 1715318065559
12 pages
Exchange Installation & FSMO Roles
No ratings yet
Exchange Installation & FSMO Roles
7 pages
83032107-ohaus-rs232-data-interface-user-guide
No ratings yet
83032107-ohaus-rs232-data-interface-user-guide
10 pages
Workspace Environment Management 2411
No ratings yet
Workspace Environment Management 2411
520 pages
Updated Flash Procedure For All Caterpillar Products (1920, 7610, 7620)
No ratings yet
Updated Flash Procedure For All Caterpillar Products (1920, 7610, 7620)
2 pages
Assignment 2 Answer
No ratings yet
Assignment 2 Answer
7 pages
Computer Architecture Lab 3
No ratings yet
Computer Architecture Lab 3
5 pages
Network Viva Questions and Answers
50% (2)
Network Viva Questions and Answers
29 pages
Sil 3 Systems PDF
No ratings yet
Sil 3 Systems PDF
52 pages
Windows 11 Operating System
No ratings yet
Windows 11 Operating System
10 pages
SIEMENS Technical Paper 2
100% (1)
SIEMENS Technical Paper 2
18 pages
4JBM Network Design Presentation
No ratings yet
4JBM Network Design Presentation
12 pages
Ha 463B
No ratings yet
Ha 463B
2 pages
MM Mod1 QB
No ratings yet
MM Mod1 QB
1 page
BACnet Explorer Guide
No ratings yet
BACnet Explorer Guide
32 pages
Standard functions SFC_2017.pdf_page_19
No ratings yet
Standard functions SFC_2017.pdf_page_19
1 page
ES Notes PDF
No ratings yet
ES Notes PDF
160 pages
CMMT-AS Manual 2023-11l 8204518g1-16
No ratings yet
CMMT-AS Manual 2023-11l 8204518g1-16
1 page
DDOS Attack Tools
100% (1)
DDOS Attack Tools
71 pages

Parallel Databases

Uploaded by

Parallel Databases

Uploaded by

Parallel Databases

Horizontal partitioning – tuples of a relation are divided among many

Vertical partitioning-Schema of relation is divided among many disks

 Execution of a single query in parallel on multiple processors/disks;

Some issues in the design of parallel systems:

You might also like