0% found this document useful (0 votes)

41 views27 pages

Distributed Transactions

The document discusses distributed transactions and the two-phase commit (2PC) protocol. 2PC ensures that transactions are atomic and either commit or abort across all participating servers. It does so in two phases: 1. In the prepare phase, the coordinator asks all participants if they can commit. They write their response to a log and await the coordinator's decision. 2. In the commit phase, if all participants agreed to commit, the coordinator tells them to do so. Otherwise it tells them to abort. This ensures all participants take the same action. The protocol handles failures through logging to stable storage before responses. It allows distributed transactions to follow the ACID properties of atomicity, consistency, isolation

Uploaded by

actualruthwik

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

41 views27 pages

Distributed Transactions

Uploaded by

actualruthwik

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 27

Commit Protocols

Edited slides of
Pallabh Dasgupta IITKgp
D Goswami IIT Guwahati

1
Distributed Transactions
 A transaction that invokes
operations at several servers.

A
X

B
Y

D
Z

https://www.iitg.ac.in/dgoswami/cs542.html
Coordinator of a Distributed Transaction

In a distributed environment, a coordinator is

needed
Client sends an openTransaction to the
coordinator
• Other servers that manage the objects accessed by
the transaction become participants.
Distributed banking transaction
Coordinator

join participant

A A.withdraw(4);
join

BranchX
T
participant

Client B B.withdraw(3);

T = openTransaction
join BranchY
A.withdraw(4);
C.deposit(4); participant
B.withdraw(3);
D.deposit(3); C C.deposit(4);
closeTransaction
D D.deposit(3);
Note: the coordinator is in one of the servers, e.g. BranchX
BranchZ

https://www.iitg.ac.in/dgoswami/cs542.html
Transaction needs to follow the ACID properties
Atomicity—all or none—Transaction is an atomic unit. It is performed
either to entirety or not performed at all.

Consistency—don't violate DB integrity constraints: execution of the operation

should be correct taking from one consistent state to another.
Example: Amount cant be negative, total amount in the bank remains the same
after within bank transfer, etc.

Isolation (Atomicity)—partial results are hidden. That is, execution of a

transaction should not be interfered with by other transactions executing
concurrently.

Durability—effects (of transactions that "happened" or committed) are forever.

They should not be lost due to failure

5
What do we want to do?
Consider T1 and T2

T1
ADD(X,100)
REMOVE(Y,100)

T2
Get( X + Y)

Ensure isolation, atomicity, durability and consistency to these transactions given X and Y
are on different locations.

In 2PC, each sub transaction locks the data before doing any work on it. The locks are released
only after the transaction is completed.
This ensures isolation.

Can this lead to a deadlock?

6
What is an ABORT ?
Sub Transactions on certain sites may want to decide to fail(abort) in certain situations

Some Examples
- if in a deadlock, the process may need to abort to break the deadlock and release the resource
- Abort might be needed in case the transaction goes into an error like account does not
exist or no money in account.
- Issues like divide by 0 situation encountered.
- Node failure so unable to decide.

Concurrency Control
A. Lock based which is a very careful approach.
However, might be slower as you wait for locks to be released but needed if too many conflicts

B. Don’t bother about concurrent transactions. If you lucky then no conflicts. That saves you
waiting on locks. If not lucky, then abort and retry.
If conflicts not frequent then this can be used.

7
Two Phase Commit Protocol (2PC)
 Acquires locks before accessing any record
 Lock released only after transaction is either committed or aborted
 This can land in deadlock situations
 Why do we need to keep the locks?

8
System Failure Modes
 Failures unique to distributed systems:
– Failure of a site.
– Loss of messages
• Handled by network transmission control protocols such as TCP-IP
– Failure of a communication link
• Handled by network protocols, by routing messages via alternative links
– Network partition
• A network is said to be partitioned when it has been split into two or more
subsystems that lack any connection between them
– Note: a subsystem may consist of a single node
 Network partitioning and site failures are generally indistinguishable.
 Site A commits as it completes its work but site B realizes there is an error so has to abort –
violates atomicity

9
Commit Protocols
 Commit protocols are used to ensure atomicity across sites
– a transaction which executes at multiple sites must either be committed at all the sites, or
aborted at all the sites.
– not acceptable to have a transaction committed at one site and aborted at another

 The two-phase commit (2PC) protocol is widely used

 The three-phase commit (3PC) protocol is more complicated and more expensive, but avoids
some drawbacks of two-phase commit protocol. This protocol is not used in practice.

10
Distributed Transactions
 Transaction may access data at several sites.
 Each site has a local transaction manager responsible for:
– Maintaining a log for recovery purposes
– Participating in coordinating the concurrent execution of the transactions
executing at that site.
 Each site has a transaction coordinator, which is responsible for:
– Starting the execution of transactions that originate at the site.
– Distributing subtransactions at appropriate sites for execution.
– Coordinating the termination of each transaction that originates at the site,
which may result in the transaction being committed at all sites or aborted at
all sites.

11
Transaction System Architecture

12
Distributed banking transaction
Coordinator

join participant

A A.withdraw(4);
join

BranchX
T
participant

Client B B.withdraw(3);

https://www.iitg.ac.in/dgoswami/cs542.html
Two Phase Commit Protocol (2PC)
 Assumes fail-stop model – failed sites simply stop working, and do not cause any other
harm, such as sending incorrect messages to other sites. They can come up later

 Execution of the protocol is initiated by the coordinator after the last step of the
transaction has been reached.

 The protocol involves all the local sites at which the transaction executed

 Let T be a transaction initiated at site Si, and let the transaction coordinator at Si be Ci

14
Phase 1: Obtaining a Decision
 Coordinator asks all participants to prepare to commit transaction Ti.
– Ci adds the records <prepare T> to the log and forces log to stable storage
– sends prepare T messages to all sites at which T executed
 Upon receiving message, transaction manager at site determines if it can commit the transaction
– if not, add a record <no T> to the log and send abort T message to Ci
– if the transaction can be committed, then:
– add the record <ready T> to the log
– force all records for T to stable storage
– send ready T message to Ci

* Hence prepare<T> and <no T> or <ready T> are now stored in log which is now in stable storage (database untouched)

15
Phase 2: Recording the Decision
 T can be committed of Ci received a ready T message from all the participating sites:
otherwise T must be aborted.

 Coordinator adds a decision record, <commit T> or <abort T>, to the log and forces record
onto stable storage. Once the record stable storage it is irrevocable (even if failures occur)

 Coordinator sends a message to each participant informing it of the decision (commit or

abort)

 Participants take appropriate action locally.

16
<abort T>/<ready T>

17
Handling of Failures - Site Failure
When site Si recovers, it examines its log to determine the fate of transactions active at the time
of the failure.
 Log contain <commit T> record: site executes redo (T) *copies from log to db
 Log contains <abort T> record: site executes undo (T) *remove from log
 Log contains <ready T> record: site must consult Ci to determine the fate of T.
– If T committed, redo (T)
– If T aborted, undo (T)

 The log contains no control records concerning T implies that Sk failed before responding

to the prepare T message from Ci

– since the failure of Sk precludes the sending of such a response C1 must abort T

18
– S must execute undo (T)
Handling of Failures- Coordinator Failure
 If coordinator fails while the commit protocol for T is executing then participating sites must decide on T’s fate:

1. If an active site contains a <commit T> record in its log, then T must be committed.
2. If an active site contains an <abort T> record in its log, then T must be aborted.
3. If some active participating site does not contain a <ready T> record in its log, then the failed coordinator Ci
cannot have decided to commit T. Can therefore abort T.
4. If none of the above cases hold, then all active sites must have a <ready T> record in their logs, but no
additional control records (such as <abort T> of <commit T>). In this case active sites must wait for Ci to
recover, to find decision.
** we don’t know Ci’s local decision as a participating site. Also we don’t know if all ready T reached. If has not reached then Ci has not written
in db so we don’t need to commit.

 Blocking problem : active sites may have to wait for failed coordinator to recover (hence resources
locked and held up). Again Coordinator failure very uncommon

19
Handling of Failures - Network Partition
 If the coordinator and all its participants remain in one partition, the failure has no effect on the
commit protocol.
 If the coordinator and its participants belong to several partitions:
*Sites that are not in the partition containing the coordinator think the coordinator
has failed, and execute the protocol to deal with failure of the coordinator.
• No harm results, but sites may still have to wait for decision from coordinator.

*The coordinator and the sites which are in the same partition as the coordinator think
that the sites in the other partition have failed, and follow the usual commit protocol.
• Again, no harm results

20
Recovery and Concurrency Control
 In-doubt transactions have a <ready T>, but neither a
<commit T>, nor an <abort T> log record.
 The recovering site must determine the commit-abort status of such transactions by contacting other
sites; this can slow and potentially block recovery.
 Recovery algorithms can note lock information in the log.
– Instead of <ready T>, write out <ready T, L> L = list of locks held by T when the log is written (read
locks can be omitted).
– For every in-doubt transaction T, all the locks noted in the
<ready T, L> log record are reacquired.
 After lock reacquisition, transaction processing can resume; the commit or rollback of in-doubt
transactions is performed concurrently with the execution of new transactions.

21
Three Phase Commit (3PC)
 Assumptions:
– No network partitioning
– At any point, at least one site must be up.
– At most K sites (participants as well as coordinator) can fail

 Phase 1: Obtaining Preliminary Decision: Identical to 2PC Phase 1.

– Every site is ready to commit if instructed to do so

22
Three Phase Commit (3PC)
 Phase 2 of 2PC is split into 2 phases, Phase 2 and Phase 3 of 3PC
– In phase 2 coordinator makes a decision as in 2PC (called the pre-commit
decision) and sends pre-commit msg.
When he receives at least k replies (ack) then he starts sending commit to those sites. As he receives
more acks he keeps sending commits. Hence its decision to commit is recorded in multiple (at least K)
sites before final commit goes out.
– In phase 3, coordinator sends commit/abort message to all participating sites,

 Under 3PC, knowledge of pre-commit decision can be used to commit despite coordinator failure
– Avoids blocking problem as long as upto K sites fail. If beyond k fail then blocking could happen

 Drawbacks:
– higher overheads
– assumptions may not be satisfied in practice

23
C A1

A2
A3

D2
B1

B2
E1

24
Handling of Failures - Site Failure
When site Si recovers, it examines its log to determine the fate of transactions active at the time of the failure.

 F:Log contain <commit T> record: site executes redo (T) *copies from log to db
 F: Log contains <abort T> record: site executes undo (T) *remove from log
 A3: Log contains <ready T> record: site must consult Ci to determine the fate of T.
– If T committed, redo (T)
– If T aborted, undo (T)
 A1 A2: (includes (a) After <prepare T> before <ready t> and (b) Before <prepare T>)
The log contains no control records concerning T replies that Sk failed before responding to the prepare T

message from Ci

– since the failure of Sk precludes the sending of such a response C1 must abort T

– Sk must execute undo (T)

25
Handling of Failures - Site Failure
When site Si recovers, it examines its log to determine the fate of transactions active at the time
of the failure.
 B1:Log contain <pre-commit T> record: site needs to check with coordinator since on
getting k acks the coordinator could have committed.
The transaction can also be in an abort situation if the coordinator failed after sending
precommit to site Si(Si also had failed). So remaining sites seeing no precommit msgs
among them would abort.
 B2: Log contains <ack T> record: same as B1

26
Handling of Failures- Coordinator Failure
 If coordinator fails while the commit protocol for T is executing then participating sites must decide on T’s fate:

1. E2:If an active site contains a <commit T> record in its log, then T must be committed.
2. E2:If an active site contains an <abort T> record in its log, then T must be aborted.
3. C:If some active participating site does not contain a <ready T> record in its log, then the failed coordinator
Ci cannot have decided to commit T. Can therefore abort T.
4. all active sites have a <ready T> record in their logs, but no additional control records (such as <abort T> of
<commit T>).
D2,E1: If there is <precommit T> in someone’s log then vote for new coordinator. New coordinator behaves
as if it received <ready T> from everyone. It sends <precommit T> to make sure k sites have the info.
D1: Abort
.

Cs9152 DBT Unit IV Notes
100% (5)
Cs9152 DBT Unit IV Notes
61 pages
Lec22
No ratings yet
Lec22
22 pages
Ddbs Checkpointing ... Ddbs Checkpointing ... : Phase 1 at Css Phase 2 at CC
No ratings yet
Ddbs Checkpointing ... Ddbs Checkpointing ... : Phase 1 at Css Phase 2 at CC
9 pages
Nonblocking Commit Protocols: Dale Skeen
No ratings yet
Nonblocking Commit Protocols: Dale Skeen
42 pages
05distributed Transaction Processinglec5 (Feb 28)
No ratings yet
05distributed Transaction Processinglec5 (Feb 28)
41 pages
Distributed Systems Unit 4
No ratings yet
Distributed Systems Unit 4
26 pages
Unit IV - Distributed Transaction Processing
No ratings yet
Unit IV - Distributed Transaction Processing
38 pages
8 Transaction
No ratings yet
8 Transaction
30 pages
10 DS_Ch17
No ratings yet
10 DS_Ch17
16 pages
Nested Transactions Nested Transactions
No ratings yet
Nested Transactions Nested Transactions
11 pages
Distributed Transaction Model
No ratings yet
Distributed Transaction Model
17 pages
Lecture 13
No ratings yet
Lecture 13
37 pages
WINSEM2023-24 CSI2004 TH VL2023240501820 2024-02-07 Reference-Material-I
No ratings yet
WINSEM2023-24 CSI2004 TH VL2023240501820 2024-02-07 Reference-Material-I
75 pages
2.11 Distributed Transaction
No ratings yet
2.11 Distributed Transaction
27 pages
mod 5 (ds)
No ratings yet
mod 5 (ds)
25 pages
DistributedTransaction
No ratings yet
DistributedTransaction
46 pages
Distributed Transactions
No ratings yet
Distributed Transactions
37 pages
Distributed Recovery Management: UNIT-4
No ratings yet
Distributed Recovery Management: UNIT-4
31 pages
Commit Protocols Non-Blocking Commit Protocols
No ratings yet
Commit Protocols Non-Blocking Commit Protocols
10 pages
Fert
No ratings yet
Fert
38 pages
Distributed Transactions
0% (1)
Distributed Transactions
52 pages
Reliability and Security in The Distributed Databases
No ratings yet
Reliability and Security in The Distributed Databases
29 pages
What is a Transaction
No ratings yet
What is a Transaction
7 pages
Design and Implementation of A Two-Phase Commit Protocol Simulator
No ratings yet
Design and Implementation of A Two-Phase Commit Protocol Simulator
8 pages
Consensus On Transaction Commit
No ratings yet
Consensus On Transaction Commit
28 pages
DSC4
No ratings yet
DSC4
15 pages
CS542: Topics in Distributed Systems
No ratings yet
CS542: Topics in Distributed Systems
11 pages
lect22
No ratings yet
lect22
27 pages
08 Transactions
No ratings yet
08 Transactions
15 pages
Introduction: Chapter 13: Distributed Transactions
No ratings yet
Introduction: Chapter 13: Distributed Transactions
62 pages
Lecture 11A - Replication Control
No ratings yet
Lecture 11A - Replication Control
15 pages
3 - Nonblocking Commit Protocols
No ratings yet
3 - Nonblocking Commit Protocols
28 pages
Distributed DBMS Reliability - 3 of 3 (Good)
50% (2)
Distributed DBMS Reliability - 3 of 3 (Good)
35 pages
DDS Unit - 4
No ratings yet
DDS Unit - 4
22 pages
Atomic Commit and Concurrency Control: COS 418: Distributed Systems Wyatt Lloyd
No ratings yet
Atomic Commit and Concurrency Control: COS 418: Distributed Systems Wyatt Lloyd
40 pages
25 DistributedCoordination
No ratings yet
25 DistributedCoordination
30 pages
Lecture 9 Distributed Transactions
No ratings yet
Lecture 9 Distributed Transactions
7 pages
Transactions &ConcurrencyControl
No ratings yet
Transactions &ConcurrencyControl
40 pages
CH 23
No ratings yet
CH 23
126 pages
Concurrency Control
No ratings yet
Concurrency Control
26 pages
Synchronization: CS403/534 Distributed Systems Erkay Savas Sabanci University
No ratings yet
Synchronization: CS403/534 Distributed Systems Erkay Savas Sabanci University
46 pages
Lec16 2023
No ratings yet
Lec16 2023
45 pages
Reliable Distributed Systems: Transactions in Large, Complex Systems and Web Services
No ratings yet
Reliable Distributed Systems: Transactions in Large, Complex Systems and Web Services
47 pages
DDS Unit - 3
No ratings yet
DDS Unit - 3
15 pages
DS Chapter V8.0fault Tolerance
No ratings yet
DS Chapter V8.0fault Tolerance
23 pages
Transactions & Concurrency Control: CS4262 Distributed Systems
No ratings yet
Transactions & Concurrency Control: CS4262 Distributed Systems
45 pages
Lecture 05
No ratings yet
Lecture 05
29 pages
3C03 Concurrency: Distributed Transactions
No ratings yet
3C03 Concurrency: Distributed Transactions
10 pages
ds part b
No ratings yet
ds part b
30 pages
Distributed_Commit_Protocols
No ratings yet
Distributed_Commit_Protocols
9 pages
Consensus
No ratings yet
Consensus
77 pages
Module 4
No ratings yet
Module 4
51 pages
Two Phase Commit
No ratings yet
Two Phase Commit
10 pages
Slides For Chapter 17: Distributed Transactions: Distributed Systems: Concepts and Design
No ratings yet
Slides For Chapter 17: Distributed Transactions: Distributed Systems: Concepts and Design
24 pages
5 Transaction Processing
No ratings yet
5 Transaction Processing
70 pages
Distributed Transaction Management & Recovery: S.Santhi Priya
No ratings yet
Distributed Transaction Management & Recovery: S.Santhi Priya
16 pages
Distributed Systems - Fault Tolerance
No ratings yet
Distributed Systems - Fault Tolerance
21 pages
Distributed Transactions - Database Systems
No ratings yet
Distributed Transactions - Database Systems
10 pages
Slides For Chapter 10: Distributed Transactions: Distributed Systems: Concepts and Design
No ratings yet
Slides For Chapter 10: Distributed Transactions: Distributed Systems: Concepts and Design
46 pages
Word Unit5
No ratings yet
Word Unit5
19 pages
Building Full Stack DeFi Applications: A practical guide to creating your own decentralized finance projects on blockchain
From Everand
Building Full Stack DeFi Applications: A practical guide to creating your own decentralized finance projects on blockchain
Samuel Zhou
No ratings yet
Dbms Tutorial
No ratings yet
Dbms Tutorial
13 pages
The Data Science Process Course Slides Red
No ratings yet
The Data Science Process Course Slides Red
95 pages
2.data Mining and Audit Tools
No ratings yet
2.data Mining and Audit Tools
27 pages
Tree Manager
No ratings yet
Tree Manager
6 pages
Andromeda
No ratings yet
Andromeda
6 pages
Ow To Increase Your SEO Traffic in 30 Dayssssssssssss
No ratings yet
Ow To Increase Your SEO Traffic in 30 Dayssssssssssss
20 pages
Information-Management Unit 3
No ratings yet
Information-Management Unit 3
20 pages
SOLMAN System Decommission Process
No ratings yet
SOLMAN System Decommission Process
2 pages
Skripsi Full Tanpa Bab Pembahasan PDF
No ratings yet
Skripsi Full Tanpa Bab Pembahasan PDF
76 pages
SQL Commands
No ratings yet
SQL Commands
34 pages
Database Security Lab Manual
100% (2)
Database Security Lab Manual
65 pages
Experiment 10
No ratings yet
Experiment 10
17 pages
Chapter 2 Part 1
No ratings yet
Chapter 2 Part 1
87 pages
Mysql Commands Notes
100% (1)
Mysql Commands Notes
23 pages
What Are Enterprise Application Integration (EAI) and Enterprise Information Integration (EII) Technologies?
No ratings yet
What Are Enterprise Application Integration (EAI) and Enterprise Information Integration (EII) Technologies?
5 pages
SQL Exercises - Pieces and Providers - Wikibooks, Open Books For An Open World With Answers PDF
No ratings yet
SQL Exercises - Pieces and Providers - Wikibooks, Open Books For An Open World With Answers PDF
3 pages
Interview Questions Consolidated Mainframe
100% (1)
Interview Questions Consolidated Mainframe
8 pages
How To Create Students Database
No ratings yet
How To Create Students Database
50 pages
Parameters and Variables in Informatica
No ratings yet
Parameters and Variables in Informatica
37 pages
AI Meets BI: Artificial Intelligence and Business Intelligence 1st Edition Lakshman Bulusu all chapter instant download
100% (1)
AI Meets BI: Artificial Intelligence and Business Intelligence 1st Edition Lakshman Bulusu all chapter instant download
55 pages
Hbase PDF
No ratings yet
Hbase PDF
8 pages
h13028 WP d65sp2 Migration d7x
No ratings yet
h13028 WP d65sp2 Migration d7x
34 pages
BDA Presentations
No ratings yet
BDA Presentations
26 pages
Student - Aytan Garaisayeva Instruction - Leyla Jumayeva Major - Computer Science Subject - Database
No ratings yet
Student - Aytan Garaisayeva Instruction - Leyla Jumayeva Major - Computer Science Subject - Database
2 pages
BI Lab Manual_SLY (1)
No ratings yet
BI Lab Manual_SLY (1)
86 pages
TaLend Online Training
No ratings yet
TaLend Online Training
4 pages
Uts Access
No ratings yet
Uts Access
3 pages
Clarifying The Research Question Through Secondary Data and Exploration
No ratings yet
Clarifying The Research Question Through Secondary Data and Exploration
25 pages
Q: What Is A Virtual Warehouse in Snowflake and How Do You Create and Manage It?
No ratings yet
Q: What Is A Virtual Warehouse in Snowflake and How Do You Create and Manage It?
2 pages

Distributed Transactions

Uploaded by

Distributed Transactions

Uploaded by

Commit Protocols

In a distributed environment, a coordinator is

Consistency—don't violate DB integrity constraints: execution of the operation

Isolation (Atomicity)—partial results are hidden. That is, execution of a

Durability—effects (of transactions that "happened" or committed) are forever.

Can this lead to a deadlock?

 The two-phase commit (2PC) protocol is widely used

 Coordinator sends a message to each participant informing it of the decision (commit or

 Participants take appropriate action locally.

to the prepare T message from Ci

 Phase 1: Obtaining Preliminary Decision: Identical to 2PC Phase 1.

– Sk must execute undo (T)

You might also like