06 Consensus

Distributed Systems
CPSC 5520 Consensus
Kevin Lundeen
Consensus Protocols

• Consensus
  • “Where everyone agrees”
  • Describes how a distributed system behaves under replication, especially in the face of failures
  • Especially state machine replication, i.e., log replication
• Log Replication
  • Last week we used a logical clock to implement an algorithm that successfully replicated logs, but it was:
    • Intolerant of failures
    • Intolerant of dynamic addition/removal of nodes
    • Dependent on reliable ordered messaging
    • O(n²) messages per log write
  • We’ll study Raft next to overcome most of these issues (the textbook covers Paxos)
  • PBFT is even more robust (it handles Byzantine failures)
Raft

Author’s recent note on the name:

There's a few reasons we came up with the name Raft:
- It's not quite an acronym, but we were thinking about the words 'reliable', 'replicated',
'redundant', and 'fault-tolerant'.
- We were thinking about logs and what can be built using them.
- We were thinking about the island of Paxos and how to escape it.
As a plus, we were using the randomly generated name Cheesomi in the paper
before we came up with the name Raft in September 2012. The name
appeared just over 100 times in our paper submission back then, so switching
to the shorter name actually helped shrink the paper down quite a bit.

- Diego Ongaro

• We will follow the extended version of the Raft paper presented at USENIX ATC 2014
Raft Basics §5.1

• Raft cluster
  • Small number of servers (five is typical)
  • Can tolerate a minority of servers failing simultaneously (two can fail simultaneously in a five-node Raft cluster)
• Each server is in one of three states:
  1. Leader – sole leader that handles all client communications
  2. Follower – most servers merely respond to RPCs from the leader and candidates
  3. Candidate – a server that noticed the absence of a leader and is trying to elect itself leader
• Each leader leads for a term
  • Term numbers are incremented at each new election
  • Once elected, a candidate becomes sole leader for that term
  • All entries for a term are initiated by its leader
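The fault-tolerance arithmetic above can be checked with a few lines of Python. This is a minimal sketch; the names `Role`, `majority`, and `tolerated_failures` are illustrative choices and do not come from the Raft paper.

```python
from enum import Enum


class Role(Enum):
    """The three states a Raft server can be in."""
    FOLLOWER = "follower"
    CANDIDATE = "candidate"
    LEADER = "leader"


def majority(cluster_size: int) -> int:
    """Smallest number of servers that forms a strict majority."""
    return cluster_size // 2 + 1


def tolerated_failures(cluster_size: int) -> int:
    """Servers that may fail while a majority can still be formed."""
    return cluster_size - majority(cluster_size)


for n in (3, 4, 5):
    print(n, majority(n), tolerated_failures(n))
```

Note that a four-node cluster tolerates no more failures than a three-node one, which is one reason odd cluster sizes are common.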
Raft Basics §5.1
(continued)
• Elections can result in a split vote
  • The term ends with no leader (and no entries in the log)
  • Another election ensues
• Terms act as a logical clock
  • The current term is communicated in all RPCs
  • If one server’s current term is smaller than the other’s, it updates its current term to the larger value
  • If a candidate or leader discovers that its term is out of date, it immediately reverts to follower state
  • If a server receives a request with a stale term number, it rejects the request
• Communication is via RPCs
  • RequestVote – candidate to all others
  • AppendEntries – leader to all followers
  • Failed RPCs are retried
  • RPCs are issued in parallel for best performance
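The three term rules can be sketched as a single check that runs on every incoming RPC. The `Server` class and the `observe_term` method are hypothetical names for illustration only. The sketch covers term bookkeeping and nothing else (no voting, no log handling).

```python
from dataclasses import dataclass


@dataclass
class Server:
    current_term: int = 0
    role: str = "follower"  # "follower", "candidate", or "leader"

    def observe_term(self, rpc_term: int) -> bool:
        """Apply the term rules to the term carried by an incoming RPC.

        Returns False if the RPC is stale and must be rejected,
        True if it may be processed further.
        """
        if rpc_term < self.current_term:
            return False  # stale term: reject the request
        if rpc_term > self.current_term:
            # Our term is out of date: adopt the larger term and step down.
            self.current_term = rpc_term
            self.role = "follower"
        return True


leader = Server(current_term=3, role="leader")
print(leader.observe_term(2), leader.role)  # stale request is rejected
print(leader.observe_term(5), leader.role, leader.current_term)
```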
Leader Election §5.2

• Raft uses a heartbeat mechanism to trigger leader election
• If a follower fails to hear from the leader within the election_timeout period, it transitions to candidate state:
  1. Increments its current term number
  2. Votes for itself
  3. Issues RequestVote RPCs to every other server
  4. Becomes leader once a majority of servers grant their votes
  5. Alternatively, if it fails to get a majority within a certain period, it restarts the election (with a new term)
• A candidate may receive messages from:
  • A new leader (this leader got the majority), so it becomes a follower
  • Another candidate; if that candidate has a higher term and a log at least as up-to-date, it reverts to follower and grants its vote
• A follower grants at most one vote per term
  • To the first candidate whose log is at least as up-to-date as its own
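The election steps can be sketched as an in-memory simulation. It is simplified and rests on assumptions: RPCs are plain method calls that never fail, there are no timers, and the `Node` class with its field names is hypothetical. The log comparison uses the term and index of the last log entry, as in §5.4.1 of the paper.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Node:
    node_id: int
    current_term: int = 0
    voted_for: Optional[int] = None
    role: str = "follower"
    last_log_term: int = 0   # term of the last entry in this node's log
    last_log_index: int = 0  # index of the last entry in this node's log

    def handle_request_vote(self, term: int, candidate_id: int,
                            last_log_term: int, last_log_index: int) -> bool:
        if term < self.current_term:
            return False  # stale candidate
        if term > self.current_term:
            # Newer term: adopt it, forget the old vote, step down.
            self.current_term = term
            self.voted_for = None
            self.role = "follower"
        log_ok = (last_log_term, last_log_index) >= (self.last_log_term, self.last_log_index)
        if log_ok and self.voted_for in (None, candidate_id):
            self.voted_for = candidate_id  # at most one vote per term
            return True
        return False

    def start_election(self, peers: List["Node"]) -> bool:
        self.role = "candidate"
        self.current_term += 1        # step 1
        self.voted_for = self.node_id  # step 2
        votes = 1
        for peer in peers:            # step 3 (sequential here, parallel in practice)
            if peer.handle_request_vote(self.current_term, self.node_id,
                                        self.last_log_term, self.last_log_index):
                votes += 1
        if votes > (len(peers) + 1) // 2:  # step 4: strict majority of the cluster
            self.role = "leader"
        return self.role == "leader"


nodes = [Node(i) for i in range(5)]
print(nodes[0].start_election(nodes[1:]), nodes[0].current_term)
```

A real implementation would also make the candidate step down if any reply carried a higher term, and would restart the election after a timeout (step 5). Both are omitted here.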
Up-To-Date §5.4.1

A’s log is more up-to-date than B’s iff:
1. The last entry in A’s log has a later term than the last entry in B’s log, or
2. The last entries of A’s and B’s logs have the same term, but A’s log is longer than B’s
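Section 5.4.1 of the Raft paper makes this comparison on the term and index of the last entry in each log, not on the servers’ current terms. A minimal sketch of that check follows; the function and parameter names are illustrative only.

```python
def more_up_to_date(a_last_term: int, a_last_index: int,
                    b_last_term: int, b_last_index: int) -> bool:
    """True if log A is strictly more up-to-date than log B."""
    if a_last_term != b_last_term:
        return a_last_term > b_last_term  # later last term wins
    return a_last_index > b_last_index    # same last term: longer log wins


def at_least_as_up_to_date(a_last_term: int, a_last_index: int,
                           b_last_term: int, b_last_index: int) -> bool:
    """The condition a voter applies to a candidate's log (A) against its own (B)."""
    return not more_up_to_date(b_last_term, b_last_index, a_last_term, a_last_index)


print(more_up_to_date(3, 4, 2, 9))         # later term beats a longer log
print(at_least_as_up_to_date(2, 5, 2, 5))  # identical logs qualify
```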
Log Replication §5.3

• The paper talks about replicated state machines, so:
  • Their “log” consists of both the entries in the server’s volatile internal queue and the committed entries
  • Entries that have been committed have been applied to the local state machine; uncommitted entries have not
  • Compared with last week’s log replication: our “red line” separated the entries that had been committed (those above the red line) from those that were still in flux. Raft is the same, but the paper calls the combined list the log and marks which entries are committed.
• Each entry has a term number and an index
  • The term number is the leader’s current term at the time the client asked the leader to apply the entry
  • The index is the entry’s position in the log (counted from the beginning of the system)
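One way to picture a log with a “red line” is a list of entries plus a commit index. This is a sketch under assumptions: `LogEntry`, `RaftLog`, and the string `command` field are hypothetical names, and indices start at 1 as in the paper.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass(frozen=True)
class LogEntry:
    term: int     # leader's term when the entry was created
    index: int    # position in the log, starting at 1
    command: str  # opaque state-machine command (illustrative type)


@dataclass
class RaftLog:
    entries: List[LogEntry] = field(default_factory=list)
    commit_index: int = 0  # the "red line"; 0 means nothing is committed yet

    def append(self, term: int, command: str) -> LogEntry:
        entry = LogEntry(term, len(self.entries) + 1, command)
        self.entries.append(entry)
        return entry

    def committed(self) -> List[LogEntry]:
        return self.entries[:self.commit_index]

    def uncommitted(self) -> List[LogEntry]:
        return self.entries[self.commit_index:]


log = RaftLog()
log.append(1, "x=1")
log.append(1, "y=2")
log.append(2, "x=3")
log.commit_index = 2
print([e.index for e in log.committed()], [e.index for e in log.uncommitted()])
```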
Log Replication §5.3
(continued)
• Entry index increments for every new entry
• Term number increments for every election
Log Replication §5.3
(continued)
• A new leader gets all the followers in sync with itself
• A leader sends new entries to the followers
• A leader commits its new entries when they have been replicated on a majority of the servers (the leader counts as one of them)
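The commit rule can be sketched as picking the highest log index that a majority of servers store, counting the leader itself. The function name and the `follower_match_index` list (the highest index known to be replicated on each follower) are illustrative assumptions. The sketch leaves out the paper’s extra restriction in §5.4.2 that a leader only commits this way for entries from its own current term.

```python
from typing import List


def highest_majority_index(leader_last_index: int,
                           follower_match_index: List[int]) -> int:
    """Highest log index stored on a strict majority of the cluster."""
    stored = sorted([leader_last_index] + follower_match_index, reverse=True)
    quorum = len(stored) // 2 + 1
    # The quorum-th largest value is held by at least `quorum` servers.
    return stored[quorum - 1]


# Five servers: the leader has 7 entries; followers have 7, 5, 3 and 2.
print(highest_majority_index(7, [7, 5, 3, 2]))  # 5
```

In the example, index 5 is present on three of the five servers, so entries up to index 5 may be committed while entries 6 and 7 must wait.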
The End
