SlideShare a Scribd company logo
Constraints of
Highly Scalable
Databases
1.
Traditional Databases
Recap of the ACID constraints
“Traditional” databases operate with the Transaction
paradigm that guarantees certain properties
(A) Atomicity
(C) Consistency
(I) Isolation
(D) Durability
The ACID Guarantees
1. Atomicity
Each transaction must be “all
or nothing” - if any part fails,
the whole transaction must be
rolled back as if it never
happened.
2. Consistency
The end-state of a transaction
must follow all the rules defined
in the database: data
constraints, cascades, triggers
etc.
3. Isolation
The result of 2 concurrent
operations should be the same
as if they occurred in
sequential order.
4. Durability
A transaction, once committed, will survive permanently even if the system fails.
This includes disk crashes, power outages, etc.
Locking
● Read / write / range locks
How do they do this?
Concurrency Control
● 2-phase commit (2PC), 3PC
protocols
● Distributed locks
But then came the 2000s
And Scale Happened
Traditional RDBMSs were not designed for
the needs of modern web applications
Global Scale
Netflix knows which movies you
watched, when, at what
point(s) you paused and for
how long, etc.
It then replicates that data
across 3 global data centers.
Volume
In 2008, Facebook had only
100 million users and needed
8,000 shards of MySQL.
Today it has ~ 1.86 Billion users.
Speed
In 2013 Twitter was recording
150,000 new tweets/second
every single day.
What to do?
Scale up! (?)
- Increase memory, cores, CPU
- Cache reads with memcached
- Master-slave replication
- Sharding
NOT
ENOUGH
2.
Redefining Constraints
Replacing ACID with BASE
“DMBS research is about ACID
(mostly). But we forfeit “C” and “I”
for availability, graceful
degradation, and performance.
This tradeoff is fundamental.
- Eric Brewer, 2000
Eric Brewer proposed a new set of properties: BASE
Soft State
Basically
Available
Eventual
consistency
System is always available
for clients (but may not be
consistent)
Database is no longer in
charge of “valid” data state.
The app is now responsible.
If all goes well, all clients will
eventually see the same
thing. Probably.
In the world of BASE parameters,
A different set of priorities rule
Availability is most important
Weak consistency (i.e. stale data) is okay
Approximate answers are okay
Aggressive (optimistic) algorithms are okay
Simple, fast, easy evolution of the schema is important
A new set of constraints:
the CAP Theorem
It is impossible for a distributed computer system to simultaneously
provide more than 2 of these 3 guarantees:
Consistency
Availability
Partition tolerance
(Eric Brewer, 1998-2000)
The CAP Parameters
1. Consistency*
All clients get the same view of
the data, or they get an error
(i.e. every read receives the
most recent write)
2. Availability
All clients can always read and
always write
(i.e. every request receives a
non-error response)
3. Partition tolerance
The system functions even if
some nodes are unavailable
(i.e. system operates despite
an arbitrary number of
messages being dropped by
the network between nodes)
Lightning talk: highly scalable databases and the PACELC theorem
All NoSQL databases live somewhere on this
spectrum, based on how they’re tuned
ACID BASE
● What levels of availability do you choose to provide?
● What levels of consistency do you choose to provide?
● What do you do when a partition is detected?
● How do you recover from a partition event?
But wait…
we’re not
through yet
2010: Daniel Abadi (Yale) says CAP is misleading
The trade-offs defined by CAP’s “pick any 2” are misleading:
● The only time you need to make a trade-off is when there is
a partition event (P)
● Systems that sacrifice C must do so all the time
● But systems that sacrifice A only need to do so when
there’s a partition
Most importantly, you don’t give up C to gain A
You give up C to get another missing ingredient: L
LATENCY
Latency = how long must a client request wait for your response?
Imagine replicating data across global data centers
Data Center 1
Data Center 2
Data Center 3
Data Center 4
Data Center n
Data Center 5
“A high availability requirement implies
that the system must replicate data.
But as soon as a distributed system
replicates data, a tradeoff between
consistency and latency arises.
- Abadi, 2010
The PACELC theorem (Abadi, 2010)
In a system that replicates data:
If a partition (P) is detected, how does the system trade off
○ (A) Availability or
○ (C) Consistency
Else (E) how does the system trade off
○ (L) Latency or
○ (C) Consistency
DDBS P+A P+C E+L E+C
Dynamo,
Cassandra,
Riak
Mongo,
H-Store, VoltDb
Yahoo! PNUTS
Comparing NoSQL databases using PACELC
References
Images and title ideas from:
○ http://blog.nahurst.com/visual-guide-to-nosql-systems
○ http://digbigdata.com/know-thy-cap-theorem-for-nosql/
Detailed references at:
○ http://www.bardoloi.com/blog/2017/03/06/pacelc-theorem/
thanks!
Any questions?
You can find me at
@bardoloi

More Related Content

What's hot (18)

Distributed systems and consistency
Distributed systems and consistency
seldo
 
Beyond Strong Consistency
Beyond Strong Consistency
jsinglet
 
BASE: An Acid Alternative
BASE: An Acid Alternative
Hiroshi Ono
 
CAP Theorem
CAP Theorem
Vikash Kodati
 
Consistency in Distributed Systems
Consistency in Distributed Systems
Shane Johnson
 
NoSQL databases, the CAP theorem, and the theory of relativity
NoSQL databases, the CAP theorem, and the theory of relativity
Lars Marius Garshol
 
The Economics of Scale: Promises and Perils of Going Distributed
The Economics of Scale: Promises and Perils of Going Distributed
Tyler Treat
 
Global Mutable State Analysis in Spring MVC Applications
Global Mutable State Analysis in Spring MVC Applications
jsinglet
 
Simple Solutions for Complex Problems
Simple Solutions for Complex Problems
Tyler Treat
 
HbaseHivePigbyRohitDubey
HbaseHivePigbyRohitDubey
Rohit Dubey
 
No sql (not only sql)
No sql (not only sql)
Priyodarshini Dhar
 
Design patterns in distributed system
Design patterns in distributed system
Tom Huynh
 
Distributed Systems: scalability and high availability
Distributed Systems: scalability and high availability
Renato Lucindo
 
Queue centric pattern
Queue centric pattern
Sagar Rao
 
Software Architectures, Week 5 - Advanced Architectures
Software Architectures, Week 5 - Advanced Architectures
Angelos Kapsimanis
 
Intro to distributed systems
Intro to distributed systems
Ahmed Soliman
 
CAP: Scaling, HA
CAP: Scaling, HA
Vitaly Peregudov
 
FAULT TOLERANCE
FAULT TOLERANCE
Poonam Yadav
 
Distributed systems and consistency
Distributed systems and consistency
seldo
 
Beyond Strong Consistency
Beyond Strong Consistency
jsinglet
 
BASE: An Acid Alternative
BASE: An Acid Alternative
Hiroshi Ono
 
Consistency in Distributed Systems
Consistency in Distributed Systems
Shane Johnson
 
NoSQL databases, the CAP theorem, and the theory of relativity
NoSQL databases, the CAP theorem, and the theory of relativity
Lars Marius Garshol
 
The Economics of Scale: Promises and Perils of Going Distributed
The Economics of Scale: Promises and Perils of Going Distributed
Tyler Treat
 
Global Mutable State Analysis in Spring MVC Applications
Global Mutable State Analysis in Spring MVC Applications
jsinglet
 
Simple Solutions for Complex Problems
Simple Solutions for Complex Problems
Tyler Treat
 
HbaseHivePigbyRohitDubey
HbaseHivePigbyRohitDubey
Rohit Dubey
 
Design patterns in distributed system
Design patterns in distributed system
Tom Huynh
 
Distributed Systems: scalability and high availability
Distributed Systems: scalability and high availability
Renato Lucindo
 
Queue centric pattern
Queue centric pattern
Sagar Rao
 
Software Architectures, Week 5 - Advanced Architectures
Software Architectures, Week 5 - Advanced Architectures
Angelos Kapsimanis
 
Intro to distributed systems
Intro to distributed systems
Ahmed Soliman
 

Viewers also liked (19)

Pancreatitis and peritonitis
Pancreatitis and peritonitis
Victor Vk
 
Study pre-registration: Benefits and considerations
Study pre-registration: Benefits and considerations
Krzysztof Gorgolewski
 
Jbpm6
Jbpm6
Prabakar Singaram
 
lv2 y v2 - formato y estructura compositiva
lv2 y v2 - formato y estructura compositiva
Coqui Podestá
 
3Com 20-0469-001 A
3Com 20-0469-001 A
savomir
 
PMP Lecture 3: Project Management Processes
PMP Lecture 3: Project Management Processes
Mohamed Loey
 
ребус метод
ребус метод
Валентина Сідненко
 
Taller 6 tarea organizador gráfico
Taller 6 tarea organizador gráfico
Isamalia Muniz
 
3клас. матем осн здоровя
3клас. матем осн здоровя
Андрій Сенчакевич
 
Constelacion (1)
Constelacion (1)
Andrea Aguirre Gómez
 
EFFECTIVE BUSINESS DECISION MAKING CONCEPTS AND PROCESS
EFFECTIVE BUSINESS DECISION MAKING CONCEPTS AND PROCESS
VISHAL VERMA LAKHNAWI JI
 
Second trimestric soft markers of aneuploidy
Second trimestric soft markers of aneuploidy
Special Fetal Care Unit Ain Shams University Hospital
 
PMP Lecture 2: Project Management Framework
PMP Lecture 2: Project Management Framework
Mohamed Loey
 
SMELOANS
SMELOANS
LOAN KING
 
Strategic Level of Confidence Matrix 2017
Strategic Level of Confidence Matrix 2017
David Christensen
 
Geog. 102 geography of agriculture
Geog. 102 geography of agriculture
Susan White
 
Laboratory Method Verification, March 2017
Laboratory Method Verification, March 2017
Ola Elgaddar
 
NAVIGATION ALERT SYSTEM FOR FISHERMEN WITH SOLAR POWER HARVESTING
NAVIGATION ALERT SYSTEM FOR FISHERMEN WITH SOLAR POWER HARVESTING
AM Publications
 
Fun Core Gym Pdf
Fun Core Gym Pdf
ITALY COFFEE TEA STORE
 
Pancreatitis and peritonitis
Pancreatitis and peritonitis
Victor Vk
 
Study pre-registration: Benefits and considerations
Study pre-registration: Benefits and considerations
Krzysztof Gorgolewski
 
lv2 y v2 - formato y estructura compositiva
lv2 y v2 - formato y estructura compositiva
Coqui Podestá
 
3Com 20-0469-001 A
3Com 20-0469-001 A
savomir
 
PMP Lecture 3: Project Management Processes
PMP Lecture 3: Project Management Processes
Mohamed Loey
 
Taller 6 tarea organizador gráfico
Taller 6 tarea organizador gráfico
Isamalia Muniz
 
EFFECTIVE BUSINESS DECISION MAKING CONCEPTS AND PROCESS
EFFECTIVE BUSINESS DECISION MAKING CONCEPTS AND PROCESS
VISHAL VERMA LAKHNAWI JI
 
PMP Lecture 2: Project Management Framework
PMP Lecture 2: Project Management Framework
Mohamed Loey
 
Strategic Level of Confidence Matrix 2017
Strategic Level of Confidence Matrix 2017
David Christensen
 
Geog. 102 geography of agriculture
Geog. 102 geography of agriculture
Susan White
 
Laboratory Method Verification, March 2017
Laboratory Method Verification, March 2017
Ola Elgaddar
 
NAVIGATION ALERT SYSTEM FOR FISHERMEN WITH SOLAR POWER HARVESTING
NAVIGATION ALERT SYSTEM FOR FISHERMEN WITH SOLAR POWER HARVESTING
AM Publications
 
Ad

Similar to Lightning talk: highly scalable databases and the PACELC theorem (20)

NoSQL Evolution
NoSQL Evolution
Abdul Manaf
 
Hbase hive pig
Hbase hive pig
Xuhong Zhang
 
No sql databases
No sql databases
Ankit Dubey
 
To SQL or NoSQL, that is the question
To SQL or NoSQL, that is the question
Krishnakumar S
 
NoSQL
NoSQL
RithikRaj25
 
Data Engineering for Data Scientists
Data Engineering for Data Scientists
jlacefie
 
cse40822-CAP.pptx
cse40822-CAP.pptx
NedaaHamed1
 
Big data 101 for beginners riga dev days
Big data 101 for beginners riga dev days
Duyhai Doan
 
CS 542 Parallel DBs, NoSQL, MapReduce
CS 542 Parallel DBs, NoSQL, MapReduce
J Singh
 
Database Throwdown Introduction
Database Throwdown Introduction
Sean Collins
 
CAP and BASE
CAP and BASE
Dinesh Varadharajan
 
Lecture-04-Principles of data management.pdf
Lecture-04-Principles of data management.pdf
manimozhi98
 
Hbase hivepig
Hbase hivepig
Radha Krishna
 
17-NoSQL.pptx
17-NoSQL.pptx
levichan1
 
CM2-Data model for Big Data chapter2.pdf
CM2-Data model for Big Data chapter2.pdf
ArsimKrasniqi5
 
6269441.ppt
6269441.ppt
Swapna Jk
 
A Critique of the CAP Theorem by Martin Kleppmann
A Critique of the CAP Theorem by Martin Kleppmann
mustafa sarac
 
NoSQL with Microsoft Azure
NoSQL with Microsoft Azure
Khalid Salama
 
NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...
NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...
Felix Gessert
 
Cap in depth
Cap in depth
Ioanna Tsalouchidou
 
No sql databases
No sql databases
Ankit Dubey
 
To SQL or NoSQL, that is the question
To SQL or NoSQL, that is the question
Krishnakumar S
 
Data Engineering for Data Scientists
Data Engineering for Data Scientists
jlacefie
 
cse40822-CAP.pptx
cse40822-CAP.pptx
NedaaHamed1
 
Big data 101 for beginners riga dev days
Big data 101 for beginners riga dev days
Duyhai Doan
 
CS 542 Parallel DBs, NoSQL, MapReduce
CS 542 Parallel DBs, NoSQL, MapReduce
J Singh
 
Database Throwdown Introduction
Database Throwdown Introduction
Sean Collins
 
Lecture-04-Principles of data management.pdf
Lecture-04-Principles of data management.pdf
manimozhi98
 
17-NoSQL.pptx
17-NoSQL.pptx
levichan1
 
CM2-Data model for Big Data chapter2.pdf
CM2-Data model for Big Data chapter2.pdf
ArsimKrasniqi5
 
A Critique of the CAP Theorem by Martin Kleppmann
A Critique of the CAP Theorem by Martin Kleppmann
mustafa sarac
 
NoSQL with Microsoft Azure
NoSQL with Microsoft Azure
Khalid Salama
 
NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...
NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...
Felix Gessert
 
Ad

Recently uploaded (20)

AudGram Review: Build Visually Appealing, AI-Enhanced Audiograms to Engage Yo...
AudGram Review: Build Visually Appealing, AI-Enhanced Audiograms to Engage Yo...
SOFTTECHHUB
 
Oracle Cloud Infrastructure AI Foundations
Oracle Cloud Infrastructure AI Foundations
VICTOR MAESTRE RAMIREZ
 
Enabling BIM / GIS integrations with Other Systems with FME
Enabling BIM / GIS integrations with Other Systems with FME
Safe Software
 
Floods in Valencia: Two FME-Powered Stories of Data Resilience
Floods in Valencia: Two FME-Powered Stories of Data Resilience
Safe Software
 
vertical-cnc-processing-centers-drillteq-v-200-en.pdf
vertical-cnc-processing-centers-drillteq-v-200-en.pdf
AmirStern2
 
“Why It’s Critical to Have an Integrated Development Methodology for Edge AI,...
“Why It’s Critical to Have an Integrated Development Methodology for Edge AI,...
Edge AI and Vision Alliance
 
Crypto Super 500 - 14th Report - June2025.pdf
Crypto Super 500 - 14th Report - June2025.pdf
Stephen Perrenod
 
FIDO Alliance Seminar State of Passkeys.pptx
FIDO Alliance Seminar State of Passkeys.pptx
FIDO Alliance
 
“Addressing Evolving AI Model Challenges Through Memory and Storage,” a Prese...
“Addressing Evolving AI Model Challenges Through Memory and Storage,” a Prese...
Edge AI and Vision Alliance
 
FME for Distribution & Transmission Integrity Management Program (DIMP & TIMP)
FME for Distribution & Transmission Integrity Management Program (DIMP & TIMP)
Safe Software
 
The State of Web3 Industry- Industry Report
The State of Web3 Industry- Industry Report
Liveplex
 
Kubernetes Security Act Now Before It’s Too Late
Kubernetes Security Act Now Before It’s Too Late
Michael Furman
 
Your startup on AWS - How to architect and maintain a Lean and Mean account
Your startup on AWS - How to architect and maintain a Lean and Mean account
angelo60207
 
Providing an OGC API Processes REST Interface for FME Flow
Providing an OGC API Processes REST Interface for FME Flow
Safe Software
 
Scaling GenAI Inference From Prototype to Production: Real-World Lessons in S...
Scaling GenAI Inference From Prototype to Production: Real-World Lessons in S...
Anish Kumar
 
Introduction to Typescript - GDG On Campus EUE
Introduction to Typescript - GDG On Campus EUE
Google Developer Group On Campus European Universities in Egypt
 
TrustArc Webinar - 2025 Global Privacy Survey
TrustArc Webinar - 2025 Global Privacy Survey
TrustArc
 
FIDO Seminar: Targeting Trust: The Future of Identity in the Workforce.pptx
FIDO Seminar: Targeting Trust: The Future of Identity in the Workforce.pptx
FIDO Alliance
 
Oracle Cloud and AI Specialization Program
Oracle Cloud and AI Specialization Program
VICTOR MAESTRE RAMIREZ
 
No-Code Workflows for CAD & 3D Data: Scaling AI-Driven Infrastructure
No-Code Workflows for CAD & 3D Data: Scaling AI-Driven Infrastructure
Safe Software
 
AudGram Review: Build Visually Appealing, AI-Enhanced Audiograms to Engage Yo...
AudGram Review: Build Visually Appealing, AI-Enhanced Audiograms to Engage Yo...
SOFTTECHHUB
 
Oracle Cloud Infrastructure AI Foundations
Oracle Cloud Infrastructure AI Foundations
VICTOR MAESTRE RAMIREZ
 
Enabling BIM / GIS integrations with Other Systems with FME
Enabling BIM / GIS integrations with Other Systems with FME
Safe Software
 
Floods in Valencia: Two FME-Powered Stories of Data Resilience
Floods in Valencia: Two FME-Powered Stories of Data Resilience
Safe Software
 
vertical-cnc-processing-centers-drillteq-v-200-en.pdf
vertical-cnc-processing-centers-drillteq-v-200-en.pdf
AmirStern2
 
“Why It’s Critical to Have an Integrated Development Methodology for Edge AI,...
“Why It’s Critical to Have an Integrated Development Methodology for Edge AI,...
Edge AI and Vision Alliance
 
Crypto Super 500 - 14th Report - June2025.pdf
Crypto Super 500 - 14th Report - June2025.pdf
Stephen Perrenod
 
FIDO Alliance Seminar State of Passkeys.pptx
FIDO Alliance Seminar State of Passkeys.pptx
FIDO Alliance
 
“Addressing Evolving AI Model Challenges Through Memory and Storage,” a Prese...
“Addressing Evolving AI Model Challenges Through Memory and Storage,” a Prese...
Edge AI and Vision Alliance
 
FME for Distribution & Transmission Integrity Management Program (DIMP & TIMP)
FME for Distribution & Transmission Integrity Management Program (DIMP & TIMP)
Safe Software
 
The State of Web3 Industry- Industry Report
The State of Web3 Industry- Industry Report
Liveplex
 
Kubernetes Security Act Now Before It’s Too Late
Kubernetes Security Act Now Before It’s Too Late
Michael Furman
 
Your startup on AWS - How to architect and maintain a Lean and Mean account
Your startup on AWS - How to architect and maintain a Lean and Mean account
angelo60207
 
Providing an OGC API Processes REST Interface for FME Flow
Providing an OGC API Processes REST Interface for FME Flow
Safe Software
 
Scaling GenAI Inference From Prototype to Production: Real-World Lessons in S...
Scaling GenAI Inference From Prototype to Production: Real-World Lessons in S...
Anish Kumar
 
TrustArc Webinar - 2025 Global Privacy Survey
TrustArc Webinar - 2025 Global Privacy Survey
TrustArc
 
FIDO Seminar: Targeting Trust: The Future of Identity in the Workforce.pptx
FIDO Seminar: Targeting Trust: The Future of Identity in the Workforce.pptx
FIDO Alliance
 
Oracle Cloud and AI Specialization Program
Oracle Cloud and AI Specialization Program
VICTOR MAESTRE RAMIREZ
 
No-Code Workflows for CAD & 3D Data: Scaling AI-Driven Infrastructure
No-Code Workflows for CAD & 3D Data: Scaling AI-Driven Infrastructure
Safe Software
 

Lightning talk: highly scalable databases and the PACELC theorem

  • 2. 1. Traditional Databases Recap of the ACID constraints
  • 3. “Traditional” databases operate with the Transaction paradigm that guarantees certain properties (A) Atomicity (C) Consistency (I) Isolation (D) Durability
  • 4. The ACID Guarantees 1. Atomicity Each transaction must be “all or nothing” - if any part fails, the whole transaction must be rolled back as if it never happened. 2. Consistency The end-state of a transaction must follow all the rules defined in the database: data constraints, cascades, triggers etc. 3. Isolation The result of 2 concurrent operations should be the same as if they occurred in sequential order. 4. Durability A transaction, once committed, will survive permanently even if the system fails. This includes disk crashes, power outages, etc.
  • 5. Locking ● Read / write / range locks How do they do this? Concurrency Control ● 2-phase commit (2PC), 3PC protocols ● Distributed locks
  • 6. But then came the 2000s
  • 8. Traditional RDBMSs were not designed for the needs of modern web applications Global Scale Netflix knows which movies you watched, when, at what point(s) you paused and for how long, etc. It then replicates that data across 3 global data centers. Volume In 2008, Facebook had only 100 million users and needed 8,000 shards of MySQL. Today it has ~ 1.86 Billion users. Speed In 2013 Twitter was recording 150,000 new tweets/second every single day.
  • 9. What to do? Scale up! (?) - Increase memory, cores, CPU - Cache reads with memcached - Master-slave replication - Sharding
  • 12. “DMBS research is about ACID (mostly). But we forfeit “C” and “I” for availability, graceful degradation, and performance. This tradeoff is fundamental. - Eric Brewer, 2000
  • 13. Eric Brewer proposed a new set of properties: BASE Soft State Basically Available Eventual consistency System is always available for clients (but may not be consistent) Database is no longer in charge of “valid” data state. The app is now responsible. If all goes well, all clients will eventually see the same thing. Probably.
  • 14. In the world of BASE parameters, A different set of priorities rule Availability is most important Weak consistency (i.e. stale data) is okay Approximate answers are okay Aggressive (optimistic) algorithms are okay Simple, fast, easy evolution of the schema is important
  • 15. A new set of constraints: the CAP Theorem It is impossible for a distributed computer system to simultaneously provide more than 2 of these 3 guarantees: Consistency Availability Partition tolerance (Eric Brewer, 1998-2000)
  • 16. The CAP Parameters 1. Consistency* All clients get the same view of the data, or they get an error (i.e. every read receives the most recent write) 2. Availability All clients can always read and always write (i.e. every request receives a non-error response) 3. Partition tolerance The system functions even if some nodes are unavailable (i.e. system operates despite an arbitrary number of messages being dropped by the network between nodes)
  • 18. All NoSQL databases live somewhere on this spectrum, based on how they’re tuned ACID BASE ● What levels of availability do you choose to provide? ● What levels of consistency do you choose to provide? ● What do you do when a partition is detected? ● How do you recover from a partition event?
  • 20. 2010: Daniel Abadi (Yale) says CAP is misleading The trade-offs defined by CAP’s “pick any 2” are misleading: ● The only time you need to make a trade-off is when there is a partition event (P) ● Systems that sacrifice C must do so all the time ● But systems that sacrifice A only need to do so when there’s a partition Most importantly, you don’t give up C to gain A You give up C to get another missing ingredient: L
  • 21. LATENCY Latency = how long must a client request wait for your response?
  • 22. Imagine replicating data across global data centers Data Center 1 Data Center 2 Data Center 3 Data Center 4 Data Center n Data Center 5
  • 23. “A high availability requirement implies that the system must replicate data. But as soon as a distributed system replicates data, a tradeoff between consistency and latency arises. - Abadi, 2010
  • 24. The PACELC theorem (Abadi, 2010) In a system that replicates data: If a partition (P) is detected, how does the system trade off ○ (A) Availability or ○ (C) Consistency Else (E) how does the system trade off ○ (L) Latency or ○ (C) Consistency
  • 25. DDBS P+A P+C E+L E+C Dynamo, Cassandra, Riak Mongo, H-Store, VoltDb Yahoo! PNUTS Comparing NoSQL databases using PACELC
  • 26. References Images and title ideas from: ○ http://blog.nahurst.com/visual-guide-to-nosql-systems ○ http://digbigdata.com/know-thy-cap-theorem-for-nosql/ Detailed references at: ○ http://www.bardoloi.com/blog/2017/03/06/pacelc-theorem/
  • 27. thanks! Any questions? You can find me at @bardoloi