SlideShare a Scribd company logo
Why your RDBMS’s fails at scale
And one built for it…..
DataStax: from validation to momentum.
400+
Employees
$190M
Funding
500+
Customers
Founded in April 2010
Santa Clara • San Francisco • Austin •
London • Paris • Berlin • Tokyo • Sydney
(Series E – Sept. 2014) 30% +
2016, 2017 World’s Best
100 Cloud Companies
Ranked #1 in multiple operational
database categories
© 2017 DataStax, All Rights Reserved. Company Confidential
Let’s take
a moment
Your business interacts with people,
processes and things all the time
© 2017 DataStax, All Rights Reserved. Company Confidential
CONTEXTUAL
Real-time, globally distributed cloud
applications must meet expectations.
ALWAYS-ON DISTRIBUTED SCALABLEREAL-TIME
© 2017 DataStax, All Rights Reserved. Company Confidential
Netflix disrupted video distribution
and creation with a cloud application
70 million
Customers
400
Cities
125 million
Hours Watched per Day
© DataStax, All Rights Reserved.5
Microsoft remains a leader
in collaboration with a cloud application
#1
Deployed App
in Enterprises
5 Million
Events Per
Organization a Month
60 Million
Monthly Active Users
6
© DataStax, All Rights Reserved.7
No Downtime: 4 Black Fridays in a row
Potted history of the Database
Database and the Internet
1970 Invented by
E.F. Codd at IBM
1979 First
commercial RDBMS
available (Oracle V2)
1983 official birth of
internet or TCP/IP
1986 SQL becomes
international
standard
1993 WWW finally
available
1995 First internet
based applications
arrive
Explosion of Cloud Applications
Some of the issues faced
• How do you scale the database?
• Add more RAM
• Add more CPU
• Add faster and more disks
• How do you do this?
• Bring the database OFFLINE
• Vertical scaling has a finite limit
Some of the issues faced
• How do you scale client connections?
• Add a connection pool
• But this has a finite limit
• Adds complexity
Listener
Connection Pool
Single Points of Failure
• With a single database we have a SPOF
• Use replication
• Problem solved
• But now
• Single Master
• Scales for Reads not Writes
• Action needed if Master goes down
• Only suitable for LAN deployments
Master
Read Only
Subscriber
But How to Horizontally Scale
• Shard your data across databases
• Each shard needs a replica
• Need a load balancer
• Just showing 2 shards
• Things get more complicated
• Could have multiple read only
subscribers
A-M
N-Z
Master
Read Only
Subscriber
L
o
a
d
b
a
l
a
n
c
e
r
D
What about multiple Data Centres?
• Extremely complicated
• Difficult to support Active
Active
• Need to consider conflicts
• More Disaster Recovery than
Disaster Avoidance
Traditional Data Models don’t help
• Normalised Data Model
• Random seeks result in high
volume of I/O operations
• Joins extremely expensive
• Won’t scale horizontally
• De-Normalised Data Model
• Sequential seek to return results
• Joins eliminated
• Scales indefinitely
1:M
M:N
Summary
• Traditional Databases developed before the web and cloud based
applications
• Scaling up results in downtime
• Single node is a single point of failure
• Number of client connections finite
• Add a read only replica for high availability
• Shard to horizontally scale
• Data Center support extremely difficult
• Data model not built for horizontal scale
A new approach is required
17
Client/Server
1990s
Cloud
Today
Web
2000s
© DataStax, All Rights Reserved.
Scaling out solves the distributed problem
18 © DataStax, All Rights Reserved.
SCALE-OUT APP
LAYER
SCALE-OUT DATA
LAYER
MASTER-SLAVE DATABASE
San
Francisco
New York
London
So What’s the answer?
• Distributed masterless NoSQL
Database
• Continuous Availability
• Disaster Avoidance
• Linear Scale Performance
• Add nodes to scale
• Runs on Commodity Hardware
• Cloud or on Premise or Hybrid
Linear Scalability
• Have More Data? Add more nodes.
• Need More Throughput? Add more nodes.
http://techblog.netflix.com/2011/11/benchmarking-cassandra-scalability-on.html
9000 Nodes
700 Nodes
400 Nodes
Continuous Availability
• Nodes Down != Database Down
• Datacenter Down != Database Down
• Upgrade != Database Down
Platform for Cloud Applications
DataStax is a registered trademark of DataStax, Inc. and its subsidiaries in the United States and/or
The most innovative companies use DataStax
© DataStax, All Rights Reserved.23
2010 2012 2014 2016 2017
Key takeaways
• Why your RDBMS fails at scale
• Fundamentally not built for cloud based applications
• World’s leading brands rely on DataStax for globally distributed data
management
• Next steps: Download today at www.datastax.com and
register for your DataStax Academy account for free online training
DataStax: The power behind the moment
24
Backup Slides
25
ACID is a lie with data replication
Scenario: client with read-heavy workload decides to add asynchronous
replication, so there is lag for propagating data from master to the slave.
• Consistency: If a client decides to do a read to the slave before the data
is replicated, it’s going to get the old data back, which means loss of
consistency
• Atomicity: not having the correct data results in the failure of entire
transaction
• Isolation: receiving the old data means loss of isolation
• Durability: client will receive the old data and not the data it had written to
the master node
When applying RDBMS to Big Data replication, ACID collapses
26
CAP tradeoffs
• Relational databases choose strong consistency over high availability
• Latency between data centers makes consistency impractical
• NoSQL databases like Cassandra choose high availability and partition tolerance over
consistency.
• Data is replicated asynchronously across multiple data centers. We are LIMITED
by the speed of light making consistency impossible.
• Lets you specify consistency level (one replica vs majority of replicas) suitable for
your application
Can’t be both consistent and highly available during a network partition
27
Replication Complexity in RDBMS
© 2015 DataStax, All Rights
Reserved.28
*Source: Oracle Database 12c New Features, Slide 17. (http://bit.ly/1MIxKc1)
HTTP Application Message Queue
Streaming
Analytics
Batch
Analytics
Real-time
DSE Real-time Analytics IoT Reference Architecture
© 2015 DataStax, All Rights
Reserved.29
Ad

Recommended

Beyond Batch: Is ETL still relevant in the API economy?
Beyond Batch: Is ETL still relevant in the API economy?
SnapLogic
 
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...
DataStax
 
Introduction: Architecting for Scale
Introduction: Architecting for Scale
DataStax
 
Webinar | Data Management for Hybrid and Multi-Cloud: A Four-Step Journey
Webinar | Data Management for Hybrid and Multi-Cloud: A Four-Step Journey
DataStax
 
Be a Successful DBA in the World of Cloud and On-Premises Data
Be a Successful DBA in the World of Cloud and On-Premises Data
Grant Fritchey
 
Data Warehouse in Cloud
Data Warehouse in Cloud
Pawan Bhargava
 
Webinar: Customer Experience in Banking - a CTO's Perspective
Webinar: Customer Experience in Banking - a CTO's Perspective
DataStax
 
Designing a Distributed Cloud Database for Dummies
Designing a Distributed Cloud Database for Dummies
DataStax
 
Scaling Multi-Cloud Deployments with Denodo: Automated Infrastructure Management
Scaling Multi-Cloud Deployments with Denodo: Automated Infrastructure Management
Denodo
 
Fundraising and Technology: A Match Made in the Cloud
Fundraising and Technology: A Match Made in the Cloud
Blackbaud
 
Get Mainframe and IBM i Data to Snowflake
Get Mainframe and IBM i Data to Snowflake
Precisely
 
Data Warehousing in the Cloud: Practical Migration Strategies
Data Warehousing in the Cloud: Practical Migration Strategies
SnapLogic
 
Altis AWS Snowflake Practice
Altis AWS Snowflake Practice
SamanthaSwain7
 
Presentation by Bart Gielen (DataSense) at the Data Vault Modelling and Data ...
Presentation by Bart Gielen (DataSense) at the Data Vault Modelling and Data ...
Patrick Van Renterghem
 
Cloud Computing and Big Data
Cloud Computing and Big Data
Robert Keahey
 
Don’t Bring Old Problems to Your New Cloud Data Warehouse
Don’t Bring Old Problems to Your New Cloud Data Warehouse
Precisely
 
Postgres Vision 2018: Five Sharding Data Models
Postgres Vision 2018: Five Sharding Data Models
EDB
 
Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...
Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...
DataStax
 
Webinar - Data Management for the "Right-Now" Economy - The 5 Key Ingredients
Webinar - Data Management for the "Right-Now" Economy - The 5 Key Ingredients
DataStax
 
Microsof azure class 1- intro
Microsof azure class 1- intro
MHMuhammadAli1
 
Data lake
Data lake
GHAZOUANI WAEL
 
Webinar: It's the 21st Century - Why Isn't Your Data Integration Loosely Coup...
Webinar: It's the 21st Century - Why Isn't Your Data Integration Loosely Coup...
SnapLogic
 
Designing Fault-Tolerant Applications with DataStax Enterprise and Apache Cas...
Designing Fault-Tolerant Applications with DataStax Enterprise and Apache Cas...
DataStax
 
How to Choose a Host for a Big Data Project
How to Choose a Host for a Big Data Project
Peak Hosting
 
A Mashup with Backbone
A Mashup with Backbone
C/D/H Technology Consultants
 
Fixing data science & Accelerating Artificial Super Intelligence Development
Fixing data science & Accelerating Artificial Super Intelligence Development
ManojKumarR41
 
How to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
How to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
DataStax
 
Webinar: Comparing DataStax Enterprise with Open Source Apache Cassandra
Webinar: Comparing DataStax Enterprise with Open Source Apache Cassandra
DataStax
 
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
DataStax
 
DataStax
DataStax
Michael Shaler
 

More Related Content

What's hot (20)

Scaling Multi-Cloud Deployments with Denodo: Automated Infrastructure Management
Scaling Multi-Cloud Deployments with Denodo: Automated Infrastructure Management
Denodo
 
Fundraising and Technology: A Match Made in the Cloud
Fundraising and Technology: A Match Made in the Cloud
Blackbaud
 
Get Mainframe and IBM i Data to Snowflake
Get Mainframe and IBM i Data to Snowflake
Precisely
 
Data Warehousing in the Cloud: Practical Migration Strategies
Data Warehousing in the Cloud: Practical Migration Strategies
SnapLogic
 
Altis AWS Snowflake Practice
Altis AWS Snowflake Practice
SamanthaSwain7
 
Presentation by Bart Gielen (DataSense) at the Data Vault Modelling and Data ...
Presentation by Bart Gielen (DataSense) at the Data Vault Modelling and Data ...
Patrick Van Renterghem
 
Cloud Computing and Big Data
Cloud Computing and Big Data
Robert Keahey
 
Don’t Bring Old Problems to Your New Cloud Data Warehouse
Don’t Bring Old Problems to Your New Cloud Data Warehouse
Precisely
 
Postgres Vision 2018: Five Sharding Data Models
Postgres Vision 2018: Five Sharding Data Models
EDB
 
Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...
Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...
DataStax
 
Webinar - Data Management for the "Right-Now" Economy - The 5 Key Ingredients
Webinar - Data Management for the "Right-Now" Economy - The 5 Key Ingredients
DataStax
 
Microsof azure class 1- intro
Microsof azure class 1- intro
MHMuhammadAli1
 
Data lake
Data lake
GHAZOUANI WAEL
 
Webinar: It's the 21st Century - Why Isn't Your Data Integration Loosely Coup...
Webinar: It's the 21st Century - Why Isn't Your Data Integration Loosely Coup...
SnapLogic
 
Designing Fault-Tolerant Applications with DataStax Enterprise and Apache Cas...
Designing Fault-Tolerant Applications with DataStax Enterprise and Apache Cas...
DataStax
 
How to Choose a Host for a Big Data Project
How to Choose a Host for a Big Data Project
Peak Hosting
 
A Mashup with Backbone
A Mashup with Backbone
C/D/H Technology Consultants
 
Fixing data science & Accelerating Artificial Super Intelligence Development
Fixing data science & Accelerating Artificial Super Intelligence Development
ManojKumarR41
 
How to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
How to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
DataStax
 
Webinar: Comparing DataStax Enterprise with Open Source Apache Cassandra
Webinar: Comparing DataStax Enterprise with Open Source Apache Cassandra
DataStax
 
Scaling Multi-Cloud Deployments with Denodo: Automated Infrastructure Management
Scaling Multi-Cloud Deployments with Denodo: Automated Infrastructure Management
Denodo
 
Fundraising and Technology: A Match Made in the Cloud
Fundraising and Technology: A Match Made in the Cloud
Blackbaud
 
Get Mainframe and IBM i Data to Snowflake
Get Mainframe and IBM i Data to Snowflake
Precisely
 
Data Warehousing in the Cloud: Practical Migration Strategies
Data Warehousing in the Cloud: Practical Migration Strategies
SnapLogic
 
Altis AWS Snowflake Practice
Altis AWS Snowflake Practice
SamanthaSwain7
 
Presentation by Bart Gielen (DataSense) at the Data Vault Modelling and Data ...
Presentation by Bart Gielen (DataSense) at the Data Vault Modelling and Data ...
Patrick Van Renterghem
 
Cloud Computing and Big Data
Cloud Computing and Big Data
Robert Keahey
 
Don’t Bring Old Problems to Your New Cloud Data Warehouse
Don’t Bring Old Problems to Your New Cloud Data Warehouse
Precisely
 
Postgres Vision 2018: Five Sharding Data Models
Postgres Vision 2018: Five Sharding Data Models
EDB
 
Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...
Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...
DataStax
 
Webinar - Data Management for the "Right-Now" Economy - The 5 Key Ingredients
Webinar - Data Management for the "Right-Now" Economy - The 5 Key Ingredients
DataStax
 
Microsof azure class 1- intro
Microsof azure class 1- intro
MHMuhammadAli1
 
Webinar: It's the 21st Century - Why Isn't Your Data Integration Loosely Coup...
Webinar: It's the 21st Century - Why Isn't Your Data Integration Loosely Coup...
SnapLogic
 
Designing Fault-Tolerant Applications with DataStax Enterprise and Apache Cas...
Designing Fault-Tolerant Applications with DataStax Enterprise and Apache Cas...
DataStax
 
How to Choose a Host for a Big Data Project
How to Choose a Host for a Big Data Project
Peak Hosting
 
Fixing data science & Accelerating Artificial Super Intelligence Development
Fixing data science & Accelerating Artificial Super Intelligence Development
ManojKumarR41
 
How to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
How to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
DataStax
 
Webinar: Comparing DataStax Enterprise with Open Source Apache Cassandra
Webinar: Comparing DataStax Enterprise with Open Source Apache Cassandra
DataStax
 

Similar to Datastax - Why Your RDBMS fails at scale (20)

Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
DataStax
 
DataStax
DataStax
Michael Shaler
 
How much money do you lose every time your ecommerce site goes down?
How much money do you lose every time your ecommerce site goes down?
DataStax
 
DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL
DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL
DataStax
 
John Glendenning - Real time data driven services in the Cloud
John Glendenning - Real time data driven services in the Cloud
WeAreEsynergy
 
Don't Let Your Shoppers Drop; 5 Rules for Today’s eCommerce
Don't Let Your Shoppers Drop; 5 Rules for Today’s eCommerce
DataStax
 
Five Lessons in Distributed Databases
Five Lessons in Distributed Databases
jbellis
 
Slides: Relational to NoSQL Migration
Slides: Relational to NoSQL Migration
DATAVERSITY
 
The Top 5 Factors to Consider When Choosing a Big Data Solution
The Top 5 Factors to Consider When Choosing a Big Data Solution
DATAVERSITY
 
The Big Data Ecosystem for Financial Services
The Big Data Ecosystem for Financial Services
DataStax
 
ProtectWise Revolutionizes Enterprise Network Security in the Cloud with Data...
ProtectWise Revolutionizes Enterprise Network Security in the Cloud with Data...
DataStax Academy
 
What ya gonna do?
What ya gonna do?
CQD
 
Beyond Aurora. Scale-out SQL databases for AWS
Beyond Aurora. Scale-out SQL databases for AWS
Clustrix
 
CWIN17 Frankfurt / data_stax_personalisatontopowercx
CWIN17 Frankfurt / data_stax_personalisatontopowercx
Capgemini
 
Big iron 2 (published)
Big iron 2 (published)
Ben Stopford
 
Top 5 Considerations for a Big Data Solution
Top 5 Considerations for a Big Data Solution
DataStax
 
Scaling RDBMS on AWS- ClustrixDB @AWS Meetup 20160711
Scaling RDBMS on AWS- ClustrixDB @AWS Meetup 20160711
Dave Anselmi
 
Database Architecture & Scaling Strategies, in the Cloud & on the Rack
Database Architecture & Scaling Strategies, in the Cloud & on the Rack
Clustrix
 
Webinar: The Performance Challenge: Providing an Amazing Customer Experience ...
Webinar: The Performance Challenge: Providing an Amazing Customer Experience ...
DataStax
 
Scaling data on public clouds
Scaling data on public clouds
Liran Zelkha
 
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
DataStax
 
How much money do you lose every time your ecommerce site goes down?
How much money do you lose every time your ecommerce site goes down?
DataStax
 
DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL
DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL
DataStax
 
John Glendenning - Real time data driven services in the Cloud
John Glendenning - Real time data driven services in the Cloud
WeAreEsynergy
 
Don't Let Your Shoppers Drop; 5 Rules for Today’s eCommerce
Don't Let Your Shoppers Drop; 5 Rules for Today’s eCommerce
DataStax
 
Five Lessons in Distributed Databases
Five Lessons in Distributed Databases
jbellis
 
Slides: Relational to NoSQL Migration
Slides: Relational to NoSQL Migration
DATAVERSITY
 
The Top 5 Factors to Consider When Choosing a Big Data Solution
The Top 5 Factors to Consider When Choosing a Big Data Solution
DATAVERSITY
 
The Big Data Ecosystem for Financial Services
The Big Data Ecosystem for Financial Services
DataStax
 
ProtectWise Revolutionizes Enterprise Network Security in the Cloud with Data...
ProtectWise Revolutionizes Enterprise Network Security in the Cloud with Data...
DataStax Academy
 
What ya gonna do?
What ya gonna do?
CQD
 
Beyond Aurora. Scale-out SQL databases for AWS
Beyond Aurora. Scale-out SQL databases for AWS
Clustrix
 
CWIN17 Frankfurt / data_stax_personalisatontopowercx
CWIN17 Frankfurt / data_stax_personalisatontopowercx
Capgemini
 
Big iron 2 (published)
Big iron 2 (published)
Ben Stopford
 
Top 5 Considerations for a Big Data Solution
Top 5 Considerations for a Big Data Solution
DataStax
 
Scaling RDBMS on AWS- ClustrixDB @AWS Meetup 20160711
Scaling RDBMS on AWS- ClustrixDB @AWS Meetup 20160711
Dave Anselmi
 
Database Architecture & Scaling Strategies, in the Cloud & on the Rack
Database Architecture & Scaling Strategies, in the Cloud & on the Rack
Clustrix
 
Webinar: The Performance Challenge: Providing an Amazing Customer Experience ...
Webinar: The Performance Challenge: Providing an Amazing Customer Experience ...
DataStax
 
Scaling data on public clouds
Scaling data on public clouds
Liran Zelkha
 
Ad

Recently uploaded (20)

最新版美国芝加哥大学毕业证(UChicago毕业证书)原版定制
最新版美国芝加哥大学毕业证(UChicago毕业证书)原版定制
taqyea
 
presentation4.pdf Intro to mcmc methodss
presentation4.pdf Intro to mcmc methodss
SergeyTsygankov6
 
Crafting-Research-Recommendations Grade 12.pptx
Crafting-Research-Recommendations Grade 12.pptx
DaryllWhere
 
最新版美国佐治亚大学毕业证(UGA毕业证书)原版定制
最新版美国佐治亚大学毕业证(UGA毕业证书)原版定制
Taqyea
 
Indigo_Airlines_Strategy_Presentation.pptx
Indigo_Airlines_Strategy_Presentation.pptx
mukeshpurohit991
 
最新版美国约翰霍普金斯大学毕业证(JHU毕业证书)原版定制
最新版美国约翰霍普金斯大学毕业证(JHU毕业证书)原版定制
Taqyea
 
Measurecamp Copenhagen - Consent Context
Measurecamp Copenhagen - Consent Context
Human37
 
Flextronics Employee Safety Data-Project-2.pptx
Flextronics Employee Safety Data-Project-2.pptx
kilarihemadri
 
UPS and Big Data intro to Business Analytics.pptx
UPS and Big Data intro to Business Analytics.pptx
sanjum5582
 
lecture12.pdf Introduction to bioinformatics
lecture12.pdf Introduction to bioinformatics
SergeyTsygankov6
 
最新版美国加利福尼亚大学旧金山法学院毕业证(UCLawSF毕业证书)定制
最新版美国加利福尼亚大学旧金山法学院毕业证(UCLawSF毕业证书)定制
taqyea
 
最新版意大利米兰大学毕业证(UNIMI毕业证书)原版定制
最新版意大利米兰大学毕业证(UNIMI毕业证书)原版定制
taqyea
 
定制OCAD学生卡加拿大安大略艺术与设计大学成绩单范本,OCAD成绩单复刻
定制OCAD学生卡加拿大安大略艺术与设计大学成绩单范本,OCAD成绩单复刻
taqyed
 
Shifting Focus on AI: How it Can Make a Positive Difference
Shifting Focus on AI: How it Can Make a Positive Difference
1508 A/S
 
Artigo - Playing to Win.planejamento docx
Artigo - Playing to Win.planejamento docx
KellyXavier15
 
一比一原版(TUC毕业证书)开姆尼茨工业大学毕业证如何办理
一比一原版(TUC毕业证书)开姆尼茨工业大学毕业证如何办理
taqyed
 
Communication_Skills_Class10_Visual.pptx
Communication_Skills_Class10_Visual.pptx
namanrastogi70555
 
Camuflaje Tipos Características Militar 2025.ppt
Camuflaje Tipos Características Militar 2025.ppt
e58650738
 
Starbucks in the Indian market through its joint venture.
Starbucks in the Indian market through its joint venture.
sales480687
 
624753984-Annex-A3-RPMS-Tool-for-Proficient-Teachers-SY-2024-2025.pdf
624753984-Annex-A3-RPMS-Tool-for-Proficient-Teachers-SY-2024-2025.pdf
CristineGraceAcuyan
 
最新版美国芝加哥大学毕业证(UChicago毕业证书)原版定制
最新版美国芝加哥大学毕业证(UChicago毕业证书)原版定制
taqyea
 
presentation4.pdf Intro to mcmc methodss
presentation4.pdf Intro to mcmc methodss
SergeyTsygankov6
 
Crafting-Research-Recommendations Grade 12.pptx
Crafting-Research-Recommendations Grade 12.pptx
DaryllWhere
 
最新版美国佐治亚大学毕业证(UGA毕业证书)原版定制
最新版美国佐治亚大学毕业证(UGA毕业证书)原版定制
Taqyea
 
Indigo_Airlines_Strategy_Presentation.pptx
Indigo_Airlines_Strategy_Presentation.pptx
mukeshpurohit991
 
最新版美国约翰霍普金斯大学毕业证(JHU毕业证书)原版定制
最新版美国约翰霍普金斯大学毕业证(JHU毕业证书)原版定制
Taqyea
 
Measurecamp Copenhagen - Consent Context
Measurecamp Copenhagen - Consent Context
Human37
 
Flextronics Employee Safety Data-Project-2.pptx
Flextronics Employee Safety Data-Project-2.pptx
kilarihemadri
 
UPS and Big Data intro to Business Analytics.pptx
UPS and Big Data intro to Business Analytics.pptx
sanjum5582
 
lecture12.pdf Introduction to bioinformatics
lecture12.pdf Introduction to bioinformatics
SergeyTsygankov6
 
最新版美国加利福尼亚大学旧金山法学院毕业证(UCLawSF毕业证书)定制
最新版美国加利福尼亚大学旧金山法学院毕业证(UCLawSF毕业证书)定制
taqyea
 
最新版意大利米兰大学毕业证(UNIMI毕业证书)原版定制
最新版意大利米兰大学毕业证(UNIMI毕业证书)原版定制
taqyea
 
定制OCAD学生卡加拿大安大略艺术与设计大学成绩单范本,OCAD成绩单复刻
定制OCAD学生卡加拿大安大略艺术与设计大学成绩单范本,OCAD成绩单复刻
taqyed
 
Shifting Focus on AI: How it Can Make a Positive Difference
Shifting Focus on AI: How it Can Make a Positive Difference
1508 A/S
 
Artigo - Playing to Win.planejamento docx
Artigo - Playing to Win.planejamento docx
KellyXavier15
 
一比一原版(TUC毕业证书)开姆尼茨工业大学毕业证如何办理
一比一原版(TUC毕业证书)开姆尼茨工业大学毕业证如何办理
taqyed
 
Communication_Skills_Class10_Visual.pptx
Communication_Skills_Class10_Visual.pptx
namanrastogi70555
 
Camuflaje Tipos Características Militar 2025.ppt
Camuflaje Tipos Características Militar 2025.ppt
e58650738
 
Starbucks in the Indian market through its joint venture.
Starbucks in the Indian market through its joint venture.
sales480687
 
624753984-Annex-A3-RPMS-Tool-for-Proficient-Teachers-SY-2024-2025.pdf
624753984-Annex-A3-RPMS-Tool-for-Proficient-Teachers-SY-2024-2025.pdf
CristineGraceAcuyan
 
Ad

Datastax - Why Your RDBMS fails at scale

  • 1. Why your RDBMS’s fails at scale And one built for it…..
  • 2. DataStax: from validation to momentum. 400+ Employees $190M Funding 500+ Customers Founded in April 2010 Santa Clara • San Francisco • Austin • London • Paris • Berlin • Tokyo • Sydney (Series E – Sept. 2014) 30% + 2016, 2017 World’s Best 100 Cloud Companies Ranked #1 in multiple operational database categories © 2017 DataStax, All Rights Reserved. Company Confidential
  • 3. Let’s take a moment Your business interacts with people, processes and things all the time © 2017 DataStax, All Rights Reserved. Company Confidential
  • 4. CONTEXTUAL Real-time, globally distributed cloud applications must meet expectations. ALWAYS-ON DISTRIBUTED SCALABLEREAL-TIME © 2017 DataStax, All Rights Reserved. Company Confidential
  • 5. Netflix disrupted video distribution and creation with a cloud application 70 million Customers 400 Cities 125 million Hours Watched per Day © DataStax, All Rights Reserved.5
  • 6. Microsoft remains a leader in collaboration with a cloud application #1 Deployed App in Enterprises 5 Million Events Per Organization a Month 60 Million Monthly Active Users 6
  • 7. © DataStax, All Rights Reserved.7 No Downtime: 4 Black Fridays in a row
  • 8. Potted history of the Database Database and the Internet 1970 Invented by E.F. Codd at IBM 1979 First commercial RDBMS available (Oracle V2) 1983 official birth of internet or TCP/IP 1986 SQL becomes international standard 1993 WWW finally available 1995 First internet based applications arrive
  • 9. Explosion of Cloud Applications
  • 10. Some of the issues faced • How do you scale the database? • Add more RAM • Add more CPU • Add faster and more disks • How do you do this? • Bring the database OFFLINE • Vertical scaling has a finite limit
  • 11. Some of the issues faced • How do you scale client connections? • Add a connection pool • But this has a finite limit • Adds complexity Listener Connection Pool
  • 12. Single Points of Failure • With a single database we have a SPOF • Use replication • Problem solved • But now • Single Master • Scales for Reads not Writes • Action needed if Master goes down • Only suitable for LAN deployments Master Read Only Subscriber
  • 13. But How to Horizontally Scale • Shard your data across databases • Each shard needs a replica • Need a load balancer • Just showing 2 shards • Things get more complicated • Could have multiple read only subscribers A-M N-Z Master Read Only Subscriber L o a d b a l a n c e r D
  • 14. What about multiple Data Centres? • Extremely complicated • Difficult to support Active Active • Need to consider conflicts • More Disaster Recovery than Disaster Avoidance
  • 15. Traditional Data Models don’t help • Normalised Data Model • Random seeks result in high volume of I/O operations • Joins extremely expensive • Won’t scale horizontally • De-Normalised Data Model • Sequential seek to return results • Joins eliminated • Scales indefinitely 1:M M:N
  • 16. Summary • Traditional Databases developed before the web and cloud based applications • Scaling up results in downtime • Single node is a single point of failure • Number of client connections finite • Add a read only replica for high availability • Shard to horizontally scale • Data Center support extremely difficult • Data model not built for horizontal scale
  • 17. A new approach is required 17 Client/Server 1990s Cloud Today Web 2000s © DataStax, All Rights Reserved.
  • 18. Scaling out solves the distributed problem 18 © DataStax, All Rights Reserved. SCALE-OUT APP LAYER SCALE-OUT DATA LAYER MASTER-SLAVE DATABASE
  • 19. San Francisco New York London So What’s the answer? • Distributed masterless NoSQL Database • Continuous Availability • Disaster Avoidance • Linear Scale Performance • Add nodes to scale • Runs on Commodity Hardware • Cloud or on Premise or Hybrid
  • 20. Linear Scalability • Have More Data? Add more nodes. • Need More Throughput? Add more nodes. http://techblog.netflix.com/2011/11/benchmarking-cassandra-scalability-on.html 9000 Nodes 700 Nodes 400 Nodes
  • 21. Continuous Availability • Nodes Down != Database Down • Datacenter Down != Database Down • Upgrade != Database Down
  • 22. Platform for Cloud Applications DataStax is a registered trademark of DataStax, Inc. and its subsidiaries in the United States and/or
  • 23. The most innovative companies use DataStax © DataStax, All Rights Reserved.23 2010 2012 2014 2016 2017
  • 24. Key takeaways • Why your RDBMS fails at scale • Fundamentally not built for cloud based applications • World’s leading brands rely on DataStax for globally distributed data management • Next steps: Download today at www.datastax.com and register for your DataStax Academy account for free online training DataStax: The power behind the moment 24
  • 26. ACID is a lie with data replication Scenario: client with read-heavy workload decides to add asynchronous replication, so there is lag for propagating data from master to the slave. • Consistency: If a client decides to do a read to the slave before the data is replicated, it’s going to get the old data back, which means loss of consistency • Atomicity: not having the correct data results in the failure of entire transaction • Isolation: receiving the old data means loss of isolation • Durability: client will receive the old data and not the data it had written to the master node When applying RDBMS to Big Data replication, ACID collapses 26
  • 27. CAP tradeoffs • Relational databases choose strong consistency over high availability • Latency between data centers makes consistency impractical • NoSQL databases like Cassandra choose high availability and partition tolerance over consistency. • Data is replicated asynchronously across multiple data centers. We are LIMITED by the speed of light making consistency impossible. • Lets you specify consistency level (one replica vs majority of replicas) suitable for your application Can’t be both consistent and highly available during a network partition 27
  • 28. Replication Complexity in RDBMS © 2015 DataStax, All Rights Reserved.28 *Source: Oracle Database 12c New Features, Slide 17. (http://bit.ly/1MIxKc1)
  • 29. HTTP Application Message Queue Streaming Analytics Batch Analytics Real-time DSE Real-time Analytics IoT Reference Architecture © 2015 DataStax, All Rights Reserved.29