SlideShare a Scribd company logo
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs

More Related Content

What's hot (20)

PPTX
How Workload Prioritization Reduces Your Datacenter Footprint
ScyllaDB
 
PPTX
How to be Successful with Scylla
ScyllaDB
 
PPTX
Using ScyllaDB with JanusGraph for Cyber Security
ScyllaDB
 
PPTX
Building a Distributed Data Streaming Architecture for Modern Hardware with S...
ScyllaDB
 
PPTX
Sizing Your Scylla Cluster
ScyllaDB
 
PPTX
Scylla’s Journey Towards Being an Elastic Cloud Native Database
ScyllaDB
 
PPTX
FireEye & Scylla: Intel Threat Analysis Using a Graph Database
ScyllaDB
 
PPTX
How SkyElectric Uses Scylla to Power Its Smart Energy Platform
ScyllaDB
 
PPTX
Scylla Summit 2018: Scylla 3.0 and Beyond
ScyllaDB
 
PPTX
Performance Testing: Scylla vs. Cassandra vs. Datastax
ScyllaDB
 
PDF
Introducing Scylla Open Source 4.0
ScyllaDB
 
PPTX
How ReversingLabs Serves File Reputation Service for 10B Files
ScyllaDB
 
PPTX
GPS Insight on Using Presto with Scylla for Data Analytics and Data Archival
ScyllaDB
 
PPTX
ScyllaDB's Avi Kivity on UDF, UDA, and the Future
ScyllaDB
 
PPTX
Scylla Summit 2019 Keynote - Avi Kivity
ScyllaDB
 
PDF
Scylla Summit 2022: How ScyllaDB Powers This Next Tech Cycle
ScyllaDB
 
PPTX
Captial One: Why Stream Data as Part of Data Transformation?
ScyllaDB
 
PPTX
Scylla Summit 2019 Keynote - Dor Laor - Beyond Cassandra
ScyllaDB
 
PDF
Seastar Summit 2019 vectorized.io
ScyllaDB
 
PPTX
Powering a Graph Data System with Scylla + JanusGraph
ScyllaDB
 
How Workload Prioritization Reduces Your Datacenter Footprint
ScyllaDB
 
How to be Successful with Scylla
ScyllaDB
 
Using ScyllaDB with JanusGraph for Cyber Security
ScyllaDB
 
Building a Distributed Data Streaming Architecture for Modern Hardware with S...
ScyllaDB
 
Sizing Your Scylla Cluster
ScyllaDB
 
Scylla’s Journey Towards Being an Elastic Cloud Native Database
ScyllaDB
 
FireEye & Scylla: Intel Threat Analysis Using a Graph Database
ScyllaDB
 
How SkyElectric Uses Scylla to Power Its Smart Energy Platform
ScyllaDB
 
Scylla Summit 2018: Scylla 3.0 and Beyond
ScyllaDB
 
Performance Testing: Scylla vs. Cassandra vs. Datastax
ScyllaDB
 
Introducing Scylla Open Source 4.0
ScyllaDB
 
How ReversingLabs Serves File Reputation Service for 10B Files
ScyllaDB
 
GPS Insight on Using Presto with Scylla for Data Analytics and Data Archival
ScyllaDB
 
ScyllaDB's Avi Kivity on UDF, UDA, and the Future
ScyllaDB
 
Scylla Summit 2019 Keynote - Avi Kivity
ScyllaDB
 
Scylla Summit 2022: How ScyllaDB Powers This Next Tech Cycle
ScyllaDB
 
Captial One: Why Stream Data as Part of Data Transformation?
ScyllaDB
 
Scylla Summit 2019 Keynote - Dor Laor - Beyond Cassandra
ScyllaDB
 
Seastar Summit 2019 vectorized.io
ScyllaDB
 
Powering a Graph Data System with Scylla + JanusGraph
ScyllaDB
 

Similar to High-Load Storage of Users’ Actions with ScyllaDB and HDDs (20)

PDF
Using ScyllaDB for Real-Time Read-Heavy Workloads.pdf
ScyllaDB
 
PDF
Dissecting Real-World Database Performance Dilemmas
ScyllaDB
 
PDF
Dissecting Real-World Database Performance Dilemmas
ScyllaDB
 
PDF
Using ScyllaDB for Real-Time Write-Heavy Workloads
ScyllaDB
 
PDF
Fast NoSQL from HDDs?
ScyllaDB
 
PDF
Scylla Summit 2022: Operating at Monstrous Scales: Benchmarking Petabyte Work...
ScyllaDB
 
PDF
Replacing Your Cache with ScyllaDB by Felipe Cardeneti Mendes and Tomasz Grabiec
ScyllaDB
 
PDF
Achieving Extreme Scale with ScyllaDB: Tips & Tradeoffs
ScyllaDB
 
PPTX
Replacing Your Cache with ScyllaDB
ScyllaDB
 
PPTX
7 Reasons Not to Put an External Cache in Front of Your Database.pptx
ScyllaDB
 
PPTX
4 use cases for C* to Scylla
◄ ★ Jack Pavlov ★ ►
 
PPTX
Understanding Storage I/O Under Load
ScyllaDB
 
PDF
How to achieve no compromise performance and availability
ScyllaDB
 
PDF
Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...
confluent
 
PPTX
Scylla Summit 2022: Scylla 5.0 New Features, Part 1
ScyllaDB
 
PPTX
Real-Time or Analytics Workloads... Why Not Both?
ScyllaDB
 
PDF
AdGear Use Case with Scylla - 1M Queries Per Second with Single-Digit Millise...
ScyllaDB
 
PDF
What Developers Need to Unlearn for High Performance NoSQL
ScyllaDB
 
PDF
Scylla Summit 2017: Intel Optane SSDs as the New Accelerator in Your Data Center
ScyllaDB
 
PPTX
A Deep Dive into ScyllaDB's Architecture
ScyllaDB
 
Using ScyllaDB for Real-Time Read-Heavy Workloads.pdf
ScyllaDB
 
Dissecting Real-World Database Performance Dilemmas
ScyllaDB
 
Dissecting Real-World Database Performance Dilemmas
ScyllaDB
 
Using ScyllaDB for Real-Time Write-Heavy Workloads
ScyllaDB
 
Fast NoSQL from HDDs?
ScyllaDB
 
Scylla Summit 2022: Operating at Monstrous Scales: Benchmarking Petabyte Work...
ScyllaDB
 
Replacing Your Cache with ScyllaDB by Felipe Cardeneti Mendes and Tomasz Grabiec
ScyllaDB
 
Achieving Extreme Scale with ScyllaDB: Tips & Tradeoffs
ScyllaDB
 
Replacing Your Cache with ScyllaDB
ScyllaDB
 
7 Reasons Not to Put an External Cache in Front of Your Database.pptx
ScyllaDB
 
4 use cases for C* to Scylla
◄ ★ Jack Pavlov ★ ►
 
Understanding Storage I/O Under Load
ScyllaDB
 
How to achieve no compromise performance and availability
ScyllaDB
 
Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...
confluent
 
Scylla Summit 2022: Scylla 5.0 New Features, Part 1
ScyllaDB
 
Real-Time or Analytics Workloads... Why Not Both?
ScyllaDB
 
AdGear Use Case with Scylla - 1M Queries Per Second with Single-Digit Millise...
ScyllaDB
 
What Developers Need to Unlearn for High Performance NoSQL
ScyllaDB
 
Scylla Summit 2017: Intel Optane SSDs as the New Accelerator in Your Data Center
ScyllaDB
 
A Deep Dive into ScyllaDB's Architecture
ScyllaDB
 
Ad

More from ScyllaDB (20)

PDF
Understanding The True Cost of DynamoDB Webinar
ScyllaDB
 
PDF
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
ScyllaDB
 
PDF
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
ScyllaDB
 
PDF
New Ways to Reduce Database Costs with ScyllaDB
ScyllaDB
 
PDF
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
ScyllaDB
 
PDF
Powering a Billion Dreams: Scaling Meesho’s E-commerce Revolution with Scylla...
ScyllaDB
 
PDF
Leading a High-Stakes Database Migration
ScyllaDB
 
PDF
Securely Serving Millions of Boot Artifacts a Day by João Pedro Lima & Matt ...
ScyllaDB
 
PDF
How Agoda Scaled 50x Throughput with ScyllaDB by Worakarn Isaratham
ScyllaDB
 
PDF
How Yieldmo Cut Database Costs and Cloud Dependencies Fast by Todd Coleman
ScyllaDB
 
PDF
ScyllaDB: 10 Years and Beyond by Dor Laor
ScyllaDB
 
PDF
Reduce Your Cloud Spend with ScyllaDB by Tzach Livyatan
ScyllaDB
 
PDF
Migrating 50TB Data From a Home-Grown Database to ScyllaDB, Fast by Terence Liu
ScyllaDB
 
PDF
Vector Search with ScyllaDB by Szymon Wasik
ScyllaDB
 
PDF
Workload Prioritization: How to Balance Multiple Workloads in a Cluster by Fe...
ScyllaDB
 
PDF
Two Leading Approaches to Data Virtualization, and Which Scales Better? by Da...
ScyllaDB
 
PDF
Scaling a Beast: Lessons from 400x Growth in a High-Stakes Financial System b...
ScyllaDB
 
PDF
Object Storage in ScyllaDB by Ran Regev, ScyllaDB
ScyllaDB
 
PDF
Lessons Learned from Building a Serverless Notifications System by Srushith R...
ScyllaDB
 
PDF
A Dist Sys Programmer's Journey into AI by Piotr Sarna
ScyllaDB
 
Understanding The True Cost of DynamoDB Webinar
ScyllaDB
 
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
ScyllaDB
 
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
ScyllaDB
 
New Ways to Reduce Database Costs with ScyllaDB
ScyllaDB
 
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
ScyllaDB
 
Powering a Billion Dreams: Scaling Meesho’s E-commerce Revolution with Scylla...
ScyllaDB
 
Leading a High-Stakes Database Migration
ScyllaDB
 
Securely Serving Millions of Boot Artifacts a Day by João Pedro Lima & Matt ...
ScyllaDB
 
How Agoda Scaled 50x Throughput with ScyllaDB by Worakarn Isaratham
ScyllaDB
 
How Yieldmo Cut Database Costs and Cloud Dependencies Fast by Todd Coleman
ScyllaDB
 
ScyllaDB: 10 Years and Beyond by Dor Laor
ScyllaDB
 
Reduce Your Cloud Spend with ScyllaDB by Tzach Livyatan
ScyllaDB
 
Migrating 50TB Data From a Home-Grown Database to ScyllaDB, Fast by Terence Liu
ScyllaDB
 
Vector Search with ScyllaDB by Szymon Wasik
ScyllaDB
 
Workload Prioritization: How to Balance Multiple Workloads in a Cluster by Fe...
ScyllaDB
 
Two Leading Approaches to Data Virtualization, and Which Scales Better? by Da...
ScyllaDB
 
Scaling a Beast: Lessons from 400x Growth in a High-Stakes Financial System b...
ScyllaDB
 
Object Storage in ScyllaDB by Ran Regev, ScyllaDB
ScyllaDB
 
Lessons Learned from Building a Serverless Notifications System by Srushith R...
ScyllaDB
 
A Dist Sys Programmer's Journey into AI by Piotr Sarna
ScyllaDB
 
Ad

Recently uploaded (20)

PDF
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
PDF
Presentation - Vibe Coding The Future of Tech
yanuarsinggih1
 
PDF
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PDF
Using FME to Develop Self-Service CAD Applications for a Major UK Police Force
Safe Software
 
PDF
CIFDAQ Market Insights for July 7th 2025
CIFDAQ
 
PDF
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
PDF
What Makes Contify’s News API Stand Out: Key Features at a Glance
Contify
 
PPTX
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
PPTX
AUTOMATION AND ROBOTICS IN PHARMA INDUSTRY.pptx
sameeraaabegumm
 
PDF
Fl Studio 24.2.2 Build 4597 Crack for Windows Free Download 2025
faizk77g
 
PPTX
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
PDF
Blockchain Transactions Explained For Everyone
CIFDAQ
 
PPTX
WooCommerce Workshop: Bring Your Laptop
Laura Hartwig
 
PDF
LLMs.txt: Easily Control How AI Crawls Your Site
Keploy
 
PDF
Chris Elwell Woburn, MA - Passionate About IT Innovation
Chris Elwell Woburn, MA
 
PDF
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
PDF
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
PDF
IoT-Powered Industrial Transformation – Smart Manufacturing to Connected Heal...
Rejig Digital
 
PDF
HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...
mcastillo49
 
PDF
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
Presentation - Vibe Coding The Future of Tech
yanuarsinggih1
 
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
Using FME to Develop Self-Service CAD Applications for a Major UK Police Force
Safe Software
 
CIFDAQ Market Insights for July 7th 2025
CIFDAQ
 
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
What Makes Contify’s News API Stand Out: Key Features at a Glance
Contify
 
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
AUTOMATION AND ROBOTICS IN PHARMA INDUSTRY.pptx
sameeraaabegumm
 
Fl Studio 24.2.2 Build 4597 Crack for Windows Free Download 2025
faizk77g
 
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
Blockchain Transactions Explained For Everyone
CIFDAQ
 
WooCommerce Workshop: Bring Your Laptop
Laura Hartwig
 
LLMs.txt: Easily Control How AI Crawls Your Site
Keploy
 
Chris Elwell Woburn, MA - Passionate About IT Innovation
Chris Elwell Woburn, MA
 
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
IoT-Powered Industrial Transformation – Smart Manufacturing to Connected Heal...
Rejig Digital
 
HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...
mcastillo49
 
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 

Editor's Notes

  • #4: Let’s talk numbers Does not include bots, only real users
  • #7: We store every action User may want to see what happened in his mailbox Another examples: investigating possible attacks, sorting out user complaints
  • #9: The thing that we wanted to replace in this scheme was the storage
  • #10: Writes prevail 1000 times
  • #13: The thing that we wanted to replace in this scheme was the storage
  • #14: Tell why we have different amount of nodes in different dcs Think what to answer to questions about CL=ONE We want to be available when a DC goes down, it’s ok for us to serve inconsistent read requests
  • #15: All user data is split by weeks and projects
  • #16: Ambiguous number of network requests to other nodes Can’t trasform all those writes to reads We create another table and duplicate all writes there from the app
  • #17: All user data is split by weeks and projects
  • #18: Latencies are measured from client RPS == API rps + RF + secondary index
  • #19: Remind that we are talking about HDDs
  • #21: They do not recommend hdds It is reasonable
  • #22: In ssd setups it will be probably set to some large value like number of shards The most accurate way is to run benchmarks with different values for num-io-queue
  • #24: Lets say one node failed and we know the exact moment of time when it happened Normally nodetool repair would run full scan but we now the exact moment when problem happened We need to go to nodes from a different DC, transfer data to the affected node and run nodetool repair
  • #25: Refresh will finish soon, then go compactions that do not overload cluster and in our case finished in 6 hours
  • #26: Latencies stay in a reasonable range Resharding is slow but faster than repair and does not overaload cluster
  • #27: Dedicated a whole section for problems with HDDs, what for