YugaByte DB on Kubernetes - An Introduction (Yugabyte)
This document summarizes YugaByte DB, a distributed SQL and NoSQL database. It discusses how YugaByte DB provides ACID transactions, strong consistency, and high performance at planet scale. It also describes how to deploy YugaByte DB and an example e-commerce application called Yugastore on Kubernetes. The document outlines the database architecture and components, and provides steps to deploy the system and run a sample workload.
Distributed Database Architecture for GDPR (Yugabyte)
The General Data Protection Regulation, often referred to as GDPR, came into effect on 25 May 2018 across the European Union. This regulation has implications for many global businesses, given the fines imposed if an organization is found to be non-compliant. Making sure that the app architecture continues to ensure regulatory compliance is an ongoing challenge for many businesses. This talk covers some of the key requirements of GDPR that impact database architecture, the inherent challenges those requirements bring, and how YugaByte DB can be used to implement them.
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt... (Rittman Analytics)
As big data and data warehousing scale up and move into the cloud, they’re increasingly likely to be delivered as services using distributed cloud query engines such as Google BigQuery, loaded using streaming data pipelines and queried using BI tools such as Looker. In this session the presenter will walk through how data modelling and query processing work when storing petabytes of customer event-level activity in a distributed data store and query engine like BigQuery, how data ingestion and processing work in an always-on streaming data pipeline, how additional services such as the Google Natural Language API can be used to classify sentiment and extract entity nouns from incoming unstructured data, and how BI tools such as Looker and Google Data Studio bring data discovery and business metadata layers to cloud big data analytics.
SQL is a popular database language for modern applications, given its flexibility in modelling workloads and how widely it is understood by developers. However, most modern applications running in the cloud require fault tolerance, the ability to scale out, and geographic distribution of data. These are hard to achieve with traditional SQL databases, which is paving the way for distributed SQL databases.
Google Spanner is arguably the world's first truly distributed SQL database. Given its fully decentralized architecture, it delivers higher performance and availability for geo-distributed SQL workloads than other specialized transactional databases such as Amazon Aurora. Now, there are a number of open source derivatives of Google Spanner such as YugaByte DB, CockroachDB and TiDB. This talk will focus on the common architectural paradigms that these databases are built on (using YugaByte DB as an example). Learn about the concepts these databases leverage, how to evaluate if these will meet your needs and the questions to ask to differentiate among these databases.
This document provides an agenda and overview of a presentation on cloud data warehousing. The presentation discusses the data challenges companies face today with large and diverse data sources, and how a cloud data warehouse can help address these challenges by providing unlimited scalability, flexibility, and lower costs. It introduces Snowflake as the first data warehouse built for the cloud, with features like separation of storage and compute, automatic query optimization, and built-in security and encryption. Other cloud data warehouse offerings like Amazon Redshift are also briefly discussed.
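To make the "separation of storage and compute" point concrete, here is a minimal sketch in Python using the snowflake-connector-python package. The account, credentials, and object names below are hypothetical placeholders, not values from the presentation:

import snowflake.connector

# All identifiers below (account, user, warehouse, database) are hypothetical.
conn = snowflake.connector.connect(
    account="my_account",
    user="my_user",
    password="my_password",
    warehouse="ANALYTICS_WH",
    database="DEMO_DB",
    schema="PUBLIC",
)
cur = conn.cursor()

# Because compute (virtual warehouses) is decoupled from storage, a warehouse
# can be resized on the fly without moving or reloading any data.
cur.execute("ALTER WAREHOUSE ANALYTICS_WH SET WAREHOUSE_SIZE = 'LARGE'")
cur.execute("SELECT CURRENT_WAREHOUSE(), CURRENT_VERSION()")
print(cur.fetchone())
conn.close()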
Webinar: High Performance MongoDB Applications with IBM POWER8 (MongoDB)
Innovative companies are building Internet of Things, mobile, content management, single view, and big data apps on top of MongoDB. In this session, we'll explore how the IBM POWER8 platform brings new levels of performance and ease of configuration to these solutions which already benefit from easier and faster design and development using MongoDB.
This document outlines an agenda for a 90-minute workshop on Snowflake. The agenda includes introductions, an overview of Snowflake and data warehousing, demonstrations of how users utilize Snowflake, hands-on exercises loading sample data and running queries, and discussions of Snowflake architecture and capabilities. Real-world customer examples are also presented, such as a pharmacy building new applications on Snowflake and an education company using it to unify their data sources and achieve a 16x performance improvement.
- MongoDB is a document database management system that is recognized as a leader by Gartner. It has over 520 employees, 2500+ customers, and offices globally.
- MongoDB ranked 4th in database mindshare according to DB-Engines. It has seen 172% growth in the last 20 months.
- Several companies such as a quantitative investment manager, an insurance company, a telecommunications company, and an ecommerce company migrated their systems to MongoDB and saw benefits like 100x faster data retrieval, 50% lower costs, and being able to build applications faster.
Webinar: Faster Big Data Analytics with MongoDB (MongoDB)
Learn how to leverage MongoDB and Big Data technologies to derive rich business insight and build high performance business intelligence platforms. This presentation includes:
- Uncovering Opportunities with Big Data analytics
- Challenges of real-time data processing
- Best practices for performance optimization
- Real world case study
This presentation was given in partnership with CIGNEX Datamatics.
Webinar: Enterprise Trends for Database-as-a-Service (MongoDB)
Two complementary trends are particularly strong in enterprise IT today: MongoDB itself, and the movement of infrastructure, platform, and software to as-a-service models. Being designed from the start to work in cloud deployments, MongoDB is a natural fit.
Learn how your enterprise can create its own MongoDB service offering, combining the advantages of MongoDB and cloud for agile, nearly-instantaneous deployments. Ease your operations workload by centralizing enforcement points, standardizing best practices, and enabling elastic scalability.
We will provide you with an enterprise planning outline which incorporates needs and value for stakeholders across operations, development, and business. We will cover accounting, chargeback integration, and quantification of benefits to the enterprise (such as standardizing best practices, creating elastic architecture, and reducing database maintenance costs).
Advanced Reporting and ETL for MongoDB: Easily Build a 360-Degree View of You... (MongoDB)
The document discusses Pentaho's analytics and ETL solutions for MongoDB. It provides an overview of Pentaho and its platform for unified business analytics and data integration. It then outlines how Pentaho can be used to build a 360-degree view of customers by extracting, transforming and loading data from source systems into MongoDB and performing analytics and reporting on the MongoDB data. It demonstrates these capabilities with examples and screenshots.
Join CIGNEX Datamatics, Alfresco’s Global Platinum Partner, as they share the case study experience of a leading global online university. Together we’ll take a close look at their document management and web portal solution and their integrations with Alfresco ECM, Liferay Portal and Moodle Learning Management System.
Actionable Insights with AI - Snowflake for Data Science (Harald Erb)
Talk @ ScaleUp 360° AI Infrastructures DACH, 2021: Data scientists spend 80% or more of their time searching for and preparing data. This talk explains Snowflake’s platform capabilities, like near-unlimited data storage and instant, near-infinite compute resources, and how the platform can be used to seamlessly integrate and support the machine learning libraries and tools data scientists rely on.
As Twitch grew, both the amount of data we received and the number of employees interested in the data grew rapidly. In order to continue empowering decision making as we scaled, we turned to using Druid and Imply to provide self-service analytics to both our technical and non-technical staff, allowing them to drill into high-level metrics in lieu of reading generated reports.
In this talk, learn how Twitch implemented a common analytics platform for the needs of many different teams supporting hundreds of users, thousands of queries, and ~5 billion events each day. This session will explain our Druid architecture in detail (a minimal query sketch follows the list), including:
-The end-to-end architecture deployed on Amazon that includes Kinesis, RDS, S3, Druid, Pivot and Tableau
-How the data is brought together to deliver a unified view of live customer engagement and historical trends
-Operational best practices we learnt scaling Druid
-An example walk through using the platform
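As a rough illustration of how clients talk to a Druid broker, here is a sketch that submits a native timeseries query over HTTP using Python's requests library. The broker address, datasource name, and interval are hypothetical placeholders, not details from Twitch's deployment:

import requests

# Native Druid JSON query posted to a broker's query endpoint.
# Host, datasource, and interval below are hypothetical.
query = {
    "queryType": "timeseries",
    "dataSource": "events",
    "granularity": "hour",
    "intervals": ["2018-01-01/2018-01-02"],
    "aggregations": [{"type": "count", "name": "events"}],
}
resp = requests.post("http://druid-broker:8082/druid/v2", json=query)
resp.raise_for_status()
for row in resp.json():
    print(row["timestamp"], row["result"]["events"])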
Enterprise Reporting with MongoDB and JasperSoft (MongoDB)
The document provides an agenda and overview for a briefing on using MongoDB and JasperSoft for enterprise reporting. It discusses MongoDB's flexible document data model and its query model, with rich queries and analytics functions. It then outlines use cases for JasperSoft and MongoDB together, including data hubbing from various sources and real-time analytics dashboards, and gives examples of customers using the integrated solution.
EVENT OBJECTIVES
Strengthen studies in the field of Business Intelligence;
Promote the development of techniques, methodologies and interfaces together with the community;
Generate interaction between students, professionals and companies, raising the quality of networking.
Here I talk about examples and use cases for Big Data & Big Data Analytics, and how we accomplished massive-scale sentiment, campaign and marketing analytics for Razorfish using a collection of database, Big Data and analytics technologies.
Google BigQuery is Google's fully managed big data analytics service that allows users to analyze very large datasets. It offers a fast and easy to use service with no infrastructure to manage. Developers can stream up to 100,000 rows of data per second for near real-time analysis. BigQuery bills users per project on a pay-as-you-go model, with the first 1TB of data processed each month free of charge.
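A minimal sketch of the streaming-ingest path described above, using the google-cloud-bigquery Python client; the project, dataset, and table names are hypothetical, and credentials are assumed to come from the environment:

from google.cloud import bigquery

# Uses application-default credentials from the environment.
client = bigquery.Client()

# Hypothetical fully-qualified table id: project.dataset.table
table_id = "my-project.my_dataset.events"

rows = [
    {"user_id": 1, "action": "click"},
    {"user_id": 2, "action": "purchase"},
]

# Streaming inserts become queryable within seconds, enabling the
# near real-time analysis mentioned in the summary.
errors = client.insert_rows_json(table_id, rows)
if errors:
    print("insert errors:", errors)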
AWS Summit Singapore 2019 | Snowflake: Your Data. No Limits (AWS Summits)
This document discusses Snowflake, a cloud data platform. It describes Snowflake's mission to enable organizations to be data-driven. It outlines problems with traditional data architectures like complexity, limited scalability, inability to consolidate data, and rigid costs. Snowflake's solution is a cloud-native data warehouse delivered as a service that offers instant elasticity, end-to-end security, and the ability to query structured and semi-structured data using SQL. Key benefits of Snowflake include supporting any scale of data, users and workloads; paying only for resources used; and providing simplicity, scalability, flexibility and elasticity.
Smartsheet’s Transition to Snowflake and Databricks: The Why and Immediate Im... (Databricks)
Join this session to hear why Smartsheet decided to transition from their entirely SQL-based system to Snowflake and Databricks, and learn how that transition has made an immediate impact on their team, company and customer experience through enabling faster, informed data decisions.
The document discusses MongoDB operations for developers, including its data model, use of replication for high availability, sharding for scalability, and deployment architectures. It also covers MongoDB's philosophy, benefits of its document model, how replica sets provide self-healing and failure recovery, and security features available in MongoDB Enterprise.
In this presentation we go through the differences and similarities between Redshift and BigQuery. It was presented during the Athens Big Data meetup in May 2017.
MongoDB .local London 2019: MongoDB Atlas Data Lake Technical Deep Dive (MongoDB)
This document summarizes MongoDB Atlas Data Lake, a new service that allows customers to access, query, and analyze long-term data stored in AWS S3 buckets. It implements MongoDB's query language and security model to provide a familiar interface for working with structured data in object storage. The service is read-only, distributed, and optimized to handle queries over vast amounts of data efficiently using MongoDB's aggregation engine. Customers maintain full control over their data and how it is configured and accessed.
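Because Atlas Data Lake speaks the MongoDB query language, existing drivers work unchanged. A minimal sketch using pymongo; the connection string, database, and collection names are hypothetical placeholders:

from pymongo import MongoClient

# Hypothetical Data Lake connection string; in practice this comes from
# the Atlas UI for your tenant.
client = MongoClient("mongodb://datalake0-example.a.query.mongodb.net/?ssl=true")
db = client["sales"]

# The service is read-only, so workloads are queries and aggregations
# executed by MongoDB's aggregation engine over data sitting in S3.
pipeline = [
    {"$match": {"status": "shipped"}},
    {"$group": {"_id": "$region", "orders": {"$sum": 1}}},
]
for doc in db["orders"].aggregate(pipeline):
    print(doc)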
Architecting Snowflake for High Concurrency and High Performance (SamanthaBerlant)
Cloud Data Warehousing juggernaut Snowflake has raced out ahead of the pack to deliver a data management platform from which a wealth of new analytics can be run. Using Snowflake as a traditional data warehouse has some obvious cost advantages over a hardware solution. But the real value of Snowflake as a data platform lies in its ability to support a high-concurrency analytics platform using Kyligence Cloud, powered by Apache Kylin.
In this presentation, Senior Solutions Architect Robert Hardaway will describe a modern data service architecture using precomputation and distributed indexes to provide interactive analytics to hundreds or even thousands of users running against very large Snowflake datasets (TBs to PBs).
During this presentation, Infusion and MongoDB shared their mainframe optimization experiences and best practices. These have been gained from working with a variety of organizations, including a case study from one of the world’s largest banks. MongoDB and Infusion bring a tested approach that provides a new way of modernizing mainframe applications, while keeping pace with the demand for new digital services.
Vivint Smart Home's journey with Snowflake and its migration from SQL Server. We describe how we set up Snowflake from a people, process, and technology perspective.
YugaByte DB is a transactional database that provides SQL and NoSQL interfaces in a single platform. It was created to address the complexity of building applications using separate SQL and NoSQL databases. YugaByte DB integrates with PKS to enable deployment on Kubernetes clusters. The presentation provides an overview of YugaByte DB's architecture and capabilities, demonstrates its integration with PKS, and discusses several real-world use cases.
YugaByte DB Internals - Storage Engine and Transactions (Yugabyte)
This document introduces YugaByte DB, a high-performance, distributed, transactional database. It is built to scale horizontally on commodity servers across data centers for mission-critical applications. YugaByte DB uses a transactional document store based on RocksDB, Raft-based replication for resilience, and automatic sharding and rebalancing. It supports ACID transactions across documents, provides APIs compatible with Cassandra and Redis, and is open source. The architecture is designed for high performance, strong consistency, and cloud-native deployment.
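Since the document notes API compatibility with Cassandra, a standard Cassandra driver can talk to YugaByte DB's YCQL interface. A minimal sketch with the Python cassandra-driver, assuming a local cluster listening on the default Cassandra port; keyspace and table names are placeholders:

from cassandra.cluster import Cluster

# YCQL, the Cassandra-compatible API, listens on port 9042.
cluster = Cluster(["127.0.0.1"], port=9042)
session = cluster.connect()

# YCQL accepts CREATE KEYSPACE without explicit replication options,
# since replication is managed by the server.
session.execute("CREATE KEYSPACE IF NOT EXISTS demo")
session.execute(
    "CREATE TABLE IF NOT EXISTS demo.users (id INT PRIMARY KEY, name TEXT)"
)
session.execute(
    "INSERT INTO demo.users (id, name) VALUES (%s, %s)", (1, "alice")
)
print(session.execute("SELECT * FROM demo.users").one())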
How YugaByte DB Implements Distributed PostgreSQL (Yugabyte)
Building applications on PostgreSQL that require automatic data sharding and replication, fault tolerance, distributed transactions and geographic data distribution has been hard. In this 3 hour workshop, we will look at how to do this using a real-world example running on top of YugaByte DB, a distributed database that is fully wire-compatible with PostgreSQL and NoSQL APIs (Apache Cassandra and Redis). We will look at the design and architecture of YugaByte DB and how it reuses the PostgreSQL codebase to achieve full API compatibility. YugaByte DB support for PostgreSQL includes most data types, queries, stored procedures, etc. We will also take a look at how to build applications that are planet scale (requiring geographic distribution of data) and how to run them in cloud-native environments (for example, Kubernetes, hybrid or multi-cloud deployments).
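Wire compatibility with PostgreSQL means ordinary Postgres tooling applies. A minimal sketch with psycopg2 against a local YugaByte DB node; the table is a placeholder, and YSQL listens on port 5433 by default:

import psycopg2

# YSQL, the PostgreSQL-compatible API, defaults to port 5433.
conn = psycopg2.connect(
    host="127.0.0.1", port=5433, user="yugabyte", dbname="yugabyte"
)
cur = conn.cursor()
cur.execute(
    "CREATE TABLE IF NOT EXISTS accounts (id BIGINT PRIMARY KEY, balance NUMERIC)"
)
cur.execute("INSERT INTO accounts VALUES (%s, %s)", (1, 100))
conn.commit()  # ordinary transactional semantics, as with vanilla Postgres
cur.execute("SELECT * FROM accounts")
print(cur.fetchall())
conn.close()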
YugaByte DB - "Designing a Distributed Database Architecture for GDPR Complia...Jimmy Guerrero
Join Karthik Ranganathan (YugaByte CTO) for an in-depth technical webinar to understand how developers and administrators alike, can design systems that enable users to control the sharing and protection of their personal data so that it complies with GDPR. Topics covered include schema design, data partitioning, encryption and replication. Karthik will draw on his experience helping scale Facebook's Messenger and Inbox Search along with real-world implementations which make use of YugaByte DB.
Docker containers are great for running stateless microservices, but what about stateful applications such as databases and persistent queues? Kubernetes provides the StatefulSets controller for such applications that have to manage data in some form of persistent storage. While StatefulSets is a great start, a lot more goes into ensuring high performance, data durability and high availability for stateful apps in Kubernetes. Following are 5 best practices that developers and operations engineers should be aware of.
1. Ensure high performance with local persistent volumes and pod anti-affinity rules.
2. Achieve data resilience with auto-failover and multi-zone pod scheduling.
3. Integrate StatefulSet services with other application services through NodePorts & LoadBalancer services.
4. Run Day 2 operations such as monitoring, elastic scaling, capacity re-sizing, backups with caution.
5. Automate operations through Kubernetes Operators that extend the StatefulSets controller.
We will demonstrate how to run a complete E-Commerce application powered by YugaByte DB, when all services are deployed in Kubernetes.
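As a small illustration of Day 2 visibility for StatefulSets (point 4 above), here is a sketch using the official Kubernetes Python client to check replica readiness and whether pod anti-affinity is configured; the namespace is a placeholder:

from kubernetes import client, config

config.load_kube_config()  # or config.load_incluster_config() inside a pod
apps = client.AppsV1Api()

for sts in apps.list_namespaced_stateful_set("default").items:
    ready = sts.status.ready_replicas or 0
    print(f"{sts.metadata.name}: {ready}/{sts.spec.replicas} replicas ready")

    # Best practice 1 above: replicas of one database should not co-locate
    # on a single node, which pod anti-affinity rules enforce.
    affinity = sts.spec.template.spec.affinity
    if not (affinity and affinity.pod_anti_affinity):
        print("  warning: no pod anti-affinity configured")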
Scale Transactional Apps Across Multiple Regions with Low Latency (Yugabyte)
User-facing transactional apps in verticals such as Retail, Finance and SaaS are increasingly moving from a single-region, monolithic architecture to a multi-region, cloud-native architecture. Enhancing customer satisfaction with low latency access, protecting data through geo-redundancy and satisfying compliance requirements such as GDPR are some of the major drivers for this move. Unfortunately, the DB tier powering these apps has long remained a high-latency, hard-to-scale master-slave RDBMS. Multi-master deployments, as well as the use of a separate NoSQL DB for multi-region data distribution, are simply band-aids to this problem and do not deliver the desired business outcomes.
This talk shows how to use YugaByte DB to scale transactional apps across multiple regions with low latency.
Make your data fly - Building data platform in AWS (Kimmo Kantojärvi)
This document summarizes a presentation on building a data platform in AWS. It discusses the architectural evolution from on-premise data warehouses to cloud-based data lakes and platforms. It provides examples of using AWS services like EMR, Redshift, Airflow and visualization tools. It also covers best practices for data modeling, performance optimization, security and DevOps approaches.
Modernizing Global Shared Data Analytics Platform and our Alluxio Journey (Alluxio, Inc.)
Data Orchestration Summit 2020 organized by Alluxio
https://www.alluxio.io/data-orchestration-summit-2020/
Modernizing Global Shared Data Analytics Platform and our Alluxio Journey
Sandipan Chakraborty, Director of Engineering (Rakuten)
About Alluxio: alluxio.io
Engage with the open source community on slack: alluxio.io/slack
Webinar slides: Free Monitoring (on Steroids) for MySQL, MariaDB, PostgreSQL ... (Severalnines)
Traditional server monitoring tools are not built for modern distributed database architectures. Let’s face it, most production databases today run in some kind of high availability setup - from simpler master-slave replication to multi-master clusters fronted by redundant load balancers. Operations teams deal with dozens, often hundreds of services that make up the database environment.
This is why we built ClusterControl - to address modern, highly distributed database setups based on replication or clustering. We wanted something that could provide a systems view of all the components of a distributed cluster, including load balancers.
Watch this replay of a webinar on free database monitoring using ClusterControl Community Edition. We show you how to monitor all your MySQL, MariaDB, PostgreSQL and MongoDB systems from a single point of control - whether they are deployed as Galera Clusters, sharded clusters or replication setups across on-prem and cloud data centers. We also see how to use Advisors in order to improve performance.
AGENDA
- Requirements for monitoring distributed database systems
- Cloud-based vs On-prem monitoring solutions
- Agent-based vs Agentless monitoring
- Deep dive into ClusterControl Community Edition
  - Architecture
  - Metrics Collection
  - Trending
  - Dashboards
  - Queries
  - Performance Advisors
- Other features available to Community users
SPEAKER
Bartlomiej Oles is a MySQL and Oracle DBA, with over 15 years experience in managing highly available production systems at IBM, Nordea Bank, Acxiom, Lufthansa, and other Fortune 500 companies. In the past five years, his focus has been on building and applying automation tools to manage multi-datacenter database environments.
Comparing three data ingestion approaches where Apache Kafka integrates with ... (HostedbyConfluent)
Using Kafka to stream data into TigerGraph, a distributed graph database, is a common pattern in our customers’ data architecture. We have seen the integration in three different layers around TigerGraph’s data flow architecture, and in many key use case areas such as customer 360, entity resolution, fraud detection, machine learning, and recommendation engines. Firstly, TigerGraph’s internal data ingestion architecture relies on Kafka as an internal component. Secondly, TigerGraph has a built-in Kafka Loader, which can connect directly with an external Kafka cluster for data streaming. Thirdly, users can use an external Kafka cluster to connect other cloud data sources to TigerGraph cloud database solutions through the built-in Kafka Loader feature. In this session, we will present the high-level architecture of the three approaches and demo the data streaming process.
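On the producer side of the second and third approaches, an upstream application simply publishes records to a Kafka topic that the TigerGraph loader is configured to consume. A minimal sketch with the kafka-python package; the broker addresses and topic name are hypothetical:

import json
from kafka import KafkaProducer

# Hypothetical broker list and topic; the TigerGraph Kafka Loader would be
# configured separately to consume from this topic.
producer = KafkaProducer(
    bootstrap_servers=["broker1:9092"],
    value_serializer=lambda v: json.dumps(v).encode("utf-8"),
)

event = {"src": "acct-123", "dst": "acct-456", "amount": 42.0}
producer.send("transactions", value=event)
producer.flush()  # block until the record is acknowledged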
New enhancements for security and usability in EDB 13 (EDB)
This document provides an overview of new enhancements in EDB 13 for security, usability and compatibility. Key highlights include improvements to Postgres Enterprise Manager for managing very large databases, enhanced security features like channel binding for authentication, and improved compatibility for Oracle migrations through features like automatic partitioning and Oracle compatible functions. It also outlines new capabilities in PostgreSQL 13 like parallel vacuuming and security tools, as well as enhancements to EDB tools for high availability, backup/recovery and Oracle compatibility.
During this webinar, we will review best practices and lessons learned from working with large and mid-size companies on their deployment of PostgreSQL. We will explore the practices that helped industry leaders move through these stages quickly, and get as much value out of PostgreSQL as possible without incurring undue risk.
We have identified a set of levers that companies can use to accelerate their success with PostgreSQL:
- Application Tiering
- Collaboration between DBAs and Development Teams
- Evangelizing
- Standardization and Automation
- Balance of Migration and New Development
Managing Postgres at Scale With Postgres Enterprise Manager (EDB)
Dave Page shows you how to manage large scale Postgres deployments and highlights how Postgres Enterprise Manager can be used for monitoring, alerting and administration of your Postgres estate - no matter where it is deployed.
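PEM itself is a management GUI, but the kind of statistics it surfaces ultimately come from Postgres's built-in statistics views. As a rough, tool-agnostic illustration (not PEM's actual implementation), here is a sketch polling two of those views with psycopg2; connection details are placeholders:

import psycopg2

conn = psycopg2.connect(host="127.0.0.1", port=5432, dbname="postgres")
cur = conn.cursor()

# Session states: how many backends are active, idle, idle-in-transaction, etc.
cur.execute("SELECT state, count(*) FROM pg_stat_activity GROUP BY state")
print(cur.fetchall())

# Replication health: one row per connected standby.
cur.execute("SELECT client_addr, state, sync_state FROM pg_stat_replication")
print(cur.fetchall())
conn.close()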
Big Data Pipeline for Analytics at Scale @ FIT CVUT 2014 (Jaroslav Gergic)
The recent boom in big data processing and the democratization of the big data space have been enabled by the fact that most of the concepts that originated in the research labs of companies such as Google, Amazon, Yahoo and Facebook are now available as open source. Technologies such as Hadoop and Cassandra let businesses around the world become more data-driven and tap into their massive data feeds to mine valuable insights.
At the same time, we are still at a certain stage of the maturity curve of these new big data technologies and of the entire big data technology stack. Many of the technologies originated from a particular use case and attempts to apply them in a more generic fashion are hitting the limits of their technological foundations. In some areas, there are several competing technologies for the same set of use cases, which increases risks and costs of big data implementations.
We will show how GoodData solves the entire big data pipeline today, starting from raw data feeds all the way up to actionable business insights. All this is provided as a hosted multi-tenant environment, letting customers solve their particular analytical use case, or many analytical use cases for thousands of their own customers, all using the same platform and tools while abstracting them away from the technological details of the big data stack.
An overview of reference architectures for Postgres (EDB)
EDB Reference Architectures are designed to help new and existing users alike to quickly design a deployment architecture that suits their needs. They can be used as either the blueprint for a deployment, or as the basis for a design that enhances and extends the functionality and features offered.
Add-on architectures allow users to easily extend their core database server deployment to add additional features and functionality "building block" style.
In this webinar, we will review the following architectures:
- Single Node
- Multi Node with Asynchronous Replication
- Multi Node with Synchronous Replication
- Add-on Architectures
Speaker:
Michael Willer
Sales Engineer, EDB
Gimel at Dataworks Summit San Jose 2018 (Romit Mehta)
Gimel is PayPal's data platform that provides a unified interface for accessing and analyzing data across different data stores and processing engines. The presentation provides an overview of Gimel, including PayPal's analytics ecosystem, the challenges Gimel addresses around data access and application lifecycle, and a demo of how Gimel simplifies a flights cancelled use case. It also discusses Gimel's open source journey and integration with ecosystems like Spark and Jupyter notebooks.
Gimel Data Platform is an analytics platform developed by PayPal that aims to simplify data access and analysis. The presentation provides an overview of Gimel, including PayPal's analytics ecosystem, the challenges Gimel addresses in data access and application lifecycle management, a demo of a sample flights cancelled use case using Gimel, and PayPal's plans to open source Gimel.
Learn what's new in EDB Postgres 11. This update includes a refreshed version of EDB Postgres Advanced Server, which is built on and enhances the capabilities of open source PostgreSQL 11 with new data redaction capabilities, autonomous transaction commands, and performance diagnostics.
Webinar agenda:
- An intro to EDB Postgres, including BART, EFM, and containers
- What's new with EDB Postgres 11
- Brief overview and demo of PEM 7
Don’t miss this opportunity to hear from some of the top Postgres contributors!
1. The document discusses Project Geode, an open source distributed in-memory database for big data applications. It provides scale-out performance, consistent operations across nodes, high availability, powerful developer features, and easy administration of distributed nodes.
2. The document outlines Geode's architecture and roadmap. It also discusses why the project is being open sourced under Apache and describes some key use cases and customers of Geode.
3. The presentation includes a demo of Geode's capabilities including partitioning, queries, indexing, colocation, and transactions.
What AI Means For Your Product Strategy And What To Do About It (VMware Tanzu)
The document summarizes Matthew Quinn's presentation on "What AI Means For Your Product Strategy And What To Do About It" at Denver Startup Week 2023. The presentation discusses how generative AI could impact product strategies by potentially solving problems companies have ignored or allowing competitors to create new solutions. Quinn advises product teams to evaluate their strategies and roadmaps, ensure they understand user needs, and consider how AI may change the problems being addressed. He provides examples of how AI could influence product development for apps in home organization and solar sales. Quinn concludes by urging attendees not to ignore AI's potential impacts and to have hard conversations about emerging threats and opportunities.
Make the Right Thing the Obvious Thing at Cardinal Health 2023 (VMware Tanzu)
This document discusses the evolution of internal developer platforms and defines what they are. It provides a timeline of how technologies like infrastructure as a service, public clouds, containers and Kubernetes have shaped developer platforms. The key aspects of an internal developer platform are described as providing application-centric abstractions, service level agreements, automated processes from code to production, consolidated monitoring and feedback. The document advocates that internal platforms should make the right choices obvious and easy for developers. It also introduces Backstage as an open source solution for building internal developer portals.
Enhancing DevEx and Simplifying Operations at Scale (VMware Tanzu)
Cardinal Health introduced Tanzu Application Service in 2016 and set up foundations for cloud native applications in AWS and later migrated to GCP in 2018. TAS has provided Cardinal Health with benefits like faster development of applications, zero downtime for critical applications, hosting over 5,000 application instances, quicker patching for security vulnerabilities, and savings through reduced lead times and staffing needs.
Dan Vega discussed upcoming changes and improvements in Spring including Spring Boot 3, which will have support for JDK 17, Jakarta EE 9/10, ahead-of-time compilation, improved observability with Micrometer, and Project Loom's virtual threads. Spring Boot 3.1 additions were also highlighted such as Docker Compose integration and Spring Authorization Server 1.0. Spring Boot 3.2 will focus on embracing virtual threads from Project Loom to improve scalability of web applications.
Platforms, Platform Engineering, & Platform as a Product (VMware Tanzu)
This document discusses building platforms as products and reducing developer toil. It notes that platform engineering now encompasses PaaS and developer tools. A quote from Mercedes-Benz emphasizes building platforms for developers, not for the company itself. The document contrasts reactive, ticket-driven approaches with automated, self-service platforms and products. It discusses moving from considering platforms as a cost center to experts that drive business results. Finally, it provides questions to identify sources of developer toil, such as issues with workstation setup, running software locally, integration testing, committing changes, and release processes.
This document provides an overview of building cloud-ready applications in .NET. It defines what makes an application cloud-ready, discusses common issues with legacy applications, and recommends design patterns and practices to address these issues, including loose coupling, high cohesion, messaging, service discovery, API gateways, and resiliency policies. It includes code examples and links to additional resources.
Dan Vega discussed new features and capabilities in Spring Boot 3 and beyond, including support for JDK 17, Jakarta EE 9, ahead-of-time compilation, observability with Micrometer, Docker Compose integration, and initial support for Project Loom's virtual threads in Spring Boot 3.2 to improve scalability. He provided an overview of each new feature and explained how they can help Spring applications.
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf (VMware Tanzu)
Spring Cloud Gateway is a gateway that provides routing, security, monitoring, and resiliency capabilities for microservices. It acts as an API gateway and sits in front of microservices, routing requests to the appropriate microservice. The gateway uses predicates and filters to route requests and modify requests and responses. It is lightweight and built on reactive principles to enable it to scale to thousands of routes.
This document appears to be from a VMware Tanzu Developer Connect presentation. It discusses Tanzu Application Platform (TAP), which provides a developer experience on Kubernetes across multiple clouds. TAP aims to unlock developer productivity, build rapid paths to production, and coordinate the work of development, security and operations teams. It offers features like pre-configured templates, integrated developer tools, centralized visibility and workload status, role-based access control, automated pipelines and built-in security. The presentation provides examples of how these capabilities improve experiences for developers, operations teams and security teams.
The document provides information about a Tanzu Developer Connect Workshop on Tanzu Application Platform. The agenda includes welcome and introductions on Tanzu Application Platform, followed by interactive hands-on workshops on the developer experience and operator experience. It will conclude with a quiz, prizes and giveaways. The document discusses challenges with developing on Kubernetes and how Tanzu Application Platform aims to improve the developer experience with features like pre-configured templates, developer tools integration, rapid iteration and centralized management.
The Tanzu Developer Connect is a hands-on workshop that dives deep into TAP. Attendees receive a hands on experience. This is a great program to leverage accounts with current TAP opportunities.
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023 (VMware Tanzu)
This document discusses simplifying and scaling enterprise Spring applications in the cloud. It provides an overview of Azure Spring Apps, which is a fully managed platform for running Spring applications on Azure. Azure Spring Apps handles infrastructure management and application lifecycle management, allowing developers to focus on code. It is jointly built, operated, and supported by Microsoft and VMware. The document demonstrates how to create an Azure Spring Apps service, create an application, and deploy code to the application using three simple commands. It also discusses features of Azure Spring Apps Enterprise, which includes additional capabilities from VMware Tanzu components.
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot (VMware Tanzu)
The document discusses 15 factors for building cloud native applications with Kubernetes based on the 12 factor app methodology. It covers factors such as treating code as immutable, externalizing configuration, building stateless and disposable processes, implementing authentication and authorization securely, and monitoring applications like space probes. The presentation aims to provide an overview of the 15 factors and demonstrate how to build cloud native applications using Kubernetes based on these principles.
SpringOne Tour: The Influential Software Engineer (VMware Tanzu)
The document discusses the importance of culture in software projects and how to influence culture. It notes that software projects involve people and personalities, not just technology. It emphasizes that culture informs everything a company does and is very difficult to change. It provides advice on being aware of your company's culture, finding ways to inculcate good cultural values like writing high-quality code, and approaches for influencing decision makers to prioritize culture.
SpringOne Tour: Domain-Driven Design: Theory vs Practice (VMware Tanzu)
This document discusses domain-driven design, clean architecture, bounded contexts, and various modeling concepts. It provides examples of an e-scooter reservation system to illustrate domain modeling techniques. Key topics covered include identifying aggregates, bounded contexts, ensuring single sources of truth, avoiding anemic domain models, and focusing on observable domain behaviors rather than implementation details.
Automation Dreamin' 2022: Sharing Some Gratitude with Your Users (Lynda Kane)
Slide deck from the Automation Dreamin' 2022 presentation Sharing Some Gratitude with Your Users, on creating a Flow to present a random statement of gratitude to a user in Salesforce.
Dev Dives: Automate and orchestrate your processes with UiPath Maestro (UiPathCommunity)
This session is designed to equip developers with the skills needed to build mission-critical, end-to-end processes that seamlessly orchestrate agents, people, and robots.
📕 Here's what you can expect:
- Modeling: Build end-to-end processes using BPMN.
- Implementing: Integrate agentic tasks, RPA, APIs, and advanced decisioning into processes.
- Operating: Control process instances with rewind, replay, pause, and stop functions.
- Monitoring: Use dashboards and embedded analytics for real-time insights into process instances.
This webinar is a must-attend for developers looking to enhance their agentic automation skills and orchestrate robust, mission-critical processes.
👨🏫 Speaker:
Andrei Vintila, Principal Product Manager @UiPath
This session streamed live on April 29, 2025, 16:00 CET.
Check out all our upcoming Dev Dives sessions at https://community.uipath.com/dev-dives-automation-developer-2025/.
Mobile App Development Company in Saudi Arabia (Steve Jonas)
EmizenTech is a globally recognized software development company, proudly serving businesses since 2013. With over 11 years of industry experience and a team of 200+ skilled professionals, we have successfully delivered 1200+ projects across various sectors. As a leading mobile app development company in Saudi Arabia, we offer end-to-end solutions for iOS, Android, and cross-platform applications. Our apps are known for their user-friendly interfaces, scalability, high performance, and strong security features. We tailor each mobile application to meet the unique needs of different industries, ensuring a seamless user experience. EmizenTech is committed to turning your vision into a powerful digital product that drives growth, innovation, and long-term success in the competitive mobile landscape of Saudi Arabia.
Leading AI Innovation As A Product Manager (Michael Jidael)
Unlike traditional product management, AI product leadership requires new mental models, collaborative approaches, and new measurement frameworks. This presentation breaks down how Product Managers can successfully lead AI Innovation in today's rapidly evolving technology landscape. Drawing from practical experience and industry best practices, I shared frameworks, approaches, and mindset shifts essential for product leaders navigating the unique challenges of AI product development.
In this deck, you'll discover:
- What AI leadership means for product managers
- The fundamental paradigm shift required for AI product development.
- A framework for identifying high-value AI opportunities for your products.
- How to transition from user stories to AI learning loops and hypothesis-driven development.
- The essential AI product management framework for defining, developing, and deploying intelligence.
- Technical and business metrics that matter in AI product development.
- Strategies for effective collaboration with data science and engineering teams.
- Framework for handling AI's probabilistic nature and setting stakeholder expectations.
- A real-world case study demonstrating these principles in action.
- Practical next steps to begin your AI product leadership journey.
This presentation is essential for Product Managers, aspiring PMs, product leaders, innovators, and anyone interested in understanding how to successfully build and manage AI-powered products from idea to impact. The key takeaway is that leading AI products is about creating capabilities (intelligence) that continuously improve and deliver increasing value over time.
AI and Data Privacy in 2025: Global Trends (InData Labs)
In this infographic, we explore how businesses can implement effective governance frameworks to address AI data privacy. Understanding it is crucial for developing effective strategies that ensure compliance, safeguard customer trust, and leverage AI responsibly. Equip yourself with insights that can drive informed decision-making and position your organization for success in the future of data privacy.
This infographic contains:
-AI and data privacy: Key findings
-Statistics on AI data privacy in today’s world
-Tips on how to overcome data privacy challenges
-Benefits of AI data security investments.
Keep up-to-date on how AI is reshaping privacy standards and what this entails for both individuals and organizations.
Special Meetup Edition - TDX Bengaluru Meetup #52.pptx (shyamraj55)
We’re bringing the TDX energy to our community with 2 power-packed sessions:
🛠️ Workshop: MuleSoft for Agentforce
Explore the new version of our hands-on workshop featuring the latest Topic Center and API Catalog updates.
📄 Talk: Power Up Document Processing
Dive into smart automation with MuleSoft IDP, NLP, and Einstein AI for intelligent document workflows.
Automation Hour 1/28/2022: Capture User Feedback from Anywhere (Lynda Kane)
Slide deck from the Automation Hour 1/28/2022 presentation Capture User Feedback from Anywhere, presenting how to set up a Custom Object and Flow to collect User Feedback in Dynamic Pages and schedule a report to act on that feedback regularly.
Procurement Insights Cost To Value Guide.pptx (Jon Hansen)
Procurement Insights' integrated Historic Procurement Industry Archives serve as a powerful complement, not a competitor, to other procurement industry firms. They fill critical gaps in depth, agility, and contextual insight that most traditional analyst and association models overlook.
Learn more about this value-driven proprietary service offering here.
Learn the Basics of Agile Development: Your Step-by-Step Guide (Marcel David)
New to Agile? This step-by-step guide is your perfect starting point. "Learn the Basics of Agile Development" simplifies complex concepts, providing you with a clear understanding of how Agile can improve software development and project management. Discover the benefits of iterative work, team collaboration, and flexible planning.
Hands On: Create a Lightning Aura Component with force:RecordData (Lynda Kane)
Slide Deck from the 3/26/2020 virtual meeting of the Cleveland Developer Group presentation on creating a Lightning Aura Component using force:RecordData.
AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2... (Alan Dix)
Talk at the final event of Data Fusion Dynamics: A Collaborative UK-Saudi Initiative in Cybersecurity and Artificial Intelligence funded by the British Council UK-Saudi Challenge Fund 2024, Cardiff Metropolitan University, 29th April 2025
https://alandix.com/academic/talks/CMet2025-AI-Changes-Everything/
Is AI just another technology, or does it fundamentally change the way we live and think?
Every technology has a direct impact with micro-ethical consequences, some good, some bad. However more profound are the ways in which some technologies reshape the very fabric of society with macro-ethical impacts. The invention of the stirrup revolutionised mounted combat, but as a side effect gave rise to the feudal system, which still shapes politics today. The internal combustion engine offers personal freedom and creates pollution, but has also transformed the nature of urban planning and international trade. When we look at AI the micro-ethical issues, such as bias, are most obvious, but the macro-ethical challenges may be greater.
At a micro-ethical level AI has the potential to deepen social, ethnic and gender bias, issues I have warned about since the early 1990s! It is also being used increasingly on the battlefield. However, it also offers amazing opportunities in health and education, as the recent Nobel prizes for the developers of AlphaFold illustrate. More radically, the need to encode ethics acts as a mirror to surface essential ethical problems and conflicts.
At the macro-ethical level, by the early 2000s digital technology had already begun to undermine sovereignty (e.g. gambling), market economics (through network effects and emergent monopolies), and the very meaning of money. Modern AI is the child of big data, big computation and ultimately big business, intensifying the inherent tendency of digital technology to concentrate power. AI is already unravelling the fundamentals of the social, political and economic world around us, but this is a world that needs radical reimagining to overcome the global environmental and human challenges that confront us. Our challenge is whether to let the threads fall as they may, or to use them to weave a better future.
#3: Our Roots – Why you should have confidence in YB
#4: Founded by a team from Facebook
9 members of the core Data Infrastructure team @ FB
From 2006-2013, a unique journey: started off on bare metal, moved to containers, had to address multiple DCs in a very short time, with over 1 billion people needing low-latency reads all across the planet
Facebook Messenger - Inbox/Messages
Operations Data Store
Site Integrity Application
Fraud Detection
Needed strong consistency for Site Integrity and Fraud, so they created HBase
Determined there was a strong need for a cloud-based DB platform
Maturity of the company: went GA with 1.0 in April
Added Oracle, Nutanix personnel
Just closed a round with Lightspeed and Dell Technologies Capital
Brought 1.0 to market April 2018
Scaled from 30M to 1.2B
#9: Custom tier abstracts the complexity…
Transactional
Performant
Scales
Developer agility
Open APIs you already know/use
Our APIs extend the capabilities: our SQL functionality is now in NoSQL, and NoSQL functionality is now in SQL
#10: Does this look like your environment?
Describe the pain/process for the developers
How do they sync? Pushing it up into the app layer, you end up implementing work-arounds, slowing your dev cycles
Data tier is usually behind the application tier
Application tier is usually stateless
Transactional data is written into an SQL DB that is manually sharded. [CLICK] Now you need to be in 2 data centers, so you have to replicate the data; so you usually have Cassandra or MongoDB, data starts to get siloed in the organization, and because performance isn't what it should be, you introduce a caching layer such as Redis
So you end up with 4 data stores, a lot of complexity, and your architecture becomes BRITTLE
Does moving to the cloud affect this? NO. You are just using cloud versions of these solutions
Cassandra Consistency
R+W>N (N = replication factor); quorum is easy when you are using eventual consistency. For example, with N=3, quorum reads and writes (R=W=2) satisfy R+W>N, so every read overlaps the latest write
Consistency – Strong vs Eventual
3 reasons why this is not agile
3 observations why it is operationally complex
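A toy sketch of the R+W>N overlap rule mentioned in these notes, just to make the arithmetic concrete:

# Toy illustration of the R + W > N quorum rule from the notes above.
def overlaps(n: int, r: int, w: int) -> bool:
    """True if any read set of size r must intersect any write set of size w
    among n replicas, i.e. a read is guaranteed to see the latest write."""
    return r + w > n

print(overlaps(3, 2, 2))  # True: quorum reads and writes always intersect
print(overlaps(3, 1, 1))  # False: reads may miss the latest write (eventual)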
#12: We built a new core
New core engine for the data fabric
Open Standards
Purpose-built data fabric – a new cloud-native database, not another legacy database
Multi-cloud
Nomenclature check
#13: Cassandra friendly
Existing Cassandra apps can run against YB
#15: Turvo
Built an app that unifies all the functions they need to monitor
Started with Mongo
Narvar
OEM for doing customer experience
Handles the entire customer experience
Started on DynamoDB and ElastiCache
SQL and NoSQL requirements meant we were a good fit for them
For retailers, it is very important to scale to meet the holiday buying season
#27: Multi-cloud setup
Zero-downtime migrations and upgrades
Rolling upgrades
Trivial to bring on new regions
Trivial to bring on new IaaS
- Including the database
#29: This use case is good for orienting you around how YugaByte provides value for an organization
This is a web conferencing application (video/audio/chat)
The service using YugaByte is for user identity: username/password/attributes
Millions of users
Individuals
Corporations
Important to know where they were logging in from
[Build-out slide]
User name needs to be strongly consistent; it can't be eventually consistent
The initial write (username & password) went to an SQL DB (MySQL), which was replicated for redundancy in a master/slave configuration. All of the writes were happening in the US, but the reads were happening across the globe, so they replicated the data in Cassandra and used that to stage the data across the globe; to achieve low-latency reads, they were using Couchbase as a caching tier
Lots of complexity, which leads to cost, but the biggest cost was opportunity cost: whenever there was a change to the application, the developers needed to coordinate across 3 different DBs
With YugaByte, you have one cluster taking writes (a single homogeneous DB), and then data is replicated to 3 data centers to achieve low-latency reads
BENEFITS:
Writing to 1 platform
We maintain data resiliency
Feature velocity / agility for the company
#31: Focuses on Redis as a DB
Redis – great as a caching tier, with ease-of-use benefits for some data types
Large website for obtaining news, stock tickers and information about companies; to persist the data, it was written to a homegrown DB
Far right side: a large Redis cluster was used to achieve low-latency reads. Adding functionality/applications was very time consuming; it meant they had to manually shard the cluster, which made it VERY brittle (usually took 6-9 months). They started a cycle of: deploy additional nodes, reshard, re-deploy
If you look, you had to write to 2 separate DBs, which creates additional complexity
#32: Now, with YugaByte:
They write into 1 cluster (using the Redis API)
Reads come into the Redis tier from the app itself; low-latency reads are served up in single-digit milliseconds (6 ms or less)
Started with a 4-node cluster
Doubling to 8 nodes only took 30 minutes (versus 6-9 months)
They also wanted the ability to burst into the cloud (which we do via our read replication)
Multi-cloud, multi-datacenter
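Since the app talks to YugaByte through the Redis API, existing Redis client code carries over unchanged. A minimal sketch with the redis-py package, assuming a local node with the Redis-compatible API on its default port; the key is a placeholder:

import redis

# YugaByte's Redis-compatible API listens on the standard Redis port (6379).
r = redis.Redis(host="127.0.0.1", port=6379)

# The same commands the app already issues against a Redis cache,
# now persisted and replicated by the database underneath.
r.set("ticker:ACME", "42.17")
print(r.get("ticker:ACME"))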