SlideShare a Scribd company logo
Revolutionary Storage
For Modern Applications
Sanjay Sabnis
@sabhub1
Big Data Science Meetup
@Paviliondata
05/25/2018
Agenda
• Welcome
• Big Data Application Demands
• Next Generation Storage for Big Data Applications
• Big Data Use Cases
It is About Data
Expectations
Speed
Performance
Latency
Accuracy
Solving the Scalability Problem
RACK
Adding More Nodes
Add More memory
Upgrade Networking
• Need more storage? Add more nodes ßà Comes with Compute As
well
• Under-utilization of storage - Islands of storage still exist
• Limited by Rack Level Data Management at Scale
• Network is not up to date to utilize new features of hardware
Infrastructure
Connectivity Cognition
2.0 3.0
How do we connect
the world?
How do we make
sense of the world?
Crossing the Chasm
AI
Autonomous Vehicles
Image Recognition
Von Neumann
Time to rethink infrastructure, tooling, and
development practices.
Modern Applications Requirements
• Compute/Network/Storage
• Rack Awareness/DC Awareness – Data Locality/HA
• Master/Slave - Scalable
• Master-less - Scalable
• IOPS, Bandwidth – High Data Transfer Rates – 25/50/100 GB is
Standard now.
• Storage Awareness? – This is something new!
• Non-compute Centric Data Management
The Compute & Storage Disconnect
• Compute and Storage Age Differently
• Compute has Moore’s Law, what about storage?
• Replacing Compute calls for replacing disks – Fixed Density, more $$$
• We all have been using SATA drives
• There is a new interface called NVMe for SSD (Non Volatile Memory
Express)
It is a logical device interface specification
Storage Protocol Differences
SATA NVMe
End of Freeway
11
Comparing NVMe to SATA
SATA SSD NVMe SSD NVMe Difference
Read BW (MB/s) 500 3300 6.6X
4K Read IOPS 64K 830K 13X
Write BW (MB/s) 475 2100 4.4X
4K Write IOPS 5K 200K 40X
4K Mixed IOPS (70:30 R:W) 11K 550K 55X
DATA INTENSIVE WORKLOADS
Analytics, IoT, Streaming Media, AI/ML, Databases
NVMe-oF SSD Array
• High Performance
• Cost Efficient
• High Utilization
• Scalable (14 TB to 1 PB)
• HA
• PaYG Model (Pay as You Grow)
NVMe-Based Storage for Big Data
Pavilion All-NVMe Storage Array
13
Advantages for Big Data Deployments
Reduce per-rack costs up to 72%
Improve Storage Utilization 2X+
Free up stranded capacity residing on DAS
Management Flexibility
Less raw storage deployed lowers IT Admin Costs
Move data sets from one server to another without copying
Reduce Infrastructure
Less Servers required, or consolidate more DB instances per
server
Eliminate DAS SSDs
Leverage Full-Performance, Space-Efficient Copies
14
Performance (Latency & Bandwidth) of Direct Attached Storage
Serviceability and Data Management of Shared Storage
DAS
Performance and Cost Advantages
Index More Data With Splunk
Lower Costs of
noSQL Deployments
Using networked Pavilion storage instead of direct-attached SSDs gives better performance per
server, allowing you to reduce server count and size, plus gain the cost advantages of a SAN
15
Pavilion All-NVMe Storage for Big Data
120 GB/S
PERFORMANCE
Up to 40 x 100GE
Ports
MODULAR
14TB – 1PB
CAPACITY
Up To 20, Active-
active Controllers
RESILIENCY
4 RU
DENSITY
Raid-6,
Snapshots, Thin
Provisioning
DATA MANAGEMENT
NVME & NVMEOF
100% STANDARDS COMPLIANT
X86, 2.5” NVMe
SSD
STANDARD OFF-THE-SHELF
COMPONENTS
1/10TH
$COST/IOPS
DISRUPTIVE ECONOMICS
Shared Block Storage For Big Data Applications
ü Hosts connected using
25/40/50/100Gb Ethernet
ü NVMe block storage presented to
host servers using
community/standard NVMeoF driver
ü No custom host software required
ü 10s of micro-second latency
ü Latency of DAS SSDs
ü Full HA capability and hot-pluggable
components
Thin-Provisioned
NVMe volumes
presented to the
host server
17
Management Integration
18
Rest API
Use Cases
Cassandra
C*
C*
C*
1
2
3
Volumes for Node2
a2 b2 c2
a Commit Log
b Data
c Log
Volumes for Node1
a1 b1 c1
Rack Scale
• Dense Compute Rack
• Easy to Add or Replace nodes
• Integrates into DevOps using Rest API
• Thin provisioning to save flash resources
• Increase Volume Size Dynamically
• Manage instant data copies using Rest API
~ 1PB Storage
Snapshot/Clone
Data Backup/Restore
Rack 1 Rack 2
Adding a New Shard • Adding Shard to the Cluster
• Add shard to scale the MongoDB cluster horizontally
• Affects the balance of chunks among the shards of a cluster for all
existing sharded collections.
• The balancer will begin migrating chunks so that the cluster will
achieve balance
• Rebalance will affect existing Read/Write and IOPS performance.
PRIMARY
SECONDARY
SECONDARY
SHARD 1
PRIMARY
SECONDARY
SECONDARY
SHARD 2
APP SERVER APP SERVERAPP SERVER
REPLICA SET 1 REPLICA SET 2
Present Individual Volumes
For each node
PRIMARY
SECONDARY
SECONDARY
SHARD 3
REPLICA SET 3
New Shard
>
> Speed Shard Rebalancing
• Pavilion Advantages
• No sizing activity required.
• No impact of no. of parallel chunk migrations, same IOPS for all with 40 ports
• Pre Configure Pavilion volumes for future shard expansion to automate the
scaling activity
• Over provision the volume size to alleviate IOPS performance as data grows
>
MongoDB - Leveraging Snapshots and Clones
PRIMARY
SECONDARY
SECONDARY
.
.
SECONDARY
Instant Clone
Point in Time Instant Pavilion Snapshots
PRODUCTION
PRIMARY
SECONDARY
SECONDARY
.
.
.
DEV/QA/PREPROD
Backup/Archive
Instant Clone
Use Clone to Scale Replica Set Use Clone to spin up DEV/QA/PREPROD quickly
Pavilion Instant Clones
SECONDARY
Replication
• Scale MongoDB infrastructure without downtime
• Rapid volume cloning capabilities allow for new backup and deployment strategies
• Instant cloning makes node recovery and replacement easy
Reduce Splunk Indexer Sprawl
PAVILION DATA CONFIDENTIAL & PROPRIETARY 23
HOT
WARM
COLD
FROZEN
Tier 1 - $$$$
Tier 2 - $$$
Tier 3 - $$
Tier 4 - $
Backup
Read-Only Snapshots
QA/Dev/PreProd Testing
R/W Clones
Consolidate All Splunk Data on One High-Speed Storage Platform, Simplify Backup and Copy Management
Addressing Splunk Challenges
24
Splunk Solution Design Considerations
Insufficient disk I/O is the most common limitation in Splunk infrastructure
Pavilion delivers over 100 GB/s of bandwidth, and 20 Million IOPS from a
compact, 4U Chassis, which can power even the largest Splunk
deployments
Review the disk subsystem requirements before provisioning your hardware
Pavilion’s scalable platform allows you to focus on the needs on the
compute infrastructure instead of storage
More disks (specifically, more spindles) are better for indexing performance
Pavilion’s low latency storage platform eliminates storage as the indexing
bottleneck
Total throughput of the entire system is important.
Pavilion delivers significant improvements in performance and improves
decision times.
The ratio of disks to disk controllers in a particular system should be higher,
similar to how you provision a database host
Pavilion’s performance and capacity allows for easy storage configuration.
Hot Bucket – Cannot Backup
Take backup of any volume any time without performance overhead on
indexing nodes by using the Pavilion Snapshot feature
Modernize Database Deployments
25
ü Simplify Infrastructure by disaggregating storage
into a centralized, rack-scale appliance
ü Leverage shared storage resources at the speed
and latency of local SSDs
ü Reduce raw flash required
ü Independently scale compute, networking and
storage to maximize flexibility
ü Move to ‘storage-less’ 1U servers to increase
compute density per rack
ü Centralize storage resources to facilitate easy
backup and restore
ü Instantly deploy new copies of the database for
test/dev/QA purposes
DENSE
Compute CLUSTER
Other Use Cases
…………
New Data Architectures
Centralized Logging
“We are a log Management Company that happens
To Stream Videos”
-Netflix Chief Architect
Log Monitoring/Forwarding/….
No Log Forwarding from each Node
Save CPU Cycles
Container Architecture - Cloud
• Fits into Kubernetes or OpenStack implementations
• Integrate Pavilion REST API with Cinder Wrapper provided by the Pavilion
• Storage can be used as Static or Dynamic Volume provisioning
• Fits readily into DevOps CI/CD setup with provided REST API interfaces
• Utilize the Pavilion Snapshot, Clone and volume migration features to manage data beyond lifecycle of the virtual image
• Supports Block Storage, NFS ( S3 support in near future ).
Kubernetes
Pod
Nova
KeyStone
Boot
Launch
Authentication
Persistent
Volume
Docker
Kubernetes Cluster - Datacenter
OpenStack
CSI
Wrapper
Cinder Block
Storage Volumes
Rack Scale Flash Array
Docker’s Containers-as-a-service
(CaaS) platform that can run atop
cloud-based infrastructure such as
OpenStack, or on bare metal
infrastructure, providing complete
application lifecycle management
for container deployments.
HiBD (Hi-Performance Big Data)
• NVMe-oF opens up opportunity for commoditizing the HiBD
• RDMA + NVMe = Killer IOPS & Bandwidth
• Lots of Development has been done using RDMA-based HiBD
Apache Crail - Incubating
Pavilion - 120 GB/S
With DAS Latency
Crail is designed from ground up
for modern high-performance
networking and storage
hardware (RDMA, NVMe, NVMf,
etc.). It leverages user-level I/O to
access hardware directly from
the application context, providing
bare-metal I/O performance to
analytics workloads.
Storage Awareness
Revolutionary Storage for Modern Databases, Applications and Infrastrcture
Ad

More Related Content

What's hot (20)

Cisco UCS Integrated Infrastructure for Big Data with Cassandra
Cisco UCS Integrated Infrastructure for Big Data with CassandraCisco UCS Integrated Infrastructure for Big Data with Cassandra
Cisco UCS Integrated Infrastructure for Big Data with Cassandra
DataStax Academy
 
SAP ASCS on Kubernetes - A Proposal
SAP ASCS on Kubernetes - A ProposalSAP ASCS on Kubernetes - A Proposal
SAP ASCS on Kubernetes - A Proposal
Gary Jackson MBCS
 
Running Analytics at the Speed of Your Business
Running Analytics at the Speed of Your BusinessRunning Analytics at the Speed of Your Business
Running Analytics at the Speed of Your Business
Redis Labs
 
Building A Diverse Geo-Architecture For Cloud Native Applications In One Day
Building A Diverse Geo-Architecture For Cloud Native Applications In One DayBuilding A Diverse Geo-Architecture For Cloud Native Applications In One Day
Building A Diverse Geo-Architecture For Cloud Native Applications In One Day
VMware Tanzu
 
Building a Hybrid Cloud Solution
Building a Hybrid Cloud Solution Building a Hybrid Cloud Solution
Building a Hybrid Cloud Solution
Cloudian
 
Building Apps with Distributed In-Memory Computing Using Apache Geode
Building Apps with Distributed In-Memory Computing Using Apache GeodeBuilding Apps with Distributed In-Memory Computing Using Apache Geode
Building Apps with Distributed In-Memory Computing Using Apache Geode
PivotalOpenSourceHub
 
IMCSummit 2015 - Day 2 IT Business Track - 4 Myths about In-Memory Databases ...
IMCSummit 2015 - Day 2 IT Business Track - 4 Myths about In-Memory Databases ...IMCSummit 2015 - Day 2 IT Business Track - 4 Myths about In-Memory Databases ...
IMCSummit 2015 - Day 2 IT Business Track - 4 Myths about In-Memory Databases ...
In-Memory Computing Summit
 
Scaling HDFS at Xiaomi
Scaling HDFS at XiaomiScaling HDFS at Xiaomi
Scaling HDFS at Xiaomi
DataWorks Summit
 
Red Hat Storage Day Atlanta - Why Software Defined Storage Matters
Red Hat Storage Day Atlanta - Why Software Defined Storage MattersRed Hat Storage Day Atlanta - Why Software Defined Storage Matters
Red Hat Storage Day Atlanta - Why Software Defined Storage Matters
Red_Hat_Storage
 
Containerized Hadoop beyond Kubernetes
Containerized Hadoop beyond KubernetesContainerized Hadoop beyond Kubernetes
Containerized Hadoop beyond Kubernetes
DataWorks Summit
 
Responding to Digital Transformation With RDS Database Technology
Responding to Digital Transformation With RDS Database TechnologyResponding to Digital Transformation With RDS Database Technology
Responding to Digital Transformation With RDS Database Technology
Alibaba Cloud
 
What's the Hadoop-la about Kubernetes?
What's the Hadoop-la about Kubernetes?What's the Hadoop-la about Kubernetes?
What's the Hadoop-la about Kubernetes?
DataWorks Summit
 
SAP HANA System Replication (HSR) versus SAP Replication Server (SRS)
SAP HANA System Replication (HSR) versus SAP Replication Server (SRS)SAP HANA System Replication (HSR) versus SAP Replication Server (SRS)
SAP HANA System Replication (HSR) versus SAP Replication Server (SRS)
Gary Jackson MBCS
 
Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers
Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers
Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers
Red_Hat_Storage
 
HBaseCon 2015: DeathStar - Easy, Dynamic, Multi-tenant HBase via YARN
HBaseCon 2015: DeathStar - Easy, Dynamic,  Multi-tenant HBase via YARNHBaseCon 2015: DeathStar - Easy, Dynamic,  Multi-tenant HBase via YARN
HBaseCon 2015: DeathStar - Easy, Dynamic, Multi-tenant HBase via YARN
HBaseCon
 
Apache Flink & Kudu: a connector to develop Kappa architectures
Apache Flink & Kudu: a connector to develop Kappa architecturesApache Flink & Kudu: a connector to develop Kappa architectures
Apache Flink & Kudu: a connector to develop Kappa architectures
Nacho García Fernández
 
HBaseCon 2012 | Building a Large Search Platform on a Shoestring Budget
HBaseCon 2012 | Building a Large Search Platform on a Shoestring BudgetHBaseCon 2012 | Building a Large Search Platform on a Shoestring Budget
HBaseCon 2012 | Building a Large Search Platform on a Shoestring Budget
Cloudera, Inc.
 
Red Hat Storage Day New York - New Reference Architectures
Red Hat Storage Day New York - New Reference ArchitecturesRed Hat Storage Day New York - New Reference Architectures
Red Hat Storage Day New York - New Reference Architectures
Red_Hat_Storage
 
#GeodeSummit - Redis to Geode Adaptor
#GeodeSummit - Redis to Geode Adaptor#GeodeSummit - Redis to Geode Adaptor
#GeodeSummit - Redis to Geode Adaptor
PivotalOpenSourceHub
 
RedisConf18 - Redis Enterprise on Cloud Native Platforms
RedisConf18 - Redis Enterprise on Cloud  Native  Platforms RedisConf18 - Redis Enterprise on Cloud  Native  Platforms
RedisConf18 - Redis Enterprise on Cloud Native Platforms
Redis Labs
 
Cisco UCS Integrated Infrastructure for Big Data with Cassandra
Cisco UCS Integrated Infrastructure for Big Data with CassandraCisco UCS Integrated Infrastructure for Big Data with Cassandra
Cisco UCS Integrated Infrastructure for Big Data with Cassandra
DataStax Academy
 
SAP ASCS on Kubernetes - A Proposal
SAP ASCS on Kubernetes - A ProposalSAP ASCS on Kubernetes - A Proposal
SAP ASCS on Kubernetes - A Proposal
Gary Jackson MBCS
 
Running Analytics at the Speed of Your Business
Running Analytics at the Speed of Your BusinessRunning Analytics at the Speed of Your Business
Running Analytics at the Speed of Your Business
Redis Labs
 
Building A Diverse Geo-Architecture For Cloud Native Applications In One Day
Building A Diverse Geo-Architecture For Cloud Native Applications In One DayBuilding A Diverse Geo-Architecture For Cloud Native Applications In One Day
Building A Diverse Geo-Architecture For Cloud Native Applications In One Day
VMware Tanzu
 
Building a Hybrid Cloud Solution
Building a Hybrid Cloud Solution Building a Hybrid Cloud Solution
Building a Hybrid Cloud Solution
Cloudian
 
Building Apps with Distributed In-Memory Computing Using Apache Geode
Building Apps with Distributed In-Memory Computing Using Apache GeodeBuilding Apps with Distributed In-Memory Computing Using Apache Geode
Building Apps with Distributed In-Memory Computing Using Apache Geode
PivotalOpenSourceHub
 
IMCSummit 2015 - Day 2 IT Business Track - 4 Myths about In-Memory Databases ...
IMCSummit 2015 - Day 2 IT Business Track - 4 Myths about In-Memory Databases ...IMCSummit 2015 - Day 2 IT Business Track - 4 Myths about In-Memory Databases ...
IMCSummit 2015 - Day 2 IT Business Track - 4 Myths about In-Memory Databases ...
In-Memory Computing Summit
 
Red Hat Storage Day Atlanta - Why Software Defined Storage Matters
Red Hat Storage Day Atlanta - Why Software Defined Storage MattersRed Hat Storage Day Atlanta - Why Software Defined Storage Matters
Red Hat Storage Day Atlanta - Why Software Defined Storage Matters
Red_Hat_Storage
 
Containerized Hadoop beyond Kubernetes
Containerized Hadoop beyond KubernetesContainerized Hadoop beyond Kubernetes
Containerized Hadoop beyond Kubernetes
DataWorks Summit
 
Responding to Digital Transformation With RDS Database Technology
Responding to Digital Transformation With RDS Database TechnologyResponding to Digital Transformation With RDS Database Technology
Responding to Digital Transformation With RDS Database Technology
Alibaba Cloud
 
What's the Hadoop-la about Kubernetes?
What's the Hadoop-la about Kubernetes?What's the Hadoop-la about Kubernetes?
What's the Hadoop-la about Kubernetes?
DataWorks Summit
 
SAP HANA System Replication (HSR) versus SAP Replication Server (SRS)
SAP HANA System Replication (HSR) versus SAP Replication Server (SRS)SAP HANA System Replication (HSR) versus SAP Replication Server (SRS)
SAP HANA System Replication (HSR) versus SAP Replication Server (SRS)
Gary Jackson MBCS
 
Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers
Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers
Red Hat Storage Day Atlanta - Persistent Storage for Linux Containers
Red_Hat_Storage
 
HBaseCon 2015: DeathStar - Easy, Dynamic, Multi-tenant HBase via YARN
HBaseCon 2015: DeathStar - Easy, Dynamic,  Multi-tenant HBase via YARNHBaseCon 2015: DeathStar - Easy, Dynamic,  Multi-tenant HBase via YARN
HBaseCon 2015: DeathStar - Easy, Dynamic, Multi-tenant HBase via YARN
HBaseCon
 
Apache Flink & Kudu: a connector to develop Kappa architectures
Apache Flink & Kudu: a connector to develop Kappa architecturesApache Flink & Kudu: a connector to develop Kappa architectures
Apache Flink & Kudu: a connector to develop Kappa architectures
Nacho García Fernández
 
HBaseCon 2012 | Building a Large Search Platform on a Shoestring Budget
HBaseCon 2012 | Building a Large Search Platform on a Shoestring BudgetHBaseCon 2012 | Building a Large Search Platform on a Shoestring Budget
HBaseCon 2012 | Building a Large Search Platform on a Shoestring Budget
Cloudera, Inc.
 
Red Hat Storage Day New York - New Reference Architectures
Red Hat Storage Day New York - New Reference ArchitecturesRed Hat Storage Day New York - New Reference Architectures
Red Hat Storage Day New York - New Reference Architectures
Red_Hat_Storage
 
#GeodeSummit - Redis to Geode Adaptor
#GeodeSummit - Redis to Geode Adaptor#GeodeSummit - Redis to Geode Adaptor
#GeodeSummit - Redis to Geode Adaptor
PivotalOpenSourceHub
 
RedisConf18 - Redis Enterprise on Cloud Native Platforms
RedisConf18 - Redis Enterprise on Cloud  Native  Platforms RedisConf18 - Redis Enterprise on Cloud  Native  Platforms
RedisConf18 - Redis Enterprise on Cloud Native Platforms
Redis Labs
 

Similar to Revolutionary Storage for Modern Databases, Applications and Infrastrcture (20)

Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMF
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMFGestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMF
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMF
SUSE Italy
 
Building a High Performance Analytics Platform
Building a High Performance Analytics PlatformBuilding a High Performance Analytics Platform
Building a High Performance Analytics Platform
Santanu Dey
 
Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community Talk on High-Performance Solid Sate Ceph Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community
 
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...
Ceph Community
 
VMworld 2013: Virtualizing Databases: Doing IT Right
VMworld 2013: Virtualizing Databases: Doing IT Right VMworld 2013: Virtualizing Databases: Doing IT Right
VMworld 2013: Virtualizing Databases: Doing IT Right
VMworld
 
Percona Live 4/14/15: Leveraging open stack cinder for peak application perfo...
Percona Live 4/14/15: Leveraging open stack cinder for peak application perfo...Percona Live 4/14/15: Leveraging open stack cinder for peak application perfo...
Percona Live 4/14/15: Leveraging open stack cinder for peak application perfo...
Tesora
 
Red hat ceph storage customer presentation
Red hat ceph storage customer presentationRed hat ceph storage customer presentation
Red hat ceph storage customer presentation
Rodrigo Missiaggia
 
Innovations of .NET and Azure (Recaps of Build 2017 selected sessions)
Innovations of .NET and Azure (Recaps of Build 2017 selected sessions)Innovations of .NET and Azure (Recaps of Build 2017 selected sessions)
Innovations of .NET and Azure (Recaps of Build 2017 selected sessions)
Jeff Chu
 
HPC and cloud distributed computing, as a journey
HPC and cloud distributed computing, as a journeyHPC and cloud distributed computing, as a journey
HPC and cloud distributed computing, as a journey
Peter Clapham
 
SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...
SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...
SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...
Splunk
 
Software Defined Storage, Big Data and Ceph - What Is all the Fuss About?
Software Defined Storage, Big Data and Ceph - What Is all the Fuss About?Software Defined Storage, Big Data and Ceph - What Is all the Fuss About?
Software Defined Storage, Big Data and Ceph - What Is all the Fuss About?
Red_Hat_Storage
 
New Ceph capabilities and Reference Architectures
New Ceph capabilities and Reference ArchitecturesNew Ceph capabilities and Reference Architectures
New Ceph capabilities and Reference Architectures
Kamesh Pemmaraju
 
VMworld 2015: Advanced SQL Server on vSphere
VMworld 2015: Advanced SQL Server on vSphereVMworld 2015: Advanced SQL Server on vSphere
VMworld 2015: Advanced SQL Server on vSphere
VMworld
 
Latest (storage IO) patterns for cloud-native applications
Latest (storage IO) patterns for cloud-native applications Latest (storage IO) patterns for cloud-native applications
Latest (storage IO) patterns for cloud-native applications
OpenEBS
 
HPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY 2017 | HPE Storage and Data Management for Big DataHPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY
 
Cloud - High Availability @ Low Cost - Workshop - Gurpreet ahuja
Cloud - High Availability @ Low Cost - Workshop - Gurpreet ahujaCloud - High Availability @ Low Cost - Workshop - Gurpreet ahuja
Cloud - High Availability @ Low Cost - Workshop - Gurpreet ahuja
ResellerClub
 
VMworld 2013: How SRP Delivers More Than Power to Their Customers
VMworld 2013: How SRP Delivers More Than Power to Their Customers VMworld 2013: How SRP Delivers More Than Power to Their Customers
VMworld 2013: How SRP Delivers More Than Power to Their Customers
VMworld
 
Red Hat Storage Roadmap
Red Hat Storage RoadmapRed Hat Storage Roadmap
Red Hat Storage Roadmap
Colleen Corrice
 
Red Hat Storage Roadmap
Red Hat Storage RoadmapRed Hat Storage Roadmap
Red Hat Storage Roadmap
Red_Hat_Storage
 
Leveraging OpenStack Cinder for Peak Application Performance
Leveraging OpenStack Cinder for Peak Application PerformanceLeveraging OpenStack Cinder for Peak Application Performance
Leveraging OpenStack Cinder for Peak Application Performance
NetApp
 
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMF
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMFGestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMF
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMF
SUSE Italy
 
Building a High Performance Analytics Platform
Building a High Performance Analytics PlatformBuilding a High Performance Analytics Platform
Building a High Performance Analytics Platform
Santanu Dey
 
Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community Talk on High-Performance Solid Sate Ceph Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community Talk on High-Performance Solid Sate Ceph
Ceph Community
 
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...
Ceph Day New York 2014: Best Practices for Ceph-Powered Implementations of St...
Ceph Community
 
VMworld 2013: Virtualizing Databases: Doing IT Right
VMworld 2013: Virtualizing Databases: Doing IT Right VMworld 2013: Virtualizing Databases: Doing IT Right
VMworld 2013: Virtualizing Databases: Doing IT Right
VMworld
 
Percona Live 4/14/15: Leveraging open stack cinder for peak application perfo...
Percona Live 4/14/15: Leveraging open stack cinder for peak application perfo...Percona Live 4/14/15: Leveraging open stack cinder for peak application perfo...
Percona Live 4/14/15: Leveraging open stack cinder for peak application perfo...
Tesora
 
Red hat ceph storage customer presentation
Red hat ceph storage customer presentationRed hat ceph storage customer presentation
Red hat ceph storage customer presentation
Rodrigo Missiaggia
 
Innovations of .NET and Azure (Recaps of Build 2017 selected sessions)
Innovations of .NET and Azure (Recaps of Build 2017 selected sessions)Innovations of .NET and Azure (Recaps of Build 2017 selected sessions)
Innovations of .NET and Azure (Recaps of Build 2017 selected sessions)
Jeff Chu
 
HPC and cloud distributed computing, as a journey
HPC and cloud distributed computing, as a journeyHPC and cloud distributed computing, as a journey
HPC and cloud distributed computing, as a journey
Peter Clapham
 
SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...
SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...
SplunkLive! Nutanix Session - Turnkey and scalable infrastructure for Splunk ...
Splunk
 
Software Defined Storage, Big Data and Ceph - What Is all the Fuss About?
Software Defined Storage, Big Data and Ceph - What Is all the Fuss About?Software Defined Storage, Big Data and Ceph - What Is all the Fuss About?
Software Defined Storage, Big Data and Ceph - What Is all the Fuss About?
Red_Hat_Storage
 
New Ceph capabilities and Reference Architectures
New Ceph capabilities and Reference ArchitecturesNew Ceph capabilities and Reference Architectures
New Ceph capabilities and Reference Architectures
Kamesh Pemmaraju
 
VMworld 2015: Advanced SQL Server on vSphere
VMworld 2015: Advanced SQL Server on vSphereVMworld 2015: Advanced SQL Server on vSphere
VMworld 2015: Advanced SQL Server on vSphere
VMworld
 
Latest (storage IO) patterns for cloud-native applications
Latest (storage IO) patterns for cloud-native applications Latest (storage IO) patterns for cloud-native applications
Latest (storage IO) patterns for cloud-native applications
OpenEBS
 
HPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY 2017 | HPE Storage and Data Management for Big DataHPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY
 
Cloud - High Availability @ Low Cost - Workshop - Gurpreet ahuja
Cloud - High Availability @ Low Cost - Workshop - Gurpreet ahujaCloud - High Availability @ Low Cost - Workshop - Gurpreet ahuja
Cloud - High Availability @ Low Cost - Workshop - Gurpreet ahuja
ResellerClub
 
VMworld 2013: How SRP Delivers More Than Power to Their Customers
VMworld 2013: How SRP Delivers More Than Power to Their Customers VMworld 2013: How SRP Delivers More Than Power to Their Customers
VMworld 2013: How SRP Delivers More Than Power to Their Customers
VMworld
 
Leveraging OpenStack Cinder for Peak Application Performance
Leveraging OpenStack Cinder for Peak Application PerformanceLeveraging OpenStack Cinder for Peak Application Performance
Leveraging OpenStack Cinder for Peak Application Performance
NetApp
 
Ad

Recently uploaded (20)

Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
BookNet Canada
 
tecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdftecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdf
fjgm517
 
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Aqusag Technologies
 
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep DiveDesigning Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
ScyllaDB
 
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdfSAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
Precisely
 
#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025
#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025
#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025
BookNet Canada
 
Semantic Cultivators : The Critical Future Role to Enable AI
Semantic Cultivators : The Critical Future Role to Enable AISemantic Cultivators : The Critical Future Role to Enable AI
Semantic Cultivators : The Critical Future Role to Enable AI
artmondano
 
Electronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploitElectronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploit
niftliyevhuseyn
 
TrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business ConsultingTrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business Consulting
Trs Labs
 
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager APIUiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPathCommunity
 
Cyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of securityCyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of security
riccardosl1
 
Greenhouse_Monitoring_Presentation.pptx.
Greenhouse_Monitoring_Presentation.pptx.Greenhouse_Monitoring_Presentation.pptx.
Greenhouse_Monitoring_Presentation.pptx.
hpbmnnxrvb
 
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptxDevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
Justin Reock
 
What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...
Vishnu Singh Chundawat
 
Heap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and DeletionHeap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and Deletion
Jaydeep Kale
 
2025-05-Q4-2024-Investor-Presentation.pptx
2025-05-Q4-2024-Investor-Presentation.pptx2025-05-Q4-2024-Investor-Presentation.pptx
2025-05-Q4-2024-Investor-Presentation.pptx
Samuele Fogagnolo
 
Complete Guide to Advanced Logistics Management Software in Riyadh.pdf
Complete Guide to Advanced Logistics Management Software in Riyadh.pdfComplete Guide to Advanced Logistics Management Software in Riyadh.pdf
Complete Guide to Advanced Logistics Management Software in Riyadh.pdf
Software Company
 
AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...
AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...
AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...
Alan Dix
 
Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025
Splunk
 
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdfThe Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
Abi john
 
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
BookNet Canada
 
tecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdftecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdf
fjgm517
 
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Aqusag Technologies
 
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep DiveDesigning Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
ScyllaDB
 
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdfSAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
Precisely
 
#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025
#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025
#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025
BookNet Canada
 
Semantic Cultivators : The Critical Future Role to Enable AI
Semantic Cultivators : The Critical Future Role to Enable AISemantic Cultivators : The Critical Future Role to Enable AI
Semantic Cultivators : The Critical Future Role to Enable AI
artmondano
 
Electronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploitElectronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploit
niftliyevhuseyn
 
TrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business ConsultingTrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business Consulting
Trs Labs
 
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager APIUiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPathCommunity
 
Cyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of securityCyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of security
riccardosl1
 
Greenhouse_Monitoring_Presentation.pptx.
Greenhouse_Monitoring_Presentation.pptx.Greenhouse_Monitoring_Presentation.pptx.
Greenhouse_Monitoring_Presentation.pptx.
hpbmnnxrvb
 
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptxDevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
Justin Reock
 
What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...
Vishnu Singh Chundawat
 
Heap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and DeletionHeap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and Deletion
Jaydeep Kale
 
2025-05-Q4-2024-Investor-Presentation.pptx
2025-05-Q4-2024-Investor-Presentation.pptx2025-05-Q4-2024-Investor-Presentation.pptx
2025-05-Q4-2024-Investor-Presentation.pptx
Samuele Fogagnolo
 
Complete Guide to Advanced Logistics Management Software in Riyadh.pdf
Complete Guide to Advanced Logistics Management Software in Riyadh.pdfComplete Guide to Advanced Logistics Management Software in Riyadh.pdf
Complete Guide to Advanced Logistics Management Software in Riyadh.pdf
Software Company
 
AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...
AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...
AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...
Alan Dix
 
Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025
Splunk
 
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdfThe Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
Abi john
 
Ad

Revolutionary Storage for Modern Databases, Applications and Infrastrcture

  • 1. Revolutionary Storage For Modern Applications Sanjay Sabnis @sabhub1 Big Data Science Meetup @Paviliondata 05/25/2018
  • 2. Agenda • Welcome • Big Data Application Demands • Next Generation Storage for Big Data Applications • Big Data Use Cases
  • 3. It is About Data
  • 5. Solving the Scalability Problem RACK Adding More Nodes Add More memory Upgrade Networking • Need more storage? Add more nodes ßà Comes with Compute As well • Under-utilization of storage - Islands of storage still exist • Limited by Rack Level Data Management at Scale • Network is not up to date to utilize new features of hardware
  • 6. Infrastructure Connectivity Cognition 2.0 3.0 How do we connect the world? How do we make sense of the world?
  • 7. Crossing the Chasm AI Autonomous Vehicles Image Recognition Von Neumann Time to rethink infrastructure, tooling, and development practices.
  • 8. Modern Applications Requirements • Compute/Network/Storage • Rack Awareness/DC Awareness – Data Locality/HA • Master/Slave - Scalable • Master-less - Scalable • IOPS, Bandwidth – High Data Transfer Rates – 25/50/100 GB is Standard now. • Storage Awareness? – This is something new! • Non-compute Centric Data Management
  • 9. The Compute & Storage Disconnect • Compute and Storage Age Differently • Compute has Moore’s Law, what about storage? • Replacing Compute calls for replacing disks – Fixed Density, more $$$ • We all have been using SATA drives • There is a new interface called NVMe for SSD (Non Volatile Memory Express) It is a logical device interface specification
  • 10. Storage Protocol Differences SATA NVMe End of Freeway
  • 11. 11 Comparing NVMe to SATA SATA SSD NVMe SSD NVMe Difference Read BW (MB/s) 500 3300 6.6X 4K Read IOPS 64K 830K 13X Write BW (MB/s) 475 2100 4.4X 4K Write IOPS 5K 200K 40X 4K Mixed IOPS (70:30 R:W) 11K 550K 55X
  • 12. DATA INTENSIVE WORKLOADS Analytics, IoT, Streaming Media, AI/ML, Databases NVMe-oF SSD Array • High Performance • Cost Efficient • High Utilization • Scalable (14 TB to 1 PB) • HA • PaYG Model (Pay as You Grow) NVMe-Based Storage for Big Data
  • 14. Advantages for Big Data Deployments Reduce per-rack costs up to 72% Improve Storage Utilization 2X+ Free up stranded capacity residing on DAS Management Flexibility Less raw storage deployed lowers IT Admin Costs Move data sets from one server to another without copying Reduce Infrastructure Less Servers required, or consolidate more DB instances per server Eliminate DAS SSDs Leverage Full-Performance, Space-Efficient Copies 14 Performance (Latency & Bandwidth) of Direct Attached Storage Serviceability and Data Management of Shared Storage DAS
  • 15. Performance and Cost Advantages Index More Data With Splunk Lower Costs of noSQL Deployments Using networked Pavilion storage instead of direct-attached SSDs gives better performance per server, allowing you to reduce server count and size, plus gain the cost advantages of a SAN 15
  • 16. Pavilion All-NVMe Storage for Big Data 120 GB/S PERFORMANCE Up to 40 x 100GE Ports MODULAR 14TB – 1PB CAPACITY Up To 20, Active- active Controllers RESILIENCY 4 RU DENSITY Raid-6, Snapshots, Thin Provisioning DATA MANAGEMENT NVME & NVMEOF 100% STANDARDS COMPLIANT X86, 2.5” NVMe SSD STANDARD OFF-THE-SHELF COMPONENTS 1/10TH $COST/IOPS DISRUPTIVE ECONOMICS
  • 17. Shared Block Storage For Big Data Applications ü Hosts connected using 25/40/50/100Gb Ethernet ü NVMe block storage presented to host servers using community/standard NVMeoF driver ü No custom host software required ü 10s of micro-second latency ü Latency of DAS SSDs ü Full HA capability and hot-pluggable components Thin-Provisioned NVMe volumes presented to the host server 17
  • 20. Cassandra C* C* C* 1 2 3 Volumes for Node2 a2 b2 c2 a Commit Log b Data c Log Volumes for Node1 a1 b1 c1 Rack Scale • Dense Compute Rack • Easy to Add or Replace nodes • Integrates into DevOps using Rest API • Thin provisioning to save flash resources • Increase Volume Size Dynamically • Manage instant data copies using Rest API ~ 1PB Storage Snapshot/Clone Data Backup/Restore Rack 1 Rack 2
  • 21. Adding a New Shard • Adding Shard to the Cluster • Add shard to scale the MongoDB cluster horizontally • Affects the balance of chunks among the shards of a cluster for all existing sharded collections. • The balancer will begin migrating chunks so that the cluster will achieve balance • Rebalance will affect existing Read/Write and IOPS performance. PRIMARY SECONDARY SECONDARY SHARD 1 PRIMARY SECONDARY SECONDARY SHARD 2 APP SERVER APP SERVERAPP SERVER REPLICA SET 1 REPLICA SET 2 Present Individual Volumes For each node PRIMARY SECONDARY SECONDARY SHARD 3 REPLICA SET 3 New Shard > > Speed Shard Rebalancing • Pavilion Advantages • No sizing activity required. • No impact of no. of parallel chunk migrations, same IOPS for all with 40 ports • Pre Configure Pavilion volumes for future shard expansion to automate the scaling activity • Over provision the volume size to alleviate IOPS performance as data grows >
  • 22. MongoDB - Leveraging Snapshots and Clones PRIMARY SECONDARY SECONDARY . . SECONDARY Instant Clone Point in Time Instant Pavilion Snapshots PRODUCTION PRIMARY SECONDARY SECONDARY . . . DEV/QA/PREPROD Backup/Archive Instant Clone Use Clone to Scale Replica Set Use Clone to spin up DEV/QA/PREPROD quickly Pavilion Instant Clones SECONDARY Replication • Scale MongoDB infrastructure without downtime • Rapid volume cloning capabilities allow for new backup and deployment strategies • Instant cloning makes node recovery and replacement easy
  • 23. Reduce Splunk Indexer Sprawl PAVILION DATA CONFIDENTIAL & PROPRIETARY 23 HOT WARM COLD FROZEN Tier 1 - $$$$ Tier 2 - $$$ Tier 3 - $$ Tier 4 - $ Backup Read-Only Snapshots QA/Dev/PreProd Testing R/W Clones Consolidate All Splunk Data on One High-Speed Storage Platform, Simplify Backup and Copy Management
  • 24. Addressing Splunk Challenges 24 Splunk Solution Design Considerations Insufficient disk I/O is the most common limitation in Splunk infrastructure Pavilion delivers over 100 GB/s of bandwidth, and 20 Million IOPS from a compact, 4U Chassis, which can power even the largest Splunk deployments Review the disk subsystem requirements before provisioning your hardware Pavilion’s scalable platform allows you to focus on the needs on the compute infrastructure instead of storage More disks (specifically, more spindles) are better for indexing performance Pavilion’s low latency storage platform eliminates storage as the indexing bottleneck Total throughput of the entire system is important. Pavilion delivers significant improvements in performance and improves decision times. The ratio of disks to disk controllers in a particular system should be higher, similar to how you provision a database host Pavilion’s performance and capacity allows for easy storage configuration. Hot Bucket – Cannot Backup Take backup of any volume any time without performance overhead on indexing nodes by using the Pavilion Snapshot feature
  • 25. Modernize Database Deployments 25 ü Simplify Infrastructure by disaggregating storage into a centralized, rack-scale appliance ü Leverage shared storage resources at the speed and latency of local SSDs ü Reduce raw flash required ü Independently scale compute, networking and storage to maximize flexibility ü Move to ‘storage-less’ 1U servers to increase compute density per rack ü Centralize storage resources to facilitate easy backup and restore ü Instantly deploy new copies of the database for test/dev/QA purposes DENSE Compute CLUSTER
  • 27. New Data Architectures Centralized Logging “We are a log Management Company that happens To Stream Videos” -Netflix Chief Architect Log Monitoring/Forwarding/…. No Log Forwarding from each Node Save CPU Cycles
  • 28. Container Architecture - Cloud • Fits into Kubernetes or OpenStack implementations • Integrate Pavilion REST API with Cinder Wrapper provided by the Pavilion • Storage can be used as Static or Dynamic Volume provisioning • Fits readily into DevOps CI/CD setup with provided REST API interfaces • Utilize the Pavilion Snapshot, Clone and volume migration features to manage data beyond lifecycle of the virtual image • Supports Block Storage, NFS ( S3 support in near future ). Kubernetes Pod Nova KeyStone Boot Launch Authentication Persistent Volume Docker Kubernetes Cluster - Datacenter OpenStack CSI Wrapper Cinder Block Storage Volumes Rack Scale Flash Array Docker’s Containers-as-a-service (CaaS) platform that can run atop cloud-based infrastructure such as OpenStack, or on bare metal infrastructure, providing complete application lifecycle management for container deployments.
  • 29. HiBD (Hi-Performance Big Data) • NVMe-oF opens up opportunity for commoditizing the HiBD • RDMA + NVMe = Killer IOPS & Bandwidth • Lots of Development has been done using RDMA-based HiBD Apache Crail - Incubating Pavilion - 120 GB/S With DAS Latency Crail is designed from ground up for modern high-performance networking and storage hardware (RDMA, NVMe, NVMf, etc.). It leverages user-level I/O to access hardware directly from the application context, providing bare-metal I/O performance to analytics workloads. Storage Awareness