SlideShare a Scribd company logo
The Rise of Data in Motion
Serverless and Cloud-Native Event Streaming on AWS with Confluent Cloud
© 2021, Amazon Web Services, Inc. or its Affiliates.
Customers want more value from their data
Used by
many people
Growing
Exponentially
From new
sources
Increasingly
diverse
Analyzed by many
applications
© 2021, Amazon Web Services, Inc. or its Affiliates.
• The number of “smart” devices is
projected to be 200 billion by 2020
(over 100X increase in ten years)
• 90% of the data in the world was generated
in the last 2 years
• There are 2.5 quintillion bytes of
data created each day, and this
pace is accelerating
The volume of data being produced is increasing
Source
© 2021, Amazon Web Services, Inc. or its Affiliates.
Customers moving from traditional data
warehouse approach
Data silos to
OLTP ERP CRM LO
B
DW Silo 1
Business
Intelligence
Device
s
Web Sensors Socia
l
DW Silo 2
Business
Intelligence
Data Lake
Non-
relational
databases
Machine
learning
Data
warehousing
Log
analytics
Big data
processing
Relational
databases
© 2021, Amazon Web Services, Inc. or its Affiliates.
Lake House architecture
SCALABLE DATA LAKES
PURPOSE-BUILT
DATA SERVICES
SEAMLESS
DATA MOVEMENT
UNIFIED GOVERNANCE
PERFORMANT AND
COST-EFFECTIVE
Non-relational
databases
Machine
learning
Data
warehousing
Log
analytics
Big data
processing
Relational
databases
Data lake
© 2021, Amazon Web Services, Inc. or its Affiliates.
Lake House architecture on AWS
SCALABLE DATA LAKES
PURPOSE-BUILT
DATA SERVICES
SEAMLESS
DATA MOVEMENT
UNIFIED GOVERNANCE
PERFORMANT AND
COST-EFFECTIVE
Amazon
DynamoDB
Amazon
SageMaker
Amazon
Redshift
Amazon
Elasticsearch
Service
Amazon
EMR
Amazon
S3
Amazon
Aurora
Amazon
Athena
© 2021, Amazon Web Services, Inc. or its Affiliates.
The value of data diminishes over time
This is a fundamental paradigm shift...
8
Infrastructure
as code
Data in motion
as continuous
streams of events
Future of the
datacenter
Future of data
Cloud
Event
Streaming
An Event Streaming Platform is the
Underpinning of an Event-driven Architecture
9
MES
ERP
Sensors
Mobile
Customer 360
Real-time
Alerting System
Data warehouse
Producers
Consumers
Streams of real time events
Stream processing
apps
Connectors
Connectors
Stream processing
apps
Supplier
Alert
Forecast
Inventory Customer
Order
Car Engine Car Self-driving Car
Confluent Completes Apache Kafka
Truly CLOUD-NATIVE experience
at the edge, in the data center,
and in the cloud
Confluent Cloud
A fully managed, cloud-native service for Apache Kafka
Confluent Platform
A complete, enterprise-grade distribution of Apache Kafka
Confluent for
Kubernetes
Ansible
Playbooks
Packages:
Docker, RPMs,
Tarball
Public Cloud Workloads Edge and On-Premise Workloads
On Kubernetes On VMs / Bare Metal
Wavelength
STREAM
PROCESSING
CONNECTORS
Example Architecture for Event Streaming
ksqlDB
KStreams
Processing Data in Motion with Confluent Cloud on AWS
Dashboard
Oracle
DB
Oracle
CDC
CONNECTOR
Salesforce CDC
CONNECTOR
Salesforce
Source / Sink
CONNECTOR
Fraud Detection App
Context-specific Customer 360
13
Electrical retailer
Hyper-personalized online retail experience,
turning each customer visit into a one-on-one
marketing opportunity
Correlation of historical customer data with real-
time digital signals
Maximize customer satisfaction and revenue
growth, increased customer conversions
https://www.confluent.io/customers/ao/
Ingest & Process
Capture event streams with a consistent data structure using
Schema Registry, develop real-time ETL pipelines with a lightweight
SQL syntax using ksqlDB & unify real-time streams with batch
processing using +100 Confluent Connectors
Derive insights from data in real-time
Mobile
Web
IoT
Data store
AWS & On-prem
Amazon
S3
S3 Sink
ANALYZE
Amazon
Redshift
AWS Lake
Formation
Amazon
Athena
Redshift Sink
TRANSFORM
Amazon
EMR
AWS Data
Pipeline
AWS
Glue
Source
connectors
Store & Analyze
Stream data with Confluent pre-built Connectors into your
AWS data lake or data warehouse to execute queries on vast
amounts of streaming data for real-time and batch analytics
VISUALIZE
Amazon
Elasticsearch
Schema
Registry
ksqlDB
Events
Real-time analytics
Serverless integration
Connect existing and apps & data stores in a repeatable way without
having to manage- Apache Kafka, Schema Registry to maintain
app compatibility, ksqlDB to develop real-time apps with SQL syntax
and Connect for effortless integrations with Lambda & data stores
AWS serverless platform
Stop provisioning, maintaining or administering servers for
backend components such as compute, databases and
storage so that you can focus on increasing agility and
innovation for your developer teams
Increase developer agility & speed of innovation
Apps
Microservices
ksqlDB
Schema
Registry
COMPUTE
AWS
Lambda
Data stores
REST Proxy
& Clients
Source
Connectors
Lambda
Sink
DATA STORES
Amazon
DynamoDB
Amazon
Aurora
STORAGE
Amazon
S3
S3 Sink
ANALYTICS
Amazon
Athena
Amazon
Redshift
Serverless app integration
Accelerate modernization from on-prem to AWS
Redshift Sink
Lambda Sink
AWS Direct
Connect
LEGACY EDW
MAINFRAME
LEGACY DB
JDBC / CDC
connectors
Connect
Leverage +100 Confluent pre-built connectors to
continuously bring valuable data from existing
services on-prem including enterprise data
warehouse, databases and mainframes
Modernize
Increase agility in getting applications to market
and reduce TCO when freeing up resources to
focus on value generating activities and not in
managing servers
On-prem AWS Cloud
Bridge
Hybrid cloud streaming
with consistent, event-
driven architecture for
modern apps
On-prem to AWS modernization
Amazon Athena
AWS Glue
SageMaker
Lake Formation
Amazon
DynamoDB
Amazon
Aurora
S3 Sink
Data Streams
Apps
ksqlDB
Cluster
Linking
Low Latency 5G Use Cases
with AWS Wavelength (based on AWS Outposts) and Confluent
Global Event Streaming
Streaming Replication between Clusters across Cloud, On-Prem and Edge
Bridge to Databases, Data Lakes, Apps, APIs, SaaS
Aggregate Small Footprint
Edge Deployments with
Replication (Aggregation)
Simplify Disaster Recovery
Operations with
Multi-Region Clusters
for RPO=0 and RTO~0
Stream Data Globally with
Replication and Cluster Linking
18
Omnichannel Retail
Time
P
C3 C2
C1
Sales Talk on site in
Car Dealership
Right now
Location-based
Customer Action
Customer 360
(Website, Mobile App, On Site in Store, In-Car)
Car Configurator
10 and 8 days ago
Context-specific
Marketing Campaign
90 and 60 days ago
AWS
Lambda
Omnichannel Retail
Time
P
C3 C2
C1
Machine Learning
Context-specific
Recommendations
Location-based
Customer Action
Customer 360
(Business Intelligence, Machine Learning)
Machine Learning
Train Recommendation Engine
Reporting
All Customer Interactions
Amazon
Athena
Amazon
SageMaker
CRM
3rd party
payment
provider
Context-specific
real-time upsell
Customer data
Payment processing and
fraud detection as a service
Manager
Get report
API
Customer Customer
Customer
data
Train
schedule
Payment
data
Loyalty
information
Streams of real time events
Customer
data
Train
schedule
Payment
data
Loyalty
information
Streams of real time events
Customer
data
Train
schedule
Payment
data
Loyalty
information
Streams of real time events
Hybrid Retail Architecture
Point of Sale
(POS) Loyalty
System
Local Inventory
Management
Payment Discount
Customer
data
Train
schedule
Payment
data
Loyalty
information
Streams of real time events
Global Inventory
Management
Event Streaming at the Edge in the
Smart Retail Store
Item Availability
Disconnected Edge
Time
P
C3 C2
C1
Context-specific
Advertisement
Real-time
(Milliseconds)
Location-based
Customer Action
Always on (even “offline”)
Replayability
Reduced traffic cost
Better latency
Payment Processing
Near Real-time
(Seconds)
Replication to Cloud
Batch
(Depending on Network Bandwidth)
Live Demo
Serverless Kafka on AWS as Part of a Cloud-native Data Lake Architecture
Confluent Schema Registry
The de-facto schemas metadata repository for data in motion
Schema
Registry
Producer
Kafka
serializer
Kafka
deserializer
Consumer
Kafka
3. Produce message
with schema ID
5. Consume message
with schema ID
6. Ask for schema
given schema ID
7. Return
schema
Invalid
message
Invalid
message
4. Is this a valid
schema ID?
1. Register schema
2. Return
schema ID
Confluent Cloud Data Governance
Data Quality
Increase data trust
● Schemas management UI
● Broker-side schema ID validation
Data Catalog
Classify, organize, discover
● Search and discover schemas
metadata
● Manage data classifications
● Classify schemas with tags
Data Lineage
Turn data visibility on
● Visualize complex data in
motion pipelines
● Audit data movement across
systems
NOW IN EARLY-ACCESS
27
Car Engine Car Self-driving Car
Confluent Completes Apache Kafka
Confluent Cloud + : Accelerate Business Value for Customers
Topline Impacting New
Experiences
● Event-driven & real-time
● Unify data across org. w/ Kafka
data fabric (Schema Reg,..)
● AWS Analytics, Redshift, ML
connectors
Mitigate Risk
● Higher Service Quality &
Resilience with 99.95% SLA
● Deep Kafka expertise & innovation
● Elastic billing/pricing
Developer Agility
● Focus on innovation (not data
infrastructure)
● Leverage full Kafka OSS
ecosystem + AWS services
Faster Time to Market
● ~50-75% faster time to market*
● Streamline hybrid cloud
migration with no complex lift-n-
shift
● Maintain business continuity
Lower Kafka TCO
● ~25-50% lower TCO *
● GBps-scale & fast deployments
for global expansion
● Deploy Kafka at scale in 1 week
Maximize ROI
● ~200% ROI per Forrester study
● Save 10s of $Ms with legacy
offload to AWS with Confluent
Replicator
* For customers that don’t already have Kafka based system in-market
* TCO assessment to be analyzed for specific customer scenarios
Questions? Feedback?

More Related Content

What's hot (20)

PDF
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB)
Kai Wähner
 
PDF
Domain Driven Data: Apache Kafka® and the Data Mesh
confluent
 
PDF
Fundamentals of Apache Kafka
Chhavi Parasher
 
PDF
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Cathrine Wilhelmsen
 
PPTX
An Introduction to Confluent Cloud: Apache Kafka as a Service
confluent
 
PDF
Building Lakehouses on Delta Lake with SQL Analytics Primer
Databricks
 
PDF
Stream Processing with Apache Kafka and .NET
confluent
 
PDF
Apache Kafka in the Transportation and Logistics
Kai Wähner
 
PDF
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
Kai Wähner
 
PDF
Apache Kafka – (Pattern and) Anti-Pattern
confluent
 
PDF
Apache Kafka Introduction
Amita Mirajkar
 
PDF
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
Kai Wähner
 
PPTX
Azure storage
Adam Skibicki
 
PDF
Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....
Kai Wähner
 
PDF
Apache Kafka Fundamentals for Architects, Admins and Developers
confluent
 
PDF
Data engineering design patterns
Valdas Maksimavičius
 
PDF
Producer Performance Tuning for Apache Kafka
Jiangjie Qin
 
PPSX
Apache Flink, AWS Kinesis, Analytics
Araf Karsh Hamid
 
PPTX
Apache Spark Architecture | Apache Spark Architecture Explained | Apache Spar...
Simplilearn
 
PDF
From Zero to Hero with Kafka Connect
confluent
 
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB)
Kai Wähner
 
Domain Driven Data: Apache Kafka® and the Data Mesh
confluent
 
Fundamentals of Apache Kafka
Chhavi Parasher
 
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Cathrine Wilhelmsen
 
An Introduction to Confluent Cloud: Apache Kafka as a Service
confluent
 
Building Lakehouses on Delta Lake with SQL Analytics Primer
Databricks
 
Stream Processing with Apache Kafka and .NET
confluent
 
Apache Kafka in the Transportation and Logistics
Kai Wähner
 
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
Kai Wähner
 
Apache Kafka – (Pattern and) Anti-Pattern
confluent
 
Apache Kafka Introduction
Amita Mirajkar
 
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
Kai Wähner
 
Azure storage
Adam Skibicki
 
Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....
Kai Wähner
 
Apache Kafka Fundamentals for Architects, Admins and Developers
confluent
 
Data engineering design patterns
Valdas Maksimavičius
 
Producer Performance Tuning for Apache Kafka
Jiangjie Qin
 
Apache Flink, AWS Kinesis, Analytics
Araf Karsh Hamid
 
Apache Spark Architecture | Apache Spark Architecture Explained | Apache Spar...
Simplilearn
 
From Zero to Hero with Kafka Connect
confluent
 

Similar to Serverless Kafka on AWS as Part of a Cloud-native Data Lake Architecture (20)

PDF
Build real-time streaming data pipelines to AWS with Confluent
confluent
 
PDF
Confluent_AWS_ImmersionDay_Q42023.pdf
Ahmed791434
 
PPTX
Confluent:AWS - GameDay.pptx
Ahmed791434
 
PDF
App modernization on AWS with Apache Kafka and Confluent Cloud
Kai Wähner
 
PDF
Building Modern Streaming Analytics with Confluent on AWS
confluent
 
PDF
Single View of Data
confluent
 
PDF
Confluent Partner Tech Talk with Reply
confluent
 
PDF
Strategies For Migrating From SQL to NoSQL — The Apache Kafka Way
ScyllaDB
 
PDF
Streaming Time Series Data With Kenny Gorman and Elena Cuevas | Current 2022
HostedbyConfluent
 
PPTX
Confluent-Ably-AWS-ID-2023 - GSlide.pptx
Ahmed791434
 
PDF
Real-Time Analytics with Confluent and MemSQL
SingleStore
 
PPTX
Unlock value with Confluent and AWS.pptx
Ahmed791434
 
PDF
Big data on aws
Serkan Özal
 
PDF
Get More from your Data: Accelerate Time-to-Value and Reduce TCO with Conflue...
HostedbyConfluent
 
PDF
Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...
HostedbyConfluent
 
PDF
Bridge to Cloud: Using Apache Kafka to Migrate to AWS
confluent
 
PPTX
AWS Immersion Day Mapfre - Confluent
confluent
 
PPTX
Keine Angst vorm Dinosaurier: Mainframe-Integration und -Offloading mit Confl...
Precisely
 
PDF
Confluent Partner Tech Talk with QLIK
confluent
 
PDF
Building Real-Time Serverless Data Applications With Joseph Morais and Adam W...
HostedbyConfluent
 
Build real-time streaming data pipelines to AWS with Confluent
confluent
 
Confluent_AWS_ImmersionDay_Q42023.pdf
Ahmed791434
 
Confluent:AWS - GameDay.pptx
Ahmed791434
 
App modernization on AWS with Apache Kafka and Confluent Cloud
Kai Wähner
 
Building Modern Streaming Analytics with Confluent on AWS
confluent
 
Single View of Data
confluent
 
Confluent Partner Tech Talk with Reply
confluent
 
Strategies For Migrating From SQL to NoSQL — The Apache Kafka Way
ScyllaDB
 
Streaming Time Series Data With Kenny Gorman and Elena Cuevas | Current 2022
HostedbyConfluent
 
Confluent-Ably-AWS-ID-2023 - GSlide.pptx
Ahmed791434
 
Real-Time Analytics with Confluent and MemSQL
SingleStore
 
Unlock value with Confluent and AWS.pptx
Ahmed791434
 
Big data on aws
Serkan Özal
 
Get More from your Data: Accelerate Time-to-Value and Reduce TCO with Conflue...
HostedbyConfluent
 
Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...
HostedbyConfluent
 
Bridge to Cloud: Using Apache Kafka to Migrate to AWS
confluent
 
AWS Immersion Day Mapfre - Confluent
confluent
 
Keine Angst vorm Dinosaurier: Mainframe-Integration und -Offloading mit Confl...
Precisely
 
Confluent Partner Tech Talk with QLIK
confluent
 
Building Real-Time Serverless Data Applications With Joseph Morais and Adam W...
HostedbyConfluent
 
Ad

More from Kai Wähner (20)

PDF
Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)
Kai Wähner
 
PDF
Kafka for Live Commerce to Transform the Retail and Shopping Metaverse
Kai Wähner
 
PDF
Apache Kafka vs. Cloud-native iPaaS Integration Platform Middleware
Kai Wähner
 
PDF
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?
Kai Wähner
 
PDF
Apache Kafka in the Healthcare Industry
Kai Wähner
 
PDF
Apache Kafka in the Healthcare Industry
Kai Wähner
 
PDF
Apache Kafka for Real-time Supply Chain in the Food and Retail Industry
Kai Wähner
 
PDF
Kafka for Real-Time Replication between Edge and Hybrid Cloud
Kai Wähner
 
PDF
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0
Kai Wähner
 
PDF
Apache Kafka Landscape for Automotive and Manufacturing
Kai Wähner
 
PDF
Kappa vs Lambda Architectures and Technology Comparison
Kai Wähner
 
PPTX
The Top 5 Apache Kafka Use Cases and Architectures in 2022
Kai Wähner
 
PDF
Event Streaming CTO Roundtable for Cloud-native Kafka Architectures
Kai Wähner
 
PDF
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...
Kai Wähner
 
PDF
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...
Kai Wähner
 
PDF
Apache Kafka for Cybersecurity and SIEM / SOAR Modernization
Kai Wähner
 
PDF
IBM Cloud Pak for Integration with Confluent Platform powered by Apache Kafka
Kai Wähner
 
PDF
Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies?
Kai Wähner
 
PDF
Apache Kafka in the Insurance Industry
Kai Wähner
 
PDF
Apache Kafka and MQTT - Overview, Comparison, Use Cases, Architectures
Kai Wähner
 
Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)
Kai Wähner
 
Kafka for Live Commerce to Transform the Retail and Shopping Metaverse
Kai Wähner
 
Apache Kafka vs. Cloud-native iPaaS Integration Platform Middleware
Kai Wähner
 
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?
Kai Wähner
 
Apache Kafka in the Healthcare Industry
Kai Wähner
 
Apache Kafka in the Healthcare Industry
Kai Wähner
 
Apache Kafka for Real-time Supply Chain in the Food and Retail Industry
Kai Wähner
 
Kafka for Real-Time Replication between Edge and Hybrid Cloud
Kai Wähner
 
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0
Kai Wähner
 
Apache Kafka Landscape for Automotive and Manufacturing
Kai Wähner
 
Kappa vs Lambda Architectures and Technology Comparison
Kai Wähner
 
The Top 5 Apache Kafka Use Cases and Architectures in 2022
Kai Wähner
 
Event Streaming CTO Roundtable for Cloud-native Kafka Architectures
Kai Wähner
 
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...
Kai Wähner
 
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...
Kai Wähner
 
Apache Kafka for Cybersecurity and SIEM / SOAR Modernization
Kai Wähner
 
IBM Cloud Pak for Integration with Confluent Platform powered by Apache Kafka
Kai Wähner
 
Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies?
Kai Wähner
 
Apache Kafka in the Insurance Industry
Kai Wähner
 
Apache Kafka and MQTT - Overview, Comparison, Use Cases, Architectures
Kai Wähner
 
Ad

Recently uploaded (20)

PDF
Linux Certificate of Completion - LabEx Certificate
VICTOR MAESTRE RAMIREZ
 
PPTX
Comprehensive Guide: Shoviv Exchange to Office 365 Migration Tool 2025
Shoviv Software
 
PPTX
Engineering the Java Web Application (MVC)
abhishekoza1981
 
PPTX
MiniTool Power Data Recovery Full Crack Latest 2025
muhammadgurbazkhan
 
DOCX
Import Data Form Excel to Tally Services
Tally xperts
 
PDF
Mobile CMMS Solutions Empowering the Frontline Workforce
CryotosCMMSSoftware
 
PDF
Capcut Pro Crack For PC Latest Version {Fully Unlocked} 2025
hashhshs786
 
PPTX
Migrating Millions of Users with Debezium, Apache Kafka, and an Acyclic Synch...
MD Sayem Ahmed
 
PPTX
Fundamentals_of_Microservices_Architecture.pptx
MuhammadUzair504018
 
PPTX
Agentic Automation Journey Session 1/5: Context Grounding and Autopilot for E...
klpathrudu
 
PPTX
3uTools Full Crack Free Version Download [Latest] 2025
muhammadgurbazkhan
 
PDF
Executive Business Intelligence Dashboards
vandeslie24
 
PPTX
Writing Better Code - Helping Developers make Decisions.pptx
Lorraine Steyn
 
PPTX
The Role of a PHP Development Company in Modern Web Development
SEO Company for School in Delhi NCR
 
PDF
Salesforce CRM Services.VALiNTRY360
VALiNTRY360
 
PDF
Powering GIS with FME and VertiGIS - Peak of Data & AI 2025
Safe Software
 
PDF
Efficient, Automated Claims Processing Software for Insurers
Insurance Tech Services
 
PDF
GetOnCRM Speeds Up Agentforce 3 Deployment for Enterprise AI Wins.pdf
GetOnCRM Solutions
 
PDF
Build It, Buy It, or Already Got It? Make Smarter Martech Decisions
bbedford2
 
PPTX
Feb 2021 Cohesity first pitch presentation.pptx
enginsayin1
 
Linux Certificate of Completion - LabEx Certificate
VICTOR MAESTRE RAMIREZ
 
Comprehensive Guide: Shoviv Exchange to Office 365 Migration Tool 2025
Shoviv Software
 
Engineering the Java Web Application (MVC)
abhishekoza1981
 
MiniTool Power Data Recovery Full Crack Latest 2025
muhammadgurbazkhan
 
Import Data Form Excel to Tally Services
Tally xperts
 
Mobile CMMS Solutions Empowering the Frontline Workforce
CryotosCMMSSoftware
 
Capcut Pro Crack For PC Latest Version {Fully Unlocked} 2025
hashhshs786
 
Migrating Millions of Users with Debezium, Apache Kafka, and an Acyclic Synch...
MD Sayem Ahmed
 
Fundamentals_of_Microservices_Architecture.pptx
MuhammadUzair504018
 
Agentic Automation Journey Session 1/5: Context Grounding and Autopilot for E...
klpathrudu
 
3uTools Full Crack Free Version Download [Latest] 2025
muhammadgurbazkhan
 
Executive Business Intelligence Dashboards
vandeslie24
 
Writing Better Code - Helping Developers make Decisions.pptx
Lorraine Steyn
 
The Role of a PHP Development Company in Modern Web Development
SEO Company for School in Delhi NCR
 
Salesforce CRM Services.VALiNTRY360
VALiNTRY360
 
Powering GIS with FME and VertiGIS - Peak of Data & AI 2025
Safe Software
 
Efficient, Automated Claims Processing Software for Insurers
Insurance Tech Services
 
GetOnCRM Speeds Up Agentforce 3 Deployment for Enterprise AI Wins.pdf
GetOnCRM Solutions
 
Build It, Buy It, or Already Got It? Make Smarter Martech Decisions
bbedford2
 
Feb 2021 Cohesity first pitch presentation.pptx
enginsayin1
 

Serverless Kafka on AWS as Part of a Cloud-native Data Lake Architecture

  • 1. The Rise of Data in Motion Serverless and Cloud-Native Event Streaming on AWS with Confluent Cloud
  • 2. © 2021, Amazon Web Services, Inc. or its Affiliates. Customers want more value from their data Used by many people Growing Exponentially From new sources Increasingly diverse Analyzed by many applications
  • 3. © 2021, Amazon Web Services, Inc. or its Affiliates. • The number of “smart” devices is projected to be 200 billion by 2020 (over 100X increase in ten years) • 90% of the data in the world was generated in the last 2 years • There are 2.5 quintillion bytes of data created each day, and this pace is accelerating The volume of data being produced is increasing Source
  • 4. © 2021, Amazon Web Services, Inc. or its Affiliates. Customers moving from traditional data warehouse approach Data silos to OLTP ERP CRM LO B DW Silo 1 Business Intelligence Device s Web Sensors Socia l DW Silo 2 Business Intelligence Data Lake Non- relational databases Machine learning Data warehousing Log analytics Big data processing Relational databases
  • 5. © 2021, Amazon Web Services, Inc. or its Affiliates. Lake House architecture SCALABLE DATA LAKES PURPOSE-BUILT DATA SERVICES SEAMLESS DATA MOVEMENT UNIFIED GOVERNANCE PERFORMANT AND COST-EFFECTIVE Non-relational databases Machine learning Data warehousing Log analytics Big data processing Relational databases Data lake
  • 6. © 2021, Amazon Web Services, Inc. or its Affiliates. Lake House architecture on AWS SCALABLE DATA LAKES PURPOSE-BUILT DATA SERVICES SEAMLESS DATA MOVEMENT UNIFIED GOVERNANCE PERFORMANT AND COST-EFFECTIVE Amazon DynamoDB Amazon SageMaker Amazon Redshift Amazon Elasticsearch Service Amazon EMR Amazon S3 Amazon Aurora Amazon Athena
  • 7. © 2021, Amazon Web Services, Inc. or its Affiliates. The value of data diminishes over time
  • 8. This is a fundamental paradigm shift... 8 Infrastructure as code Data in motion as continuous streams of events Future of the datacenter Future of data Cloud Event Streaming
  • 9. An Event Streaming Platform is the Underpinning of an Event-driven Architecture 9 MES ERP Sensors Mobile Customer 360 Real-time Alerting System Data warehouse Producers Consumers Streams of real time events Stream processing apps Connectors Connectors Stream processing apps Supplier Alert Forecast Inventory Customer Order
  • 10. Car Engine Car Self-driving Car Confluent Completes Apache Kafka
  • 11. Truly CLOUD-NATIVE experience at the edge, in the data center, and in the cloud Confluent Cloud A fully managed, cloud-native service for Apache Kafka Confluent Platform A complete, enterprise-grade distribution of Apache Kafka Confluent for Kubernetes Ansible Playbooks Packages: Docker, RPMs, Tarball Public Cloud Workloads Edge and On-Premise Workloads On Kubernetes On VMs / Bare Metal Wavelength
  • 12. STREAM PROCESSING CONNECTORS Example Architecture for Event Streaming ksqlDB KStreams Processing Data in Motion with Confluent Cloud on AWS Dashboard Oracle DB Oracle CDC CONNECTOR Salesforce CDC CONNECTOR Salesforce Source / Sink CONNECTOR Fraud Detection App
  • 13. Context-specific Customer 360 13 Electrical retailer Hyper-personalized online retail experience, turning each customer visit into a one-on-one marketing opportunity Correlation of historical customer data with real- time digital signals Maximize customer satisfaction and revenue growth, increased customer conversions https://www.confluent.io/customers/ao/
  • 14. Ingest & Process Capture event streams with a consistent data structure using Schema Registry, develop real-time ETL pipelines with a lightweight SQL syntax using ksqlDB & unify real-time streams with batch processing using +100 Confluent Connectors Derive insights from data in real-time Mobile Web IoT Data store AWS & On-prem Amazon S3 S3 Sink ANALYZE Amazon Redshift AWS Lake Formation Amazon Athena Redshift Sink TRANSFORM Amazon EMR AWS Data Pipeline AWS Glue Source connectors Store & Analyze Stream data with Confluent pre-built Connectors into your AWS data lake or data warehouse to execute queries on vast amounts of streaming data for real-time and batch analytics VISUALIZE Amazon Elasticsearch Schema Registry ksqlDB Events Real-time analytics
  • 15. Serverless integration Connect existing and apps & data stores in a repeatable way without having to manage- Apache Kafka, Schema Registry to maintain app compatibility, ksqlDB to develop real-time apps with SQL syntax and Connect for effortless integrations with Lambda & data stores AWS serverless platform Stop provisioning, maintaining or administering servers for backend components such as compute, databases and storage so that you can focus on increasing agility and innovation for your developer teams Increase developer agility & speed of innovation Apps Microservices ksqlDB Schema Registry COMPUTE AWS Lambda Data stores REST Proxy & Clients Source Connectors Lambda Sink DATA STORES Amazon DynamoDB Amazon Aurora STORAGE Amazon S3 S3 Sink ANALYTICS Amazon Athena Amazon Redshift Serverless app integration
  • 16. Accelerate modernization from on-prem to AWS Redshift Sink Lambda Sink AWS Direct Connect LEGACY EDW MAINFRAME LEGACY DB JDBC / CDC connectors Connect Leverage +100 Confluent pre-built connectors to continuously bring valuable data from existing services on-prem including enterprise data warehouse, databases and mainframes Modernize Increase agility in getting applications to market and reduce TCO when freeing up resources to focus on value generating activities and not in managing servers On-prem AWS Cloud Bridge Hybrid cloud streaming with consistent, event- driven architecture for modern apps On-prem to AWS modernization Amazon Athena AWS Glue SageMaker Lake Formation Amazon DynamoDB Amazon Aurora S3 Sink Data Streams Apps ksqlDB Cluster Linking
  • 17. Low Latency 5G Use Cases with AWS Wavelength (based on AWS Outposts) and Confluent
  • 18. Global Event Streaming Streaming Replication between Clusters across Cloud, On-Prem and Edge Bridge to Databases, Data Lakes, Apps, APIs, SaaS Aggregate Small Footprint Edge Deployments with Replication (Aggregation) Simplify Disaster Recovery Operations with Multi-Region Clusters for RPO=0 and RTO~0 Stream Data Globally with Replication and Cluster Linking 18
  • 19. Omnichannel Retail Time P C3 C2 C1 Sales Talk on site in Car Dealership Right now Location-based Customer Action Customer 360 (Website, Mobile App, On Site in Store, In-Car) Car Configurator 10 and 8 days ago Context-specific Marketing Campaign 90 and 60 days ago AWS Lambda
  • 20. Omnichannel Retail Time P C3 C2 C1 Machine Learning Context-specific Recommendations Location-based Customer Action Customer 360 (Business Intelligence, Machine Learning) Machine Learning Train Recommendation Engine Reporting All Customer Interactions Amazon Athena Amazon SageMaker
  • 21. CRM 3rd party payment provider Context-specific real-time upsell Customer data Payment processing and fraud detection as a service Manager Get report API Customer Customer Customer data Train schedule Payment data Loyalty information Streams of real time events Customer data Train schedule Payment data Loyalty information Streams of real time events Customer data Train schedule Payment data Loyalty information Streams of real time events Hybrid Retail Architecture
  • 22. Point of Sale (POS) Loyalty System Local Inventory Management Payment Discount Customer data Train schedule Payment data Loyalty information Streams of real time events Global Inventory Management Event Streaming at the Edge in the Smart Retail Store Item Availability
  • 23. Disconnected Edge Time P C3 C2 C1 Context-specific Advertisement Real-time (Milliseconds) Location-based Customer Action Always on (even “offline”) Replayability Reduced traffic cost Better latency Payment Processing Near Real-time (Seconds) Replication to Cloud Batch (Depending on Network Bandwidth)
  • 26. Confluent Schema Registry The de-facto schemas metadata repository for data in motion Schema Registry Producer Kafka serializer Kafka deserializer Consumer Kafka 3. Produce message with schema ID 5. Consume message with schema ID 6. Ask for schema given schema ID 7. Return schema Invalid message Invalid message 4. Is this a valid schema ID? 1. Register schema 2. Return schema ID
  • 27. Confluent Cloud Data Governance Data Quality Increase data trust ● Schemas management UI ● Broker-side schema ID validation Data Catalog Classify, organize, discover ● Search and discover schemas metadata ● Manage data classifications ● Classify schemas with tags Data Lineage Turn data visibility on ● Visualize complex data in motion pipelines ● Audit data movement across systems NOW IN EARLY-ACCESS 27
  • 28. Car Engine Car Self-driving Car Confluent Completes Apache Kafka
  • 29. Confluent Cloud + : Accelerate Business Value for Customers Topline Impacting New Experiences ● Event-driven & real-time ● Unify data across org. w/ Kafka data fabric (Schema Reg,..) ● AWS Analytics, Redshift, ML connectors Mitigate Risk ● Higher Service Quality & Resilience with 99.95% SLA ● Deep Kafka expertise & innovation ● Elastic billing/pricing Developer Agility ● Focus on innovation (not data infrastructure) ● Leverage full Kafka OSS ecosystem + AWS services Faster Time to Market ● ~50-75% faster time to market* ● Streamline hybrid cloud migration with no complex lift-n- shift ● Maintain business continuity Lower Kafka TCO ● ~25-50% lower TCO * ● GBps-scale & fast deployments for global expansion ● Deploy Kafka at scale in 1 week Maximize ROI ● ~200% ROI per Forrester study ● Save 10s of $Ms with legacy offload to AWS with Confluent Replicator * For customers that don’t already have Kafka based system in-market * TCO assessment to be analyzed for specific customer scenarios