SlideShare a Scribd company logo
1
Introduction to Confluent Operator
to establish a Cloud-Native Confluent Platform
and provide a Kafka Operator for Kubernetes
Kai Waehner
Technology Evangelist
contact@kai-waehner.de
LinkedIn
@KaiWaehner
www.confluent.io
www.kai-waehner.de
2
Agenda
● Cloud Native vs. SaaS / Serverless Kafka
● The Emergence of Kubernetes
● Kafka on K8s Deployment Challenges
● Confluent Operator as Kafka Operator
● Q&A
Confluent
Operator
3
Agenda
● Cloud Native vs. SaaS / Serverless Kafka
● The Emergence of Kubernetes
● Kafka on K8s Deployment Challenges
● Confluent Operator as Kafka Operator
● Q&A
Confluent
Operator
4
Business Digitalization Trends are Driving the Need to Process
Events at a whole new Scale, Speed and Efficiency
Mobile Cloud Microservices Internet of Things Machine Learning
The world has changed!
5Best-of-breed Platforms, Partners and Services for Multi-cloud Streams
Private Cloud
Deploy on bare-metal, VMs,
containers or Kubernetes in your
datacenter with Confluent Platform
and Confluent Operator
Public Cloud
Implement self-managed in the public
cloud or adopt a fully managed service
with Confluent Cloud
Hybrid Cloud
Build a persistent bridge between
datacenter and cloud with
Confluent Replicator
Confluent
Replicator
VM
SELF MANAGED FULLY MANAGED
6
Software as a Service (SaaS)
● a software distribution model in
which a third-party provider hosts
applications and makes them
available to customers over the
Internet.
● provides SLAs like uptime
guarantees, throughput, latency, etc.
● Depending on your definition, some
also call this Serverless (BaaS,
Backend as a Service) for
infrastructure components
7
Confluent Cloud
Cloud-Native Confluent Platform Fully-Managed Service
Available on the leading public clouds with mission-critical SLAs.
Serverless Kafka characteristics:
Pay-as-you-go, elastic auto-scaling, abstracting infrastructure (topics not brokers)
8
Confluent Cloud, What does Fully-managed Mean?
Infrastructure
management
(commodity)
Scaling
● Upgrades (latest stable version of Kafka)
● Patching
● Maintenance
● Sizing (retention, latency, throughput, storage, etc.)
● Data balancing for optimal performance
● Performance tuning for real-time and latency requirements
● Fixing Kafka bugs
● Uptime monitoring and proactive remediation of issues
● Recovery support from data corruption
● Scaling the cluster as needed
● Data balancing the cluster as nodes are added
● Support for any Kafka issue with less than 60 minute response time
Infra-as-a-Service
Harness full power of Kafka
Kafka-specific
management
Platform-as-a-Service
Evolve as you
need
Future-proof
Mission-critical reliability
Most Kafka as a Service offerings are partially-managed
9
What is Cloud Native?
Benefits
• Scalable
• Flexible
• Agile
• Elastic
• Automated
• Etc.
10
The Twelve-Factor App
What is Cloud Native?
https://12factor.net/
11
What is Cloud Native?
10 key characteristics (one of many definitions)
● Packaged as lightweight containers
● Developed with best-of-breed languages and frameworks
● Designed as loosely coupled microservices
● Centered around APIs for interaction and collaboration
● Architected with a clean separation of stateless and stateful services
● Isolated from server and operating system dependencies
● Deployed on self-service, elastic, cloud infrastructure
● Managed through agile DevOps processes
● Automated capabilities
● Defined, policy-driven resource allocation
https://thenewstack.io/10-key-attributes-of-cloud-native-applications/
12
Agenda
● Cloud Native vs. SaaS / Serverless Kafka
● The Emergence of Kubernetes
● Kafka on K8s Deployment Challenges
● Confluent Operator as Kafka Operator
● Q&A
Confluent
Operator
13
Cloud-Native Platforms in last 5 years
14
Kubernetes won the battle!
15
Kubernetes innovation not stopping…
● Cloud providers provide Kubernetes as a service
○ AWS, Azure, GCP, …
○ Not all are mature yet J
● Stateful deployments leverage the Kubernetes
Operator pattern
○ For many infrastructure components, like databases, messaging, etc.
○ Community projects, vendors solutions (open source vs. proprietary)
● Service Mesh
○ Envoy, Istio, Linkerd, ...
○ Pull request for Kafka protocol support in Envoy / Istio accepted
recently
16
Evolution of Kafka DevOps
Shell scripts
Ansible/Chef Docker
Kubernetes
17
Agenda
● Cloud Native vs. SaaS / Serverless Kafka
● The Emergence of Kubernetes
● Kafka on K8s Deployment Challenges
● Confluent Operator as Kafka Operator
● Q&A
Confluent
Operator
18
Kafkaesque world of Kafka on Kubernetes
19
Kafka on Kubernetes –
It’s tricky ©
• Translating an existing architecture to Kubernetes
• Failover handling and data balancing
• Communication between ZooKeeper, Kafka Brokers, Clients (Java,
REST, Connect, KSQL), Schema Registry, etc.
• External access from / to outside Kubernetes cluster
• Persistent storage options on prem and in the cloud
• Security configuration
• Rolling upgrades
• Etc.
20
Agenda
● Cloud Native vs. SaaS / Serverless Kafka
● The Emergence of Kubernetes
● Kafka on K8s Deployment Challenges
● Confluent Operator as Kafka Operator
● Q&A
Confluent
Operator
● Global-scale
● Real-time
● Persistent Storage
● Stream Processing
Apache Kafka: The De-facto Standard for Real-Time Event Streaming
Edge
Cloud
Data LakeDatabases
Datacenter
IoT
SaaS AppsMobile
Microservices Machine
Learning
Apache Kafka
22
Real-Time
Inventory
Real-Time
Fraud
Detection
Real-Time
Customer 360
Machine
Learning
Models
Real-Time
Data
Transformation
...
Contextual Event-Driven Applications
Universal Event Pipeline
Data Stores Logs 3rd Party Apps Custom Apps/Microservices
TREAMSSTREAMS
CONNECT CLIENTS
23Confluent establishes Freedom of Choice
● SaaS or Self-Managed or Hybrid
● Confluent’s vision is to introduce cloud native
capabilities to Confluent Platform and enable users who
want a cloud native experience in on-premises and self-
managed cloud environments
● Introducing Confluent Operator
24
Confluent Platform
Operations and Security
Development & Stream Processing
Support,services,training&partners
Apache Kafka
Security plugins | Role-Based Access Control
Control Center | Replicator | Auto Data Balancer | Operator
Connectors
Clients | REST Proxy
MQTT Proxy | Schema Registry
KSQL
Connect Continuous Commit Log Streams
Complete Event
Streaming Platform
Mission-critical
Reliability
Freedom of Choice
Datacenter Public Cloud Confluent Cloud
Self-Managed Software Fully-Managed Service
25
Confluent Operator
● Deployment and management
automation for Confluent
Platform on Kubernetes
● Including Apache Kafka, Zookeeper, Schema
Registry, Connect, Control Center, Replicator, KSQL
● For organizations standardized on
Kubernetes as platform runtime
● Operationalizes years of experience
running Kafka on Kubernetes on the
leading public clouds
Confluent Platform
Confluent Operator
Kubernetes
AWS Azure GCP
RH OpenShift Mesosphere Pivotal
On-Premises Cloud
Docker Images
Automate Deployment of Confluent Platform on Kubernetes on Any Platform at Any Scale
26
Confluent Operator enables you to:
Automate provisioning of
Kafka pods in minutes
Monitor SLAs through Confluent
Control Center or Prometheus
Scale Kafka elastically &
Automate rolling updates
Built on our first hand knowledge
of running Confluent at scale
Cloud-Native Deployment of Kafka and Confluent Platform
Confluent’s Kubernetes Journey
05/2017
Confluent Cloud
Early Access
2016
Confluent Cloud
Development
11/2017
Confluent Cloud
GA (AWS)
07/2019
Confluent Operator GA
(Confluent Platform)
2019
Confluent Cloud GA
on AWS, GCP, Azure
28
Confluent Operator:
A custom Kubernetes Controller
API
Server
Scheduler
Controllers &
Custom Controllers
PODS
PVs
ConfigMaps
StatefulSets
Nodes and pods are where
Applications run on Kubernetes
Applications use objects like
StatefulSets, Configmaps, PVs
Custom Controllers create custom
resources that provide unique
application functionality:
● Upgrades, Elasticity, Kafka
Operational Logic
Custom
Resources
Master Node
Worker Nodes
29Helm - The Confluent Operator Package Manager
● Confluent Operator leverages Helm Charts to
deploy, upgrade and uninstall Confluent
Platform custom resources and pods
● Configuration front end for users to specify
how a Confluent Platform Cluster is
deployed:
○ # of replicas for Kafka, Zookeeper
○ Security and Authentication configuration
○ Persistent Storage configuration
● Cluster configuration edits are also
performed using Helm
Operator Helm
Charts - yaml
30
Kubernetes
Cluster K8 NodeK8 NodeK8 Node
Replicator
Pod
C3 Pod SR Pod
K8 Node
Confluent Operator Deployment
Operator
Kafka
Pod
ZK Pod
Persistent Volumes
(AWS EBS, GCE Persistent Disk, Local Persistent Volume, etc.)
External
Access
Load
Balancers
Configurations
ConfigMapsKSQL Pod
REST Proxy
Pod
31
Confluent Operator - Automated Provisioning
32
Confluent Operator - Automated Security
Configuration
SASL PLAIN, SASL_SSL, TLS with
Mutual Authentication
Automate configuration of truststores
and keystores with secret objects
Automate configuration of Kafka and
all Confluent Platform Components
33
Confluent Operator - Scale Horizontally
Automate Scaling:
Spin up new brokers, connect workers
easily
Distribute partitions to new brokers:
Determine balancing plan
Execute balancing plan
Monitor Resources
34
Confluent Operator - Rolling Upgrade of all
components
Automated Rolling Upgrades of all
components - Kafka Brokers,
Zookeeper, Connect, Control Center
Kafka Broker Upgrades:
1. Stop the broker, upgrade Kafka
2. Wait for Partition Leader reassignment
3. Start the upgraded broker
4. Wait for zero under-replicated partitions
5. Upgrade the next broker
Agenda
● Cloud Native vs. SaaS / Serverless Kafka
● The Emergence of Kubernetes
● Kafka on K8s Deployment Challenges
● Confluent Operator as Kafka Operator
● Q&A
Confluent
Operator
37
Kai Waehner
Technology Evangelist
contact@kai-waehner.de
@KaiWaehner
www.confluent.io
www.kai-waehner.de
LinkedIn
Questions? Feedback?
Let’s connect!

More Related Content

What's hot (20)

PDF
[IBM 서버] 노후서버는 왜 교체해야 하는가
(Joe), Sanghun Kim
 
PDF
Oracle GoldenGate入門
オラクルエンジニア通信
 
PDF
M12_数百台の開発サーバをリフトアンドシフト! Azure Migrate 活用ポイント [Microsoft Japan Digital Days]
日本マイクロソフト株式会社
 
PPTX
Spring Cloud Data Flow で構成される IIJ IoTサービス
Kenji Kondo
 
PDF
Distributed stream processing with Apache Kafka
confluent
 
PPTX
Click-Through Example for Flink’s KafkaConsumer Checkpointing
Robert Metzger
 
PPTX
Web App for Containers + MySQLでコンテナ対応したRailsアプリを作ろう!
Yoichi Kawasaki
 
PDF
OpenStack Ironic - Bare Metal-as-a-Service
Ramon Acedo Rodriguez
 
PDF
Virtualization with KVM (Kernel-based Virtual Machine)
Novell
 
PDF
手作業なしの安定環境実現に向けたZabbix活用方法紹介+Zabbix2.4最新機能紹介
Daisuke Ikeda
 
PDF
Amazon EKS上の開発体験を最大化するプレビュー環境の作り方
理弘 山崎
 
PDF
Confluent Cloud로 이벤트 기반 마이크로서비스 10배 확장하기 with 29CM
confluent
 
PPTX
PIW ISE best practices
Sergey Kucherenko
 
PDF
System Center 2012, Endpoint Protectionの運用
Suguru Kunii
 
PDF
Under the Hood: Open vSwitch & OpenFlow in XCP & XenServer
The Linux Foundation
 
PDF
Monitor every app, in every stage, with free and open Elastic APM
Elasticsearch
 
PDF
Achieving CI/CD with Kubernetes
Ramit Surana
 
PPTX
[NDC17] Kubernetes로 개발서버 간단히 찍어내기
SeungYong Oh
 
PPT
IBM WebSphere Application Server traditional and Docker
David Currie
 
PDF
Kyma: Extending Business systems with Kubernetes, Istio and <fill the blank>.
SAP HANA Cloud Platform
 
[IBM 서버] 노후서버는 왜 교체해야 하는가
(Joe), Sanghun Kim
 
Oracle GoldenGate入門
オラクルエンジニア通信
 
M12_数百台の開発サーバをリフトアンドシフト! Azure Migrate 活用ポイント [Microsoft Japan Digital Days]
日本マイクロソフト株式会社
 
Spring Cloud Data Flow で構成される IIJ IoTサービス
Kenji Kondo
 
Distributed stream processing with Apache Kafka
confluent
 
Click-Through Example for Flink’s KafkaConsumer Checkpointing
Robert Metzger
 
Web App for Containers + MySQLでコンテナ対応したRailsアプリを作ろう!
Yoichi Kawasaki
 
OpenStack Ironic - Bare Metal-as-a-Service
Ramon Acedo Rodriguez
 
Virtualization with KVM (Kernel-based Virtual Machine)
Novell
 
手作業なしの安定環境実現に向けたZabbix活用方法紹介+Zabbix2.4最新機能紹介
Daisuke Ikeda
 
Amazon EKS上の開発体験を最大化するプレビュー環境の作り方
理弘 山崎
 
Confluent Cloud로 이벤트 기반 마이크로서비스 10배 확장하기 with 29CM
confluent
 
PIW ISE best practices
Sergey Kucherenko
 
System Center 2012, Endpoint Protectionの運用
Suguru Kunii
 
Under the Hood: Open vSwitch & OpenFlow in XCP & XenServer
The Linux Foundation
 
Monitor every app, in every stage, with free and open Elastic APM
Elasticsearch
 
Achieving CI/CD with Kubernetes
Ramit Surana
 
[NDC17] Kubernetes로 개발서버 간단히 찍어내기
SeungYong Oh
 
IBM WebSphere Application Server traditional and Docker
David Currie
 
Kyma: Extending Business systems with Kubernetes, Istio and <fill the blank>.
SAP HANA Cloud Platform
 

Similar to Confluent Operator as Cloud-Native Kafka Operator for Kubernetes (20)

PDF
OSO Confluent GitOps Demo
Sion Smith
 
PPTX
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
confluent
 
PDF
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with K...
confluent
 
PPTX
AWS Immersion Day Mapfre - Confluent
confluent
 
PDF
Apache Kafka 2.3 + Confluent Platform 5.3 => What's New?
Kai Wähner
 
PDF
DIMT 2023 SG - Hands-on Workshop_ Getting started with Confluent Cloud.pdf
confluent
 
PDF
Citi Tech Talk: Hybrid Cloud
confluent
 
PDF
Why Cloud-Native Kafka Matters: 4 Reasons to Stop Managing it Yourself
DATAVERSITY
 
PDF
Au delà des brokers, un tour de l’environnement Kafka | Florent Ramière
confluent
 
PPTX
Best Practices for Building Hybrid-Cloud Architectures | Hans Jespersen
confluent
 
PPTX
Cloud native Kafka | Sascha Holtbruegge and Margaretha Erber, HiveMQ
HostedbyConfluent
 
PDF
The Bridge to Cloud (Peter Gustafsson, Confluent) London 2019 Confluent Strea...
confluent
 
PDF
DIMT '23 Session_Demo_ Latest Innovations Breakout.pdf
confluent
 
PDF
How Confluent Completes the Event Streaming Platform (Addison Huddy & Dan Ros...
HostedbyConfluent
 
PDF
stackconf 2020 | The path to a Serverless-native era with Kubernetes by Paolo...
NETWAYS
 
PDF
Benefits of Stream Processing and Apache Kafka Use Cases
confluent
 
PDF
Reinventing Kafka in the Data Streaming Era - Jun Rao
confluent
 
PDF
Kubernetes-Native DevOps: For Apache Kafka® with Confluent
confluent
 
PDF
Moving 150 TB of data resiliently on Kafka With Quorum Controller on Kubernet...
HostedbyConfluent
 
PDF
Bridge to Cloud: Using Apache Kafka to Migrate to AWS
confluent
 
OSO Confluent GitOps Demo
Sion Smith
 
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
confluent
 
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with K...
confluent
 
AWS Immersion Day Mapfre - Confluent
confluent
 
Apache Kafka 2.3 + Confluent Platform 5.3 => What's New?
Kai Wähner
 
DIMT 2023 SG - Hands-on Workshop_ Getting started with Confluent Cloud.pdf
confluent
 
Citi Tech Talk: Hybrid Cloud
confluent
 
Why Cloud-Native Kafka Matters: 4 Reasons to Stop Managing it Yourself
DATAVERSITY
 
Au delà des brokers, un tour de l’environnement Kafka | Florent Ramière
confluent
 
Best Practices for Building Hybrid-Cloud Architectures | Hans Jespersen
confluent
 
Cloud native Kafka | Sascha Holtbruegge and Margaretha Erber, HiveMQ
HostedbyConfluent
 
The Bridge to Cloud (Peter Gustafsson, Confluent) London 2019 Confluent Strea...
confluent
 
DIMT '23 Session_Demo_ Latest Innovations Breakout.pdf
confluent
 
How Confluent Completes the Event Streaming Platform (Addison Huddy & Dan Ros...
HostedbyConfluent
 
stackconf 2020 | The path to a Serverless-native era with Kubernetes by Paolo...
NETWAYS
 
Benefits of Stream Processing and Apache Kafka Use Cases
confluent
 
Reinventing Kafka in the Data Streaming Era - Jun Rao
confluent
 
Kubernetes-Native DevOps: For Apache Kafka® with Confluent
confluent
 
Moving 150 TB of data resiliently on Kafka With Quorum Controller on Kubernet...
HostedbyConfluent
 
Bridge to Cloud: Using Apache Kafka to Migrate to AWS
confluent
 
Ad

More from Kai Wähner (20)

PDF
Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)
Kai Wähner
 
PDF
When NOT to use Apache Kafka?
Kai Wähner
 
PDF
Kafka for Live Commerce to Transform the Retail and Shopping Metaverse
Kai Wähner
 
PDF
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
Kai Wähner
 
PDF
Apache Kafka vs. Cloud-native iPaaS Integration Platform Middleware
Kai Wähner
 
PDF
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?
Kai Wähner
 
PDF
Serverless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
Kai Wähner
 
PDF
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
Kai Wähner
 
PDF
Data Streaming with Apache Kafka in the Defence and Cybersecurity Industry
Kai Wähner
 
PDF
Apache Kafka in the Healthcare Industry
Kai Wähner
 
PDF
Apache Kafka in the Healthcare Industry
Kai Wähner
 
PDF
Apache Kafka for Real-time Supply Chain in the Food and Retail Industry
Kai Wähner
 
PDF
Kafka for Real-Time Replication between Edge and Hybrid Cloud
Kai Wähner
 
PDF
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0
Kai Wähner
 
PDF
Apache Kafka Landscape for Automotive and Manufacturing
Kai Wähner
 
PDF
Kappa vs Lambda Architectures and Technology Comparison
Kai Wähner
 
PPTX
The Top 5 Apache Kafka Use Cases and Architectures in 2022
Kai Wähner
 
PDF
Event Streaming CTO Roundtable for Cloud-native Kafka Architectures
Kai Wähner
 
PDF
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...
Kai Wähner
 
PDF
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...
Kai Wähner
 
Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)
Kai Wähner
 
When NOT to use Apache Kafka?
Kai Wähner
 
Kafka for Live Commerce to Transform the Retail and Shopping Metaverse
Kai Wähner
 
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
Kai Wähner
 
Apache Kafka vs. Cloud-native iPaaS Integration Platform Middleware
Kai Wähner
 
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?
Kai Wähner
 
Serverless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
Kai Wähner
 
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
Kai Wähner
 
Data Streaming with Apache Kafka in the Defence and Cybersecurity Industry
Kai Wähner
 
Apache Kafka in the Healthcare Industry
Kai Wähner
 
Apache Kafka in the Healthcare Industry
Kai Wähner
 
Apache Kafka for Real-time Supply Chain in the Food and Retail Industry
Kai Wähner
 
Kafka for Real-Time Replication between Edge and Hybrid Cloud
Kai Wähner
 
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0
Kai Wähner
 
Apache Kafka Landscape for Automotive and Manufacturing
Kai Wähner
 
Kappa vs Lambda Architectures and Technology Comparison
Kai Wähner
 
The Top 5 Apache Kafka Use Cases and Architectures in 2022
Kai Wähner
 
Event Streaming CTO Roundtable for Cloud-native Kafka Architectures
Kai Wähner
 
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...
Kai Wähner
 
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...
Kai Wähner
 
Ad

Recently uploaded (20)

PDF
Alexander Marshalov - How to use AI Assistants with your Monitoring system Q2...
VictoriaMetrics
 
PPTX
Hardware(Central Processing Unit ) CU and ALU
RizwanaKalsoom2
 
PPTX
MailsDaddy Outlook OST to PST converter.pptx
abhishekdutt366
 
PDF
Beyond Binaries: Understanding Diversity and Allyship in a Global Workplace -...
Imma Valls Bernaus
 
PDF
Mobile CMMS Solutions Empowering the Frontline Workforce
CryotosCMMSSoftware
 
PDF
vMix Pro 28.0.0.42 Download vMix Registration key Bundle
kulindacore
 
PPTX
How Apagen Empowered an EPC Company with Engineering ERP Software
SatishKumar2651
 
PPTX
Perfecting XM Cloud for Multisite Setup.pptx
Ahmed Okour
 
PDF
Powering GIS with FME and VertiGIS - Peak of Data & AI 2025
Safe Software
 
PPTX
How Odoo Became a Game-Changer for an IT Company in Manufacturing ERP
SatishKumar2651
 
PDF
MiniTool Partition Wizard 12.8 Crack License Key LATEST
hashhshs786
 
PPTX
Tally software_Introduction_Presentation
AditiBansal54083
 
PDF
Thread In Android-Mastering Concurrency for Responsive Apps.pdf
Nabin Dhakal
 
PDF
Revenue streams of the Wazirx clone script.pdf
aaronjeffray
 
PPTX
Java Native Memory Leaks: The Hidden Villain Behind JVM Performance Issues
Tier1 app
 
PPTX
Writing Better Code - Helping Developers make Decisions.pptx
Lorraine Steyn
 
PPTX
Migrating Millions of Users with Debezium, Apache Kafka, and an Acyclic Synch...
MD Sayem Ahmed
 
PDF
Efficient, Automated Claims Processing Software for Insurers
Insurance Tech Services
 
PDF
Alarm in Android-Scheduling Timed Tasks Using AlarmManager in Android.pdf
Nabin Dhakal
 
PPTX
Why Businesses Are Switching to Open Source Alternatives to Crystal Reports.pptx
Varsha Nayak
 
Alexander Marshalov - How to use AI Assistants with your Monitoring system Q2...
VictoriaMetrics
 
Hardware(Central Processing Unit ) CU and ALU
RizwanaKalsoom2
 
MailsDaddy Outlook OST to PST converter.pptx
abhishekdutt366
 
Beyond Binaries: Understanding Diversity and Allyship in a Global Workplace -...
Imma Valls Bernaus
 
Mobile CMMS Solutions Empowering the Frontline Workforce
CryotosCMMSSoftware
 
vMix Pro 28.0.0.42 Download vMix Registration key Bundle
kulindacore
 
How Apagen Empowered an EPC Company with Engineering ERP Software
SatishKumar2651
 
Perfecting XM Cloud for Multisite Setup.pptx
Ahmed Okour
 
Powering GIS with FME and VertiGIS - Peak of Data & AI 2025
Safe Software
 
How Odoo Became a Game-Changer for an IT Company in Manufacturing ERP
SatishKumar2651
 
MiniTool Partition Wizard 12.8 Crack License Key LATEST
hashhshs786
 
Tally software_Introduction_Presentation
AditiBansal54083
 
Thread In Android-Mastering Concurrency for Responsive Apps.pdf
Nabin Dhakal
 
Revenue streams of the Wazirx clone script.pdf
aaronjeffray
 
Java Native Memory Leaks: The Hidden Villain Behind JVM Performance Issues
Tier1 app
 
Writing Better Code - Helping Developers make Decisions.pptx
Lorraine Steyn
 
Migrating Millions of Users with Debezium, Apache Kafka, and an Acyclic Synch...
MD Sayem Ahmed
 
Efficient, Automated Claims Processing Software for Insurers
Insurance Tech Services
 
Alarm in Android-Scheduling Timed Tasks Using AlarmManager in Android.pdf
Nabin Dhakal
 
Why Businesses Are Switching to Open Source Alternatives to Crystal Reports.pptx
Varsha Nayak
 

Confluent Operator as Cloud-Native Kafka Operator for Kubernetes

  • 1. 1 Introduction to Confluent Operator to establish a Cloud-Native Confluent Platform and provide a Kafka Operator for Kubernetes Kai Waehner Technology Evangelist [email protected] LinkedIn @KaiWaehner www.confluent.io www.kai-waehner.de
  • 2. 2 Agenda ● Cloud Native vs. SaaS / Serverless Kafka ● The Emergence of Kubernetes ● Kafka on K8s Deployment Challenges ● Confluent Operator as Kafka Operator ● Q&A Confluent Operator
  • 3. 3 Agenda ● Cloud Native vs. SaaS / Serverless Kafka ● The Emergence of Kubernetes ● Kafka on K8s Deployment Challenges ● Confluent Operator as Kafka Operator ● Q&A Confluent Operator
  • 4. 4 Business Digitalization Trends are Driving the Need to Process Events at a whole new Scale, Speed and Efficiency Mobile Cloud Microservices Internet of Things Machine Learning The world has changed!
  • 5. 5Best-of-breed Platforms, Partners and Services for Multi-cloud Streams Private Cloud Deploy on bare-metal, VMs, containers or Kubernetes in your datacenter with Confluent Platform and Confluent Operator Public Cloud Implement self-managed in the public cloud or adopt a fully managed service with Confluent Cloud Hybrid Cloud Build a persistent bridge between datacenter and cloud with Confluent Replicator Confluent Replicator VM SELF MANAGED FULLY MANAGED
  • 6. 6 Software as a Service (SaaS) ● a software distribution model in which a third-party provider hosts applications and makes them available to customers over the Internet. ● provides SLAs like uptime guarantees, throughput, latency, etc. ● Depending on your definition, some also call this Serverless (BaaS, Backend as a Service) for infrastructure components
  • 7. 7 Confluent Cloud Cloud-Native Confluent Platform Fully-Managed Service Available on the leading public clouds with mission-critical SLAs. Serverless Kafka characteristics: Pay-as-you-go, elastic auto-scaling, abstracting infrastructure (topics not brokers)
  • 8. 8 Confluent Cloud, What does Fully-managed Mean? Infrastructure management (commodity) Scaling ● Upgrades (latest stable version of Kafka) ● Patching ● Maintenance ● Sizing (retention, latency, throughput, storage, etc.) ● Data balancing for optimal performance ● Performance tuning for real-time and latency requirements ● Fixing Kafka bugs ● Uptime monitoring and proactive remediation of issues ● Recovery support from data corruption ● Scaling the cluster as needed ● Data balancing the cluster as nodes are added ● Support for any Kafka issue with less than 60 minute response time Infra-as-a-Service Harness full power of Kafka Kafka-specific management Platform-as-a-Service Evolve as you need Future-proof Mission-critical reliability Most Kafka as a Service offerings are partially-managed
  • 9. 9 What is Cloud Native? Benefits • Scalable • Flexible • Agile • Elastic • Automated • Etc.
  • 10. 10 The Twelve-Factor App What is Cloud Native? https://12factor.net/
  • 11. 11 What is Cloud Native? 10 key characteristics (one of many definitions) ● Packaged as lightweight containers ● Developed with best-of-breed languages and frameworks ● Designed as loosely coupled microservices ● Centered around APIs for interaction and collaboration ● Architected with a clean separation of stateless and stateful services ● Isolated from server and operating system dependencies ● Deployed on self-service, elastic, cloud infrastructure ● Managed through agile DevOps processes ● Automated capabilities ● Defined, policy-driven resource allocation https://thenewstack.io/10-key-attributes-of-cloud-native-applications/
  • 12. 12 Agenda ● Cloud Native vs. SaaS / Serverless Kafka ● The Emergence of Kubernetes ● Kafka on K8s Deployment Challenges ● Confluent Operator as Kafka Operator ● Q&A Confluent Operator
  • 15. 15 Kubernetes innovation not stopping… ● Cloud providers provide Kubernetes as a service ○ AWS, Azure, GCP, … ○ Not all are mature yet J ● Stateful deployments leverage the Kubernetes Operator pattern ○ For many infrastructure components, like databases, messaging, etc. ○ Community projects, vendors solutions (open source vs. proprietary) ● Service Mesh ○ Envoy, Istio, Linkerd, ... ○ Pull request for Kafka protocol support in Envoy / Istio accepted recently
  • 16. 16 Evolution of Kafka DevOps Shell scripts Ansible/Chef Docker Kubernetes
  • 17. 17 Agenda ● Cloud Native vs. SaaS / Serverless Kafka ● The Emergence of Kubernetes ● Kafka on K8s Deployment Challenges ● Confluent Operator as Kafka Operator ● Q&A Confluent Operator
  • 18. 18 Kafkaesque world of Kafka on Kubernetes
  • 19. 19 Kafka on Kubernetes – It’s tricky © • Translating an existing architecture to Kubernetes • Failover handling and data balancing • Communication between ZooKeeper, Kafka Brokers, Clients (Java, REST, Connect, KSQL), Schema Registry, etc. • External access from / to outside Kubernetes cluster • Persistent storage options on prem and in the cloud • Security configuration • Rolling upgrades • Etc.
  • 20. 20 Agenda ● Cloud Native vs. SaaS / Serverless Kafka ● The Emergence of Kubernetes ● Kafka on K8s Deployment Challenges ● Confluent Operator as Kafka Operator ● Q&A Confluent Operator
  • 21. ● Global-scale ● Real-time ● Persistent Storage ● Stream Processing Apache Kafka: The De-facto Standard for Real-Time Event Streaming Edge Cloud Data LakeDatabases Datacenter IoT SaaS AppsMobile Microservices Machine Learning Apache Kafka
  • 22. 22 Real-Time Inventory Real-Time Fraud Detection Real-Time Customer 360 Machine Learning Models Real-Time Data Transformation ... Contextual Event-Driven Applications Universal Event Pipeline Data Stores Logs 3rd Party Apps Custom Apps/Microservices TREAMSSTREAMS CONNECT CLIENTS
  • 23. 23Confluent establishes Freedom of Choice ● SaaS or Self-Managed or Hybrid ● Confluent’s vision is to introduce cloud native capabilities to Confluent Platform and enable users who want a cloud native experience in on-premises and self- managed cloud environments ● Introducing Confluent Operator
  • 24. 24 Confluent Platform Operations and Security Development & Stream Processing Support,services,training&partners Apache Kafka Security plugins | Role-Based Access Control Control Center | Replicator | Auto Data Balancer | Operator Connectors Clients | REST Proxy MQTT Proxy | Schema Registry KSQL Connect Continuous Commit Log Streams Complete Event Streaming Platform Mission-critical Reliability Freedom of Choice Datacenter Public Cloud Confluent Cloud Self-Managed Software Fully-Managed Service
  • 25. 25 Confluent Operator ● Deployment and management automation for Confluent Platform on Kubernetes ● Including Apache Kafka, Zookeeper, Schema Registry, Connect, Control Center, Replicator, KSQL ● For organizations standardized on Kubernetes as platform runtime ● Operationalizes years of experience running Kafka on Kubernetes on the leading public clouds Confluent Platform Confluent Operator Kubernetes AWS Azure GCP RH OpenShift Mesosphere Pivotal On-Premises Cloud Docker Images Automate Deployment of Confluent Platform on Kubernetes on Any Platform at Any Scale
  • 26. 26 Confluent Operator enables you to: Automate provisioning of Kafka pods in minutes Monitor SLAs through Confluent Control Center or Prometheus Scale Kafka elastically & Automate rolling updates Built on our first hand knowledge of running Confluent at scale Cloud-Native Deployment of Kafka and Confluent Platform
  • 27. Confluent’s Kubernetes Journey 05/2017 Confluent Cloud Early Access 2016 Confluent Cloud Development 11/2017 Confluent Cloud GA (AWS) 07/2019 Confluent Operator GA (Confluent Platform) 2019 Confluent Cloud GA on AWS, GCP, Azure
  • 28. 28 Confluent Operator: A custom Kubernetes Controller API Server Scheduler Controllers & Custom Controllers PODS PVs ConfigMaps StatefulSets Nodes and pods are where Applications run on Kubernetes Applications use objects like StatefulSets, Configmaps, PVs Custom Controllers create custom resources that provide unique application functionality: ● Upgrades, Elasticity, Kafka Operational Logic Custom Resources Master Node Worker Nodes
  • 29. 29Helm - The Confluent Operator Package Manager ● Confluent Operator leverages Helm Charts to deploy, upgrade and uninstall Confluent Platform custom resources and pods ● Configuration front end for users to specify how a Confluent Platform Cluster is deployed: ○ # of replicas for Kafka, Zookeeper ○ Security and Authentication configuration ○ Persistent Storage configuration ● Cluster configuration edits are also performed using Helm Operator Helm Charts - yaml
  • 30. 30 Kubernetes Cluster K8 NodeK8 NodeK8 Node Replicator Pod C3 Pod SR Pod K8 Node Confluent Operator Deployment Operator Kafka Pod ZK Pod Persistent Volumes (AWS EBS, GCE Persistent Disk, Local Persistent Volume, etc.) External Access Load Balancers Configurations ConfigMapsKSQL Pod REST Proxy Pod
  • 31. 31 Confluent Operator - Automated Provisioning
  • 32. 32 Confluent Operator - Automated Security Configuration SASL PLAIN, SASL_SSL, TLS with Mutual Authentication Automate configuration of truststores and keystores with secret objects Automate configuration of Kafka and all Confluent Platform Components
  • 33. 33 Confluent Operator - Scale Horizontally Automate Scaling: Spin up new brokers, connect workers easily Distribute partitions to new brokers: Determine balancing plan Execute balancing plan Monitor Resources
  • 34. 34 Confluent Operator - Rolling Upgrade of all components Automated Rolling Upgrades of all components - Kafka Brokers, Zookeeper, Connect, Control Center Kafka Broker Upgrades: 1. Stop the broker, upgrade Kafka 2. Wait for Partition Leader reassignment 3. Start the upgraded broker 4. Wait for zero under-replicated partitions 5. Upgrade the next broker
  • 35. Agenda ● Cloud Native vs. SaaS / Serverless Kafka ● The Emergence of Kubernetes ● Kafka on K8s Deployment Challenges ● Confluent Operator as Kafka Operator ● Q&A Confluent Operator