SlideShare a Scribd company logo
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight
Reliable open-source
99.9% availability SLA
Monitoring
(OMS)
Visual Studio, IntelliJ and Eclipse support for
developers and data scientists
Enterprise grade Security Kerberos
Apache Ranger
Install & use big data applications
Azure Marketplace
Azure
HDInsight
Cloud Spark and Hadoop
service for your enterprise
(Spark, Hive, MR, LLAP,
Kafka, HBase, Storm)
*IDC study “The Business Value and TCO Advantage of Apache Hadoop in the Cloud with Microsoft Azure HDInsight”
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight
and many more…
• Managed Kafka clusters with 99.9% service level
SLA
• Native integration with Azure Managed Disks.
Allows for exponentially lower costs, and higher
scale.
• Scalable On Demand clusters - Kafka clusters
with 16 TB/node and Zookeeper up and running
in 15 minutes
• Rack awareness for Kafka on the Azure cloud
• Alerting and predictive cluster maintenance
through Azure Operations Management Suite
• Extensibility via one click deploy of leading ISVs
such as StreamSets
• Disaster recovery support via MirrorMaker
• Deploy End to End streaming pipelines with
Storm, Spark, Storage via automated ARM
templates in the same VNET.
Modern Data Warehouse: Real-time analytics
Unstructured data
Azure storage
Azure HDInsight (LLAP)
Azure HDInsight
(Kafka)
Analytic Dashboards
Model & ServePrep & TrainStoreIngest Intelligence
SQL DW
Azure Databricks
(Spark)
Azure HDInsight
(Spark)
Kafka is a distributed, horizontally-scalable, fault-tolerant pub-sub store
Broker 1
Producer 1
IoT Hub
Storm
Spark
Streaming
1
2
3
ZK 1 ZK 2 ZK 3
Broker 2
Broker 3
3
1
2
Topic 1
Topic 2 Topic 1
Topic 2
Topic 2
Topic 1
4 5
Setup the broker
configuration
Publish the
message
The consumer
reads the messages
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight
Azure
Gateway
Services
Open source Stream Processing on Azure HDInsight
Real-time applications
Long term storage
Real-time dashboards
IoT Hubs
Azure VNet Boundary
Siphon on HDInsight Kafka 8 million
EVENTS PER SECOND PEAK INGRESS
800 TB (10 GB per Sec)
INGRESS PER DAY
1,800; 450
PRODUCTION KAFKA BROKERS; TOPICS
15 Sec
99th PERCENTILE LATENCY
KEY CUSTOMER
SCENARIOS
Ads Monetization (Fast BI)
O365 Customer Fabric NRT – Tenant & User insights
BingNRT Operational Intelligence
Presto (Fast SML) interactive analysis
Delve Analytics
0
5
10
15
20
25
30
35
40
45
Jan-15
Feb-15
Mar-15
Apr-15
May-15
Jun-15
Jul-15
Aug-15
Sep-15
Oct-15
Nov-15
Dec-15
Jan-16
Feb-16
Mar-16
Apr-16
May-16
Jun-16
Jul-16
Aug-16
Sep-16
Oct-16
Nov-16
Dec-16
Throughput(inGBps)
Siphon Data Volume (Ingress and Egress)
Volume published (GBps) Volume subscribed (GBps)
0
5
10
15
20
25
Jan-15
Feb-15
Mar-15
Apr-15
May-15
Jun-15
Jul-15
Aug-15
Sep-15
Oct-15
Nov-15
Dec-15
Jan-16
Feb-16
Mar-16
Apr-16
May-16
Jun-16
Jul-16
Aug-16
Sep-16
Oct-16
Nov-16
Dec-16
Throughput(eventspersec)Millions
Siphon Events per second (Ingress and Egress)
EPS In Eps Out
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight
Getting Started with Kafka for HDInsight
Structured Streaming with HDInsight Kafka and Spark
Deploy HDInsight Kafka + Storm
Stream data from on-premise to HDInsight Kafka in the cloud
https://academy.microsoft.com/en-us/professional-program/big-data/
https://www.pluralsight.com/courses/spark-kafka-cassandra-applying-lambda-architecture
https://azure.microsoft.com/en-us/blog/announcing-apache-kafka-for-azure-
hdinsight-general-availability/
https://azure.microsoft.com/en-us/blog/announcing-public-preview-of-apache-kafka-
on-hdinsight-with-azure-managed-disks/
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight
https://docs.microsoft.com/en-us/azure/hdinsight/hdinsight-apache-kafka-high-
availability
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight
Costin$
Throughput MBps
Kafka Cost Estimator
Non Managed Disks Managed Disks
#KAFKANODES
THROUGHPUT MBPS
Kafka scale forecast
Kafka nodes (OS VHDs) Kafka nodes (managed disks)
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight
https://docs.microsoft.com/en-us/azure/hdinsight/hdinsight-apache-kafka-mirroring
https://docs.microsoft.com/en-us/azure/hdinsight/hdinsight-apache-kafka-connect-vpn-gateway
https://docs.microsoft.com/en-us/azure/hdinsight/hdinsight-apache-kafka-connect-vpn-gateway
Azure VNet Boundary
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight
Message Rate 10,000 messages/sec
Message size 150 KB upperbound
Replica count 3
Retention Policy 12 hours
Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight

More Related Content

What's hot (20)

PDF
Apache Spark At Scale in the Cloud
Databricks
 
PPTX
Apache kafka
Kumar Shivam
 
PDF
Fundamentals of Apache Kafka
Chhavi Parasher
 
PDF
Oracle Clusterware Node Management and Voting Disks
Markus Michalewicz
 
PDF
Change Data Feed in Delta
Databricks
 
PPTX
Introduction to Kafka
Akash Vacher
 
PDF
Apache Kafka Introduction
Amita Mirajkar
 
PPTX
Cloudera - The Modern Platform for Analytics
Cloudera, Inc.
 
PPTX
Apache kafka
Viswanath J
 
PPTX
LSM Trees
Chris Lohfink
 
PDF
Anatomy of Data Frame API : A deep dive into Spark Data Frame API
datamantra
 
PPTX
Programming in Spark using PySpark
Mostafa
 
PDF
Oracle Active Data Guard: Best Practices and New Features Deep Dive
Glen Hawkins
 
PDF
Oracle RAC 19c and Later - Best Practices #OOWLON
Markus Michalewicz
 
PDF
Apache Kafka Architecture & Fundamentals Explained
confluent
 
PPTX
Azure Data Storage
Ken Cenerelli
 
PPTX
Cassandra
Upaang Saxena
 
PDF
Introduction to Spark with Python
Gokhan Atil
 
PPTX
RocksDB detail
MIJIN AN
 
Apache Spark At Scale in the Cloud
Databricks
 
Apache kafka
Kumar Shivam
 
Fundamentals of Apache Kafka
Chhavi Parasher
 
Oracle Clusterware Node Management and Voting Disks
Markus Michalewicz
 
Change Data Feed in Delta
Databricks
 
Introduction to Kafka
Akash Vacher
 
Apache Kafka Introduction
Amita Mirajkar
 
Cloudera - The Modern Platform for Analytics
Cloudera, Inc.
 
Apache kafka
Viswanath J
 
LSM Trees
Chris Lohfink
 
Anatomy of Data Frame API : A deep dive into Spark Data Frame API
datamantra
 
Programming in Spark using PySpark
Mostafa
 
Oracle Active Data Guard: Best Practices and New Features Deep Dive
Glen Hawkins
 
Oracle RAC 19c and Later - Best Practices #OOWLON
Markus Michalewicz
 
Apache Kafka Architecture & Fundamentals Explained
confluent
 
Azure Data Storage
Ken Cenerelli
 
Cassandra
Upaang Saxena
 
Introduction to Spark with Python
Gokhan Atil
 
RocksDB detail
MIJIN AN
 

Similar to Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight (20)

PDF
Introduction to apache kafka, confluent and why they matter
Paolo Castagna
 
PDF
StreamAnalytix - Multi-Engine Streaming Analytics Platform
Atul Sharma
 
PDF
Fom io t_to_bigdata_step_by_step-final
Luis Filipe Silva
 
PDF
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
Kai Wähner
 
PDF
Devoxx university - Kafka de haut en bas
Florent Ramiere
 
PDF
DS_2016_StreamAnalytix_real_time_streaming_analytics_platform
Aditya Singh
 
PDF
Reinventing Kafka in the Data Streaming Era - Jun Rao
confluent
 
PPTX
Webinar: Data Streaming with Apache Kafka & MongoDB
MongoDB
 
PPTX
Data Streaming with Apache Kafka & MongoDB - EMEA
Andrew Morgan
 
PDF
Budapest Data/ML - Building Modern Data Streaming Apps with NiFi, Flink and K...
Timothy Spann
 
PDF
Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...
HostedbyConfluent
 
PPTX
Data Streaming with Apache Kafka & MongoDB
confluent
 
PDF
Hybrid Kafka, Taking Real-time Analytics to the Business (Cody Irwin, Google ...
HostedbyConfluent
 
PDF
OpenStack for VMware Administrators
Trevor Roberts Jr.
 
PDF
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...
Kai Wähner
 
PPTX
IoT and Event Streaming at Scale with Apache Kafka
confluent
 
PDF
Kafka for Real-Time Replication between Edge and Hybrid Cloud
Kai Wähner
 
PDF
Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...
confluent
 
PDF
The Top 5 Event Streaming Use Cases & Architectures in 2021
confluent
 
PDF
Top 5 Event Streaming Use Cases for 2021 with Apache Kafka
Kai Wähner
 
Introduction to apache kafka, confluent and why they matter
Paolo Castagna
 
StreamAnalytix - Multi-Engine Streaming Analytics Platform
Atul Sharma
 
Fom io t_to_bigdata_step_by_step-final
Luis Filipe Silva
 
Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apac...
Kai Wähner
 
Devoxx university - Kafka de haut en bas
Florent Ramiere
 
DS_2016_StreamAnalytix_real_time_streaming_analytics_platform
Aditya Singh
 
Reinventing Kafka in the Data Streaming Era - Jun Rao
confluent
 
Webinar: Data Streaming with Apache Kafka & MongoDB
MongoDB
 
Data Streaming with Apache Kafka & MongoDB - EMEA
Andrew Morgan
 
Budapest Data/ML - Building Modern Data Streaming Apps with NiFi, Flink and K...
Timothy Spann
 
Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...
HostedbyConfluent
 
Data Streaming with Apache Kafka & MongoDB
confluent
 
Hybrid Kafka, Taking Real-time Analytics to the Business (Cody Irwin, Google ...
HostedbyConfluent
 
OpenStack for VMware Administrators
Trevor Roberts Jr.
 
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...
Kai Wähner
 
IoT and Event Streaming at Scale with Apache Kafka
confluent
 
Kafka for Real-Time Replication between Edge and Hybrid Cloud
Kai Wähner
 
Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...
confluent
 
The Top 5 Event Streaming Use Cases & Architectures in 2021
confluent
 
Top 5 Event Streaming Use Cases for 2021 with Apache Kafka
Kai Wähner
 
Ad

More from Microsoft Tech Community (20)

PPTX
100 ways to use Yammer
Microsoft Tech Community
 
PPTX
10 Yammer Group Suggestions
Microsoft Tech Community
 
PPTX
Removing Security Roadblocks to IoT Deployment Success
Microsoft Tech Community
 
PPTX
Building mobile apps with Visual Studio and Xamarin
Microsoft Tech Community
 
PPTX
Best practices with Microsoft Graph: Making your applications more performant...
Microsoft Tech Community
 
PPTX
Interactive emails in Outlook with Adaptive Cards
Microsoft Tech Community
 
PPTX
Unlocking security insights with Microsoft Graph API
Microsoft Tech Community
 
PPTX
Break through the serverless barriers with Durable Functions
Microsoft Tech Community
 
PPTX
Multiplayer Server Scaling with Azure Container Instances
Microsoft Tech Community
 
PPTX
Explore Azure Cosmos DB
Microsoft Tech Community
 
PPTX
Media Streaming Apps with Azure and Xamarin
Microsoft Tech Community
 
PPTX
DevOps for Data Science
Microsoft Tech Community
 
PPTX
Real-World Solutions with PowerApps: Tips & tricks to manage your app complexity
Microsoft Tech Community
 
PPTX
Azure Functions and Microsoft Graph
Microsoft Tech Community
 
PPTX
Getting Started with Visual Studio Tools for AI
Microsoft Tech Community
 
PPTX
Using AML Python SDK
Microsoft Tech Community
 
PPTX
Mobile Workforce Location Tracking with Bing Maps
Microsoft Tech Community
 
PPTX
Cognitive Services Labs in action Anomaly detection
Microsoft Tech Community
 
PPTX
Speech Devices SDK
Microsoft Tech Community
 
PPTX
LinkedIn Learning presents: Securing web applications in ASP.NET Core 2.1
Microsoft Tech Community
 
100 ways to use Yammer
Microsoft Tech Community
 
10 Yammer Group Suggestions
Microsoft Tech Community
 
Removing Security Roadblocks to IoT Deployment Success
Microsoft Tech Community
 
Building mobile apps with Visual Studio and Xamarin
Microsoft Tech Community
 
Best practices with Microsoft Graph: Making your applications more performant...
Microsoft Tech Community
 
Interactive emails in Outlook with Adaptive Cards
Microsoft Tech Community
 
Unlocking security insights with Microsoft Graph API
Microsoft Tech Community
 
Break through the serverless barriers with Durable Functions
Microsoft Tech Community
 
Multiplayer Server Scaling with Azure Container Instances
Microsoft Tech Community
 
Explore Azure Cosmos DB
Microsoft Tech Community
 
Media Streaming Apps with Azure and Xamarin
Microsoft Tech Community
 
DevOps for Data Science
Microsoft Tech Community
 
Real-World Solutions with PowerApps: Tips & tricks to manage your app complexity
Microsoft Tech Community
 
Azure Functions and Microsoft Graph
Microsoft Tech Community
 
Getting Started with Visual Studio Tools for AI
Microsoft Tech Community
 
Using AML Python SDK
Microsoft Tech Community
 
Mobile Workforce Location Tracking with Bing Maps
Microsoft Tech Community
 
Cognitive Services Labs in action Anomaly detection
Microsoft Tech Community
 
Speech Devices SDK
Microsoft Tech Community
 
LinkedIn Learning presents: Securing web applications in ASP.NET Core 2.1
Microsoft Tech Community
 
Ad

Recently uploaded (20)

PDF
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
PDF
Biography of Daniel Podor.pdf
Daniel Podor
 
PDF
Mastering Financial Management in Direct Selling
Epixel MLM Software
 
PDF
July Patch Tuesday
Ivanti
 
PDF
HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...
mcastillo49
 
PDF
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
PDF
How Startups Are Growing Faster with App Developers in Australia.pdf
India App Developer
 
PDF
From Code to Challenge: Crafting Skill-Based Games That Engage and Reward
aiyshauae
 
PDF
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
PDF
Presentation - Vibe Coding The Future of Tech
yanuarsinggih1
 
PDF
Chris Elwell Woburn, MA - Passionate About IT Innovation
Chris Elwell Woburn, MA
 
PPTX
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
PDF
NewMind AI - Journal 100 Insights After The 100th Issue
NewMind AI
 
PDF
New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PDF
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PDF
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PDF
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
PDF
Blockchain Transactions Explained For Everyone
CIFDAQ
 
PDF
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
Biography of Daniel Podor.pdf
Daniel Podor
 
Mastering Financial Management in Direct Selling
Epixel MLM Software
 
July Patch Tuesday
Ivanti
 
HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...
mcastillo49
 
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
How Startups Are Growing Faster with App Developers in Australia.pdf
India App Developer
 
From Code to Challenge: Crafting Skill-Based Games That Engage and Reward
aiyshauae
 
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
Presentation - Vibe Coding The Future of Tech
yanuarsinggih1
 
Chris Elwell Woburn, MA - Passionate About IT Innovation
Chris Elwell Woburn, MA
 
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
NewMind AI - Journal 100 Insights After The 100th Issue
NewMind AI
 
New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
Blockchain Transactions Explained For Everyone
CIFDAQ
 
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 

Ingestion in data pipelines with Managed Kafka Clusters in Azure HDInsight

  • 4. Reliable open-source 99.9% availability SLA Monitoring (OMS) Visual Studio, IntelliJ and Eclipse support for developers and data scientists Enterprise grade Security Kerberos Apache Ranger Install & use big data applications Azure Marketplace Azure HDInsight Cloud Spark and Hadoop service for your enterprise (Spark, Hive, MR, LLAP, Kafka, HBase, Storm) *IDC study “The Business Value and TCO Advantage of Apache Hadoop in the Cloud with Microsoft Azure HDInsight”
  • 7. • Managed Kafka clusters with 99.9% service level SLA • Native integration with Azure Managed Disks. Allows for exponentially lower costs, and higher scale. • Scalable On Demand clusters - Kafka clusters with 16 TB/node and Zookeeper up and running in 15 minutes • Rack awareness for Kafka on the Azure cloud • Alerting and predictive cluster maintenance through Azure Operations Management Suite • Extensibility via one click deploy of leading ISVs such as StreamSets • Disaster recovery support via MirrorMaker • Deploy End to End streaming pipelines with Storm, Spark, Storage via automated ARM templates in the same VNET.
  • 8. Modern Data Warehouse: Real-time analytics Unstructured data Azure storage Azure HDInsight (LLAP) Azure HDInsight (Kafka) Analytic Dashboards Model & ServePrep & TrainStoreIngest Intelligence SQL DW Azure Databricks (Spark) Azure HDInsight (Spark)
  • 9. Kafka is a distributed, horizontally-scalable, fault-tolerant pub-sub store Broker 1 Producer 1 IoT Hub Storm Spark Streaming 1 2 3 ZK 1 ZK 2 ZK 3 Broker 2 Broker 3 3 1 2 Topic 1 Topic 2 Topic 1 Topic 2 Topic 2 Topic 1
  • 10. 4 5 Setup the broker configuration Publish the message The consumer reads the messages
  • 12. Azure Gateway Services Open source Stream Processing on Azure HDInsight Real-time applications Long term storage Real-time dashboards IoT Hubs Azure VNet Boundary
  • 13. Siphon on HDInsight Kafka 8 million EVENTS PER SECOND PEAK INGRESS 800 TB (10 GB per Sec) INGRESS PER DAY 1,800; 450 PRODUCTION KAFKA BROKERS; TOPICS 15 Sec 99th PERCENTILE LATENCY KEY CUSTOMER SCENARIOS Ads Monetization (Fast BI) O365 Customer Fabric NRT – Tenant & User insights BingNRT Operational Intelligence Presto (Fast SML) interactive analysis Delve Analytics 0 5 10 15 20 25 30 35 40 45 Jan-15 Feb-15 Mar-15 Apr-15 May-15 Jun-15 Jul-15 Aug-15 Sep-15 Oct-15 Nov-15 Dec-15 Jan-16 Feb-16 Mar-16 Apr-16 May-16 Jun-16 Jul-16 Aug-16 Sep-16 Oct-16 Nov-16 Dec-16 Throughput(inGBps) Siphon Data Volume (Ingress and Egress) Volume published (GBps) Volume subscribed (GBps) 0 5 10 15 20 25 Jan-15 Feb-15 Mar-15 Apr-15 May-15 Jun-15 Jul-15 Aug-15 Sep-15 Oct-15 Nov-15 Dec-15 Jan-16 Feb-16 Mar-16 Apr-16 May-16 Jun-16 Jul-16 Aug-16 Sep-16 Oct-16 Nov-16 Dec-16 Throughput(eventspersec)Millions Siphon Events per second (Ingress and Egress) EPS In Eps Out
  • 17. Getting Started with Kafka for HDInsight Structured Streaming with HDInsight Kafka and Spark Deploy HDInsight Kafka + Storm Stream data from on-premise to HDInsight Kafka in the cloud https://academy.microsoft.com/en-us/professional-program/big-data/ https://www.pluralsight.com/courses/spark-kafka-cassandra-applying-lambda-architecture https://azure.microsoft.com/en-us/blog/announcing-apache-kafka-for-azure- hdinsight-general-availability/ https://azure.microsoft.com/en-us/blog/announcing-public-preview-of-apache-kafka- on-hdinsight-with-azure-managed-disks/
  • 24. Costin$ Throughput MBps Kafka Cost Estimator Non Managed Disks Managed Disks #KAFKANODES THROUGHPUT MBPS Kafka scale forecast Kafka nodes (OS VHDs) Kafka nodes (managed disks)
  • 30. Message Rate 10,000 messages/sec Message size 150 KB upperbound Replica count 3 Retention Policy 12 hours

Editor's Notes