Frank McSherry, Chief Scientist
How to use Standard SQL over Kafka
From the basics to advanced use cases
You can write SQL and make
streaming infrastructure
A brief history of SQL systems
(spoiler: it ends up with streams)
OLTP: Reads, Writes, Transactions, Analytics, Dashboards, Monitoring
OLTP (row-based): Reads, Writes, Transactions
OLAP (columnar): Analytics, Dashboards, Monitoring
different designs
OLTP (pull): Reads, Writes, Transactions
OLVM (push): Analytics, Dashboards, Monitoring
different designs
[Diagram: OLTP, Kafka, Analytics, Dashboards, Monitoring]
How to use Standard SQL over Kafka: From the basics to advanced use cases | Frank McSherry, Materialize
Stream connection 1/3:
SQL systems can have
streaming internals
Stream connection 2/3:
SQL systems can consume
streams as inputs
Stream connection 3/3:
SQL systems can produce
streams as outputs
sensors
readings
max_temp_by_region
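The three names on this slide read as two inputs and one derived view. A minimal sketch of that shape follows, with SQLite swapped in so the SQL runs anywhere. The column names (id, region, sensor_id, temp) are assumptions, since the slide gives only the relation names, and SQLite recomputes the view on each read rather than maintaining it as a stream.

```python
import sqlite3

# Hypothetical schema: the slide names the relations but not their columns.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE sensors  (id INTEGER PRIMARY KEY, region TEXT);
    CREATE TABLE readings (sensor_id INTEGER, temp REAL);

    -- Ordinary SQL view: the hottest reading seen in each region.
    CREATE VIEW max_temp_by_region AS
    SELECT s.region, MAX(r.temp) AS max_temp
    FROM readings AS r
    JOIN sensors  AS s ON s.id = r.sensor_id
    GROUP BY s.region;
""")

conn.executemany("INSERT INTO sensors VALUES (?, ?)",
                 [(1, "east"), (2, "east"), (3, "west")])
conn.executemany("INSERT INTO readings VALUES (?, ?)",
                 [(1, 20.5), (2, 23.0), (3, 18.0), (1, 21.5)])


def max_temp_by_region():
    """Return the view contents as a dict of region -> max temperature."""
    return dict(conn.execute("SELECT region, max_temp FROM max_temp_by_region"))


print(max_temp_by_region())
```

The view definition is the part that would carry over to a streaming SQL system; the inserts stand in for records arriving on Kafka topics.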
Bonus Features 1/3:
SQL over unbounded data
This could be unbounded
This is all we need to store
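One way to read the two captions above: the input stream may never end, but a per-region maximum needs only one stored value per region. A minimal sketch of that idea, assuming an append-only stream of (region, temp) pairs; the function name and tuple layout are hypothetical, and supporting retractions would need more state than this.

```python
from typing import Dict, Iterable, Iterator, Tuple


def max_temp_updates(
    readings: Iterable[Tuple[str, float]]
) -> Iterator[Tuple[str, float]]:
    """Consume readings one at a time and yield (region, new_max) on each change."""
    state: Dict[str, float] = {}  # one entry per region, not one per reading
    for region, temp in readings:
        # The reading is discarded as soon as the running maximum is updated.
        if region not in state or temp > state[region]:
            state[region] = temp
            yield region, temp


stream = [("east", 20.5), ("east", 23.0), ("west", 18.0), ("east", 21.5)]
updates = list(max_temp_updates(stream))
print(updates)
```

Because the function is a generator, `readings` can itself be an unbounded iterator; memory grows with the number of regions, not with the number of readings.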
Bonus Features 2/3:
Focus on recent events.
Like now() but for "event time"
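A rough sketch of what filtering on an event-time "now" could mean: the clock is driven by the timestamps in the data rather than by the wall clock, and rows leave the result once the clock passes their expiry. The class and method names are hypothetical, and this is an illustration of the idea rather than how any particular system implements it.

```python
from typing import List, Tuple


class RecentEvents:
    """Keep events satisfying: logical_now < event_time + window."""

    def __init__(self, window: int) -> None:
        self.window = window
        self.now = 0  # logical clock: the largest event time seen so far
        self.events: List[Tuple[int, str]] = []

    def advance(self, now: int) -> None:
        """Move the logical clock forward and drop events that fell out of the window."""
        self.now = max(self.now, now)
        self.events = [(t, p) for (t, p) in self.events
                       if self.now < t + self.window]

    def insert(self, event_time: int, payload: str) -> None:
        """Add an event; its timestamp may also advance the logical clock."""
        self.events.append((event_time, payload))
        self.advance(event_time)

    def contents(self) -> List[str]:
        return [p for _, p in self.events]


recent = RecentEvents(window=10)
recent.insert(1, "a")
recent.insert(5, "b")
recent.insert(12, "c")   # clock moves to 12, so "a" (expiry 11) is dropped
print(recent.contents())
```

Nothing here reads the system clock, so replaying the same events always gives the same contents.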
Bonus Features 3/3:
Event-driven SQL queries
For each query in queries ...
... restrict by the query argument ...
... keep the top three regions.
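The three annotations above describe a per-row subquery: each row of a queries relation drives its own filtered top-three lookup. Below is a minimal sketch using SQLite and a correlated subquery. The table region_temps, the argument column min_temp, and the sample data are all assumptions made for illustration; the slide does not say what the query argument is. Systems with LATERAL joins can express the same thing more directly.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical tables: per-region maxima, and queries that arrive as rows.
    CREATE TABLE region_temps (region TEXT PRIMARY KEY, max_temp REAL);
    CREATE TABLE queries      (id INTEGER PRIMARY KEY, min_temp REAL);
""")
conn.executemany("INSERT INTO region_temps VALUES (?, ?)",
                 [("east", 23.0), ("west", 18.0), ("north", 30.0),
                  ("south", 25.0), ("central", 15.0)])
conn.executemany("INSERT INTO queries VALUES (?, ?)", [(1, 0.0), (2, 24.0)])

TOP_REGIONS_SQL = """
    SELECT q.id, m.region, m.max_temp
    FROM queries AS q, region_temps AS m          -- for each query in queries ...
    WHERE m.region IN (
        SELECT region FROM region_temps
        WHERE max_temp >= q.min_temp              -- ... restrict by the query argument ...
        ORDER BY max_temp DESC, region
        LIMIT 3                                   -- ... keep the top three regions.
    )
    ORDER BY q.id, m.max_temp DESC, m.region
"""


def top_regions():
    """Return (query id, region, max_temp) rows for every pending query."""
    return conn.execute(TOP_REGIONS_SQL).fetchall()


for row in top_regions():
    print(row)
```

Inserting a row into queries is the "event": the next evaluation of the same SQL includes answers for it, with no new query text to plan.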
You can write SQL and make
streaming infrastructure
Streams and SQL combine as
inputs flow to systems flow to outputs
Streams and SQL lead to new idioms
working with unbounded data,
focusing on recent events,
event-driven SQL queries,
... and more. (seriously)
Head to https://materialize.com/kafka
for free shirts, cloud sign-up, and a guide to Kafka.
- @frankmcsherry
"best DB shirt released in 2020
(great design, soft material)"
@andy_pavlo
