0% found this document useful (0 votes)

52 views

Stream Processing at Lyft

Lyft has built a streaming platform using Apache Flink for stream processing and Apache Kafka for messaging. The goals were to make building real-time microservices easy and solve stream processing problems once for the company. Some open problems discussed include efficiently rescaling Kafka while preserving per-key ordering, enabling dynamic computations over streams, long-term storage for real-time and historical data access, and achieving zero downtime deployments for streaming services. Lyft is still working on these challenging problems and is hiring for engineers interested in streaming systems.

Uploaded by

Jamie Grier

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

52 views

Stream Processing at Lyft

Uploaded by

Jamie Grier

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 20

Streaming

Jamie Grier | @jgrier

1
Agenda

• Goals of Lyft’s Streaming Platform

• Streaming Platform Overview

• Why Flink

• Why Kafka

• Open problems

2
Goals of Lyft’s Streaming Platform

• Make it easy to build real-time, event-driven, stateful,

microservices

• Solve the hard parts of stream processing ONCE for the entire
company

• Be a force multiplier for other teams within Lyft

3
Streaming Platform Overview
Stream Compute
Pub/Sub Streaming Pub/Sub
Service One

Streaming
Service Two

Streaming
Service Three

Stream / Schema Deployment Metrics &

Alerts Logging
Registry Tooling Dashboards

Amazon Salt
Amazon S3 Wavefront Docker
EC2 (Conifg / Orca) 4
Lyft Streaming Platform - Streaming Compute Criteria

API Considerations: Operational Considerations

● Stateful Computation and Exactly-
● Functional / Fluent API
once Processing Semantics
● Flexible Windowing API ● Robust State Management
● Event Time Support ● Data Reprocessing (backfill)
● Asynchronous Checkpoints
● Apache Beam Support
● Back-pressure
● Stream SQL
● High throughput and low-latency
● Powerful Direct API ● Deployment Architecture
● Late Data Handling

The contenders: Apache Flink, Apache Spark Streaming, Apache Kafka Streams
5
Why Flink? API Considerations
• Functional / Fluent API
• Flexible Windowing API
• Event Time Support
• Apache Beam Support
• Stream SQL
• Powerful Direct API
• Late Data Handling

6
Why Flink? Operational Considerations

• Stateful Computation and Exactly-once Processing Semantics

• Robust State Management
• Stateful Data Reprocessing (backfill)
• Asynchronous Checkpoints
• Back-pressure
• High throughput and low-latency
• Deployment Architecture

7
Lyft Streaming Platform - Pub/Sub Criteria

Semantics / Features Operational Considerations

● Write Latency
● Durability
● Read Latency
● Consumer Fanout
● Project Maturity
● Transactions / Idempotent Writes ● Vendor Support
● Per-Key Ordering Guarantees
● Long-Term Data Storage
● Auto-Scaling

The contenders: Apache Kafka, Amazon Kinesis, Pravega

8
Why Kafka?
Pros
• Durability & Write Latency
• Read Latency & Consumer Fanout
• Transactions & Idempotent Writes
• Operational Concerns & Vendor Support
Cons
• No ordering by key, only partition
• Long term data storage still an issue
• Auto-Scaling still an issue

9
Open Problems

• Rescaling Kafka while preserving per-key ordering

• Efficient Dynamic Computations over streams

• Long term storage for events: real-time and historical reads

• Zero Downtime deployments for streaming services

10
Rescaling Kafka

• Rescaling Kafka while preserving per-key ordering

• Kafka only provides partition ordering guarantees!

• We want per-key ordering guarantees

• Guarantees should hold across re-partitioning events

• Basic approach: Read old partitions completely before reading

new

• Achieve this using something akin to Flink’s checkpoint 11

Rescaling Kafka while preserving per-key ordering

12
Rescaling Kafka while preserving per-key ordering

13
Efficient Dynamic Computation Over Streams

• Enable many users to dynamically submit small streaming

computations

• Share bandwidth amongst multiple computations

• Share computed sub-results amongst multiple computations

• Correctly handle bootstrapping of computations which

depend on historical data

• Basic approach: Map any computation into a fixed/general

14
Efficient Dynamic Computations over streams

15
Efficient Dynamic Computations over streams

16
Long term storage for events: Real-time and historical reads

17
Zero Downtime deployments for streaming services

18
Summary

• Lyft is building a next generation streaming platform based

on Apache Flink and Apache Kafka

• Stateful stream processing is not a “solved problem”

• There are many hard / open problems left to solve

• If these sort of problems interest you please come join us!

We’re Hiring!
19
Thank you!
Jamie Grier

Iti Pdfs
No ratings yet
Iti Pdfs
10 pages
De Mod 5 Deploy Workloads With Databricks Workflows
No ratings yet
De Mod 5 Deploy Workloads With Databricks Workflows
19 pages
Material For Student RWVCPC V012021A EN
No ratings yet
Material For Student RWVCPC V012021A EN
70 pages
Load Balancing in Oracle RAC 11GR2
No ratings yet
Load Balancing in Oracle RAC 11GR2
3 pages
Jakarta Struts: An MVC Framework: Overview Installation and Setup Overview, Installation, and Setup
No ratings yet
Jakarta Struts: An MVC Framework: Overview Installation and Setup Overview, Installation, and Setup
17 pages
2016 05 10 Apache Nifi Deep Dive 160511170654
No ratings yet
2016 05 10 Apache Nifi Deep Dive 160511170654
34 pages
Mastering Kafka Streams: From Basics to Expert Proficiency
From Everand
Mastering Kafka Streams: From Basics to Expert Proficiency
William Smith
No ratings yet
01-Docker - 02 - Install Docker Desktop on Windows (1)
No ratings yet
01-Docker - 02 - Install Docker Desktop on Windows (1)
6 pages
Set Your Data in Motion
No ratings yet
Set Your Data in Motion
8 pages
3 Lecture 3-ETL
100% (1)
3 Lecture 3-ETL
42 pages
Move The Data That Moves Your Business: Attunity Replicate
No ratings yet
Move The Data That Moves Your Business: Attunity Replicate
2 pages
AWS Athena Knowledgebase
No ratings yet
AWS Athena Knowledgebase
4 pages
(English (Auto-Generated) ) Building End-to-End Delta Pipelines On GCP (DownSub - Com)
No ratings yet
(English (Auto-Generated) ) Building End-to-End Delta Pipelines On GCP (DownSub - Com)
24 pages
Slide 13 - Kafka
No ratings yet
Slide 13 - Kafka
109 pages
Installing Tivoli System Automation For High Availability of DB2 UDB BCU On AIX Redp4254
No ratings yet
Installing Tivoli System Automation For High Availability of DB2 UDB BCU On AIX Redp4254
14 pages
Low Level Design
No ratings yet
Low Level Design
23 pages
Matillion Optimizing Snowflake
No ratings yet
Matillion Optimizing Snowflake
23 pages
Polars Vs Pandas - Benchmarking Performances and Beyond - LinkedIn
No ratings yet
Polars Vs Pandas - Benchmarking Performances and Beyond - LinkedIn
12 pages
Nifi Expression Language Cheat Sheet
100% (1)
Nifi Expression Language Cheat Sheet
2 pages
Time Series Database
No ratings yet
Time Series Database
6 pages
Spark Optimizations & Deployment
No ratings yet
Spark Optimizations & Deployment
39 pages
Download Quick Start Guide to Large Language Models Second Edition Sinan Ozdemir ebook All Chapters PDF
100% (6)
Download Quick Start Guide to Large Language Models Second Edition Sinan Ozdemir ebook All Chapters PDF
81 pages
2018 02 08 Whats New in Apache Spark 2 180213220045
No ratings yet
2018 02 08 Whats New in Apache Spark 2 180213220045
57 pages
HDPDeveloper EnterpriseSpark1 StudentGuide
100% (1)
HDPDeveloper EnterpriseSpark1 StudentGuide
244 pages
Kudu
No ratings yet
Kudu
9 pages
Scaladayslambda Architecture Spark Cassandra Akka Kafka 150609194508 Lva1 App6891 PDF
No ratings yet
Scaladayslambda Architecture Spark Cassandra Akka Kafka 150609194508 Lva1 App6891 PDF
100 pages
Installing and Using Impala
No ratings yet
Installing and Using Impala
248 pages
Hive and Impala
No ratings yet
Hive and Impala
46 pages
Business Intelligence DW
No ratings yet
Business Intelligence DW
17 pages
Crossplane Overview
100% (1)
Crossplane Overview
2 pages
Neo4j-Manual-2 0 1
No ratings yet
Neo4j-Manual-2 0 1
593 pages
Data Ingestion Using Nifi: Quick Overview
No ratings yet
Data Ingestion Using Nifi: Quick Overview
24 pages
big_data_topic3_[spark]_[thanh_binh_nguyen].TextMark
No ratings yet
big_data_topic3_[spark]_[thanh_binh_nguyen].TextMark
60 pages
Qlik Sense Cmdlet For Powershell: Sokkorn Cheav
No ratings yet
Qlik Sense Cmdlet For Powershell: Sokkorn Cheav
11 pages
Maestro Job Schedular
No ratings yet
Maestro Job Schedular
648 pages
Talend Data Integration Basics
No ratings yet
Talend Data Integration Basics
3 pages
Operating System
No ratings yet
Operating System
60 pages
A Path To Event Sourcing With Amazon MSK - James Ousby
No ratings yet
A Path To Event Sourcing With Amazon MSK - James Ousby
42 pages
Graphite
No ratings yet
Graphite
195 pages
Introduction To Neo4j
No ratings yet
Introduction To Neo4j
14 pages
SQLGraph - When ClickHouse Marries Graph Processing Amoisbird PDF
0% (1)
SQLGraph - When ClickHouse Marries Graph Processing Amoisbird PDF
35 pages
Java Performance Tuning (Full Presentation) by Ender
No ratings yet
Java Performance Tuning (Full Presentation) by Ender
172 pages
Crunchy Postgresql High-Availability Suite Keeps Critical Applications Running
No ratings yet
Crunchy Postgresql High-Availability Suite Keeps Critical Applications Running
2 pages
04 - Google BigQuery Pricing
No ratings yet
04 - Google BigQuery Pricing
18 pages
Tableau CheatSheet Zep
No ratings yet
Tableau CheatSheet Zep
1 page
AI Infrastructure Ecosystem 2022
No ratings yet
AI Infrastructure Ecosystem 2022
100 pages
Sonar Qube
No ratings yet
Sonar Qube
46 pages
Qlik Sense Installation Guide
No ratings yet
Qlik Sense Installation Guide
63 pages
Hadoop Security S360 2015v8 PDF
No ratings yet
Hadoop Security S360 2015v8 PDF
27 pages
Jarupula Praveen
No ratings yet
Jarupula Praveen
7 pages
Course Outline Hadoop and Spark For Big Data and Data Science PDF
No ratings yet
Course Outline Hadoop and Spark For Big Data and Data Science PDF
4 pages
Apache Calcite Tutorial
No ratings yet
Apache Calcite Tutorial
83 pages
Anatomy of A Program in Memory
No ratings yet
Anatomy of A Program in Memory
19 pages
Unstructured Dataload Into Hive Database Through PySpark
No ratings yet
Unstructured Dataload Into Hive Database Through PySpark
9 pages
Oltp Olap Rtap
No ratings yet
Oltp Olap Rtap
53 pages
Argos 2015 11 25 - Couchbase Architecture
No ratings yet
Argos 2015 11 25 - Couchbase Architecture
31 pages
IBM Security Product Integration Reference
100% (1)
IBM Security Product Integration Reference
14 pages
Installing, Upgrading and Migrating To The Latest Release of SAS 9.4
No ratings yet
Installing, Upgrading and Migrating To The Latest Release of SAS 9.4
24 pages
Talend Open Studio For Data Integration: User Guide
No ratings yet
Talend Open Studio For Data Integration: User Guide
452 pages
Aws Redshift: Calculations Are Typically Executed On Small Number of Columns
No ratings yet
Aws Redshift: Calculations Are Typically Executed On Small Number of Columns
8 pages
CIS Distribution Independent Linux Benchmark v1.1.0
No ratings yet
CIS Distribution Independent Linux Benchmark v1.1.0
409 pages
PSPICE
100% (2)
PSPICE
9 pages
MIC TRAC - Gagemaker
No ratings yet
MIC TRAC - Gagemaker
4 pages
PH Dsi X
No ratings yet
PH Dsi X
72 pages
Sample Data
No ratings yet
Sample Data
4 pages
PMO - Netw Forensic - 130319 v2.2
No ratings yet
PMO - Netw Forensic - 130319 v2.2
12 pages
Using Swing Components
No ratings yet
Using Swing Components
330 pages
Change Over Socomec ATYS-PM
No ratings yet
Change Over Socomec ATYS-PM
2 pages
Wa0076
No ratings yet
Wa0076
2 pages
Procedure For Induction and Orientation Flow Chart
No ratings yet
Procedure For Induction and Orientation Flow Chart
2 pages
Ultraprolink For Apple: Vylis Dock 15
No ratings yet
Ultraprolink For Apple: Vylis Dock 15
7 pages
AN INSIGHT INTO CYBER SECURITY AND INTELLECTUAL PROPERTY RIGHTS
No ratings yet
AN INSIGHT INTO CYBER SECURITY AND INTELLECTUAL PROPERTY RIGHTS
158 pages
SPM 200
No ratings yet
SPM 200
33 pages
Siemens 7SJ4x Feeder PTT User Manual ENU
No ratings yet
Siemens 7SJ4x Feeder PTT User Manual ENU
5 pages
Application Note For Boiler Tube Failure Reduction
No ratings yet
Application Note For Boiler Tube Failure Reduction
4 pages
e_bw4hana214
No ratings yet
e_bw4hana214
4 pages
Solarsan en
No ratings yet
Solarsan en
6 pages
Data Loader
100% (1)
Data Loader
6 pages
Mitre Att&Ck Study Overview
100% (1)
Mitre Att&Ck Study Overview
12 pages
Configuring DNS and DHCP On Sophos Firewall
No ratings yet
Configuring DNS and DHCP On Sophos Firewall
12 pages
Linux Hands On PARAM Shavak
No ratings yet
Linux Hands On PARAM Shavak
37 pages
Maven
No ratings yet
Maven
2 pages
Tutorial 1 Question
No ratings yet
Tutorial 1 Question
2 pages
Edirectory - Support - How To Collect Edirectory LDAP Traces For Troubleshooting
No ratings yet
Edirectory - Support - How To Collect Edirectory LDAP Traces For Troubleshooting
2 pages
Datasheet: FUJITSU Image Scanner Fi-7600
No ratings yet
Datasheet: FUJITSU Image Scanner Fi-7600
2 pages
How To Uninstall Prime OS
No ratings yet
How To Uninstall Prime OS
16 pages
PNC380
No ratings yet
PNC380
4 pages
Beagle Esc 4
No ratings yet
Beagle Esc 4
84 pages
Controlling Children Using Computer
No ratings yet
Controlling Children Using Computer
2 pages
MM Extraction
No ratings yet
MM Extraction
5 pages

Stream Processing at Lyft

Uploaded by

Stream Processing at Lyft

Uploaded by

Streaming

Jamie Grier | @jgrier

• Goals of Lyft’s Streaming Platform

• Streaming Platform Overview

• Make it easy to build real-time, event-driven, stateful,

• Be a force multiplier for other teams within Lyft

Stream / Schema Deployment Metrics &

API Considerations: Operational Considerations

• Stateful Computation and Exactly-once Processing Semantics

Semantics / Features Operational Considerations

The contenders: Apache Kafka, Amazon Kinesis, Pravega

• Rescaling Kafka while preserving per-key ordering

• Efficient Dynamic Computations over streams

• Long term storage for events: real-time and historical reads

• Zero Downtime deployments for streaming services

• Rescaling Kafka while preserving per-key ordering

• Kafka only provides partition ordering guarantees!

• We want per-key ordering guarantees

• Guarantees should hold across re-partitioning events

• Basic approach: Read old partitions completely before reading

• Achieve this using something akin to Flink’s checkpoint 11

• Enable many users to dynamically submit small streaming

• Share bandwidth amongst multiple computations

• Share computed sub-results amongst multiple computations

• Correctly handle bootstrapping of computations which

• Basic approach: Map any computation into a fixed/general

• Lyft is building a next generation streaming platform based

• Stateful stream processing is not a “solved problem”

• There are many hard / open problems left to solve

• If these sort of problems interest you please come join us!

You might also like