SlideShare a Scribd company logo
© 2017 MapR TechnologiesMapR Confidential 1
Fast Cars, Big Data
How Streaming Can Help Formula1 ?
Tugdual Grall
@tgrall
© 2017 MapR Technologies@tgrall
{“about” : “me”}
Tugdual “Tug” Grall
• MapR : Technical Evangelist
• MongoDB, Couchbase, eXo, Oracle
• NantesJUG co-founder

• @tgrall
• http://tgrall.github.io
• tug@mapr.com / tugdual@gmail.com
© 2017 MapR Technologies@tgrall 3
Open Source Engines & Tools Commercial Engines & Applications
Utility-Grade Platform Services
DataProcessing
Web-Scale Storage
MapR-FS MapR-DB
Search and
Others
Real Time Unified Security Multi-tenancy Disaster Recovery Global NamespaceHigh Availability
MapR Streams
Cloud and
Managed
Services
Search and
Others
UnifiedManagementandMonitoring
Search and
Others
Event StreamingDatabase
Custom
Apps
MapR Converged Data Platform
© 2017 MapR Technologies@tgrall 4
Agenda
• What’s the point of data in motorsports?
• Demo
• Architecture
• What’s next?
• Q&A
© 2017 MapR Technologies@tgrall 5
What’s the point of data in motorsports?
© 2017 MapR Technologies@tgrall 6
© 2017 MapR Technologies@tgrall 7
Data in Motorsports
F1 Framework - http://f1framework.blogspot.de/2013/08/short-guide-to-f1-telemetry-spa-circuit.html
© 2017 MapR Technologies@tgrall 8
Race Strategy
F1 Framework - http://f1framework.blogspot.be/2013/08/race-strategy-explained.html
© 2017 MapR Technologies@tgrall 9
Got examples?
© 2017 MapR Technologies@tgrall 10
Got examples?
• Up to 300 sensors per car
© 2017 MapR Technologies@tgrall 11
Got examples?
• Up to 300 sensors per car
• Up to 2000 channels
© 2017 MapR Technologies@tgrall 12
Got examples?
• Up to 300 sensors per car
• Up to 2000 channels
• Sensor data are sent to the paddock in 2ms
© 2017 MapR Technologies@tgrall 13
Got examples?
• Up to 300 sensors per car
• Up to 2000 channels
• Sensor data are sent to the paddock in 2ms
• 1.5 billions of data points for a race
© 2017 MapR Technologies@tgrall 14
Got examples?
• Up to 300 sensors per car
• Up to 2000 channels
• Sensor data are sent to the paddock in 2ms
• 1.5 billions of data points for a race
• 5 billions for a full race weekend
© 2017 MapR Technologies@tgrall 15
Got examples?
• Up to 300 sensors per car
• Up to 2000 channels
• Sensor data are sent to the paddock in 2ms
• 1.5 billions of data points for a race
• 5 billions for a full race weekend
• 5/6Gb of compressed data per car for 90mn
© 2017 MapR Technologies@tgrall 16
Got examples?
• Up to 300 sensors per car
• Up to 2000 channels
• Sensor data are sent to the paddock in 2ms
• 1.5 billions of data points for a race
• 5 billions for a full race weekend
• 5/6Gb of compressed data per car for 90mn
US Grand Prix 2014 : 243 Tb (race teams combined)
© 2017 MapR Technologies@tgrall 17
How does that work?
© 2017 MapR Technologies@tgrall 18
Production System Outline
© 2017 MapR Technologies@tgrall 19
Simplified Demo System Outline
© 2017 MapR Technologies@tgrall 20
TORCS for Cars, Physics & Drivers
• The Open Racing Car Simulator
– http://torcs.sourceforge.net/
• Car racing game
• AI & Research Platform
© 2017 MapR Technologies@tgrall 21
Demo
© 2017 MapR Technologies@tgrall 22
Architecture
Events Producers Events Consumers
sensors data
Real Time
Analytics
Analytics with SQL
https://github.com/mapr-demos/racing-time-series
© 2017 MapR Technologies@tgrall 23
Architecture
Events Producers Events Consumers
sensors data
Real Time
Analytics
https://github.com/mapr-demos/racing-time-series
JSONProducer
Consumer
© 2017 MapR Technologies@tgrall 24
Where/How to store data ?
© 2017 MapR Technologies@tgrall 25
Big Datastore
Distributed File System
HDFS/MapR-FS
NoSQL Database
HBase/MapR-DB
….
© 2017 MapR Technologies@tgrall 26
Data Streaming
© 2017 MapR Technologies@tgrall 27
Data Streaming

Moving millions of events per h|mn|s
© 2017 MapR Technologies@tgrall 28
What is Kafka?
• http://kafka.apache.org/
• Created at LinkedIn, open sourced in 2011
• Implemented in Scala / Java
• Distributed messaging system built to scale
© 2017 MapR Technologies@tgrall 29
Big Picture
Producer
Producer
Producer
Consumer
Consumer
Consumer
© 2017 MapR Technologies@tgrall 30
Topic & Partitions
0 1 2 3 4 5 6 7 8
0 1 2 3 4 5
0 1 2 3 4 5 6 7
Partition 0
Partition 1
Partition 2
Writes
© 2017 MapR Technologies@tgrall 31
Consumer Groups
© 2017 MapR Technologies@tgrall 32
More real life Kafka …
Zookeeper
Broker 1
Topic A Topic B
Broker 2
Topic A Topic B
Broker 3
Topic A Topic B
Producer
Producer
Producer
Consumer
Consumer
Consumer
© 2017 MapR Technologies@tgrall 33
MapR Streams
• Distributed messaging system built to scale
• Use Apache Kafka API 0.9.0
• No code change
• Does not use the same “broker” architecture
– Log stored in MapR Storage (Scalable, Secured, Fast, Multi DC)
– No Zookeeper
© 2017 MapR Technologies@tgrall 34
Produce Messages


ProducerRecord<String, byte[]> rec = new ProducerRecord<>(

“/apps/racing/stream:sensors_data“,

eventName,

value.toString().getBytes());

producer.send(rec, (recordMetadata, e) -> {

if (e != null) { … });
producer.flush();
© 2017 MapR Technologies@tgrall 35
Consume Messages
long pollTimeOut = 800;

while(true) {

ConsumerRecords<String, String> records = consumer.poll(pollTimeOut);
if (!records.isEmpty()) {
Iterable<ConsumerRecord<String, String>> iterable = records::iterator;

StreamSupport
.stream(iterable.spliterator(), false)
.forEach((record) -> {
// work with record object
record.value();

…


});

consumer.commitAsync();

}

}
© 2017 MapR Technologies@tgrall 36
What’s next?
© 2017 MapR Technologies@tgrall 37
Capture more data
© 2017 MapR Technologies@tgrall 38
Sensor Data V1
• 3 main data points:
– Speed (m/s)
– RPM
– Distance (m)
• Buffered
{ "_id":"1.458141858E9/0.324",
"car" = "car1",
"timestamp":1458141858,
"racetime”:0.324,
"records":
[
{
"sensors":{
"Speed":3.588583,
"Distance":2003.023071,
"RPM":1896.575806
},
"racetime":0.324,
"timestamp":1458141858
},
{
"sensors":{
"Speed":6.755624,
"Distance":2004.084717,
"RPM":1673.264526
},
"racetime":0.556,
"timestamp":1458141858
},
© 2017 MapR Technologies@tgrall 39
Sensor Data V2
• 3 main data points:
– Speed (m/s)
– RPM
– Distance (m)
– Throttle
– Gears
– …
• Buffered
{ "_id":"1.458141858E9/0.324",
"car" = "car1",
"timestamp":1458141858,
"racetime”:0.324,
"records":
[
{
"sensors":{
"Speed":3.588583,
"Distance":2003.023071,
"RPM":1896.575806,
"Throttle" : 33,
"Gear" : 2
},
"racetime":0.324,
"timestamp":1458141858
},
{
"sensors":{
"Speed":6.755624,
"Distance":2004.084717,
“RPM”:1673.264526,
"Throttle" : 37,
"Gear" : 2
},
"racetime":0.556,
"timestamp":1458141858
},
© 2017 MapR Technologies@tgrall 40
Add new services
© 2017 MapR Technologies@tgrall 41
New Data Service
sensors data
Alerts
Stream
Processing
© 2017 MapR Technologies@tgrall 42
• Cluster Computing Platform
• Extends “MapReduce” with extensions
– Streaming
– Interactive Analytics
• Run in Memory
• http://spark.apache.org/
© 2017 MapR Technologies@tgrall 43
• Streaming Dataflow Engine
– Datastream/Dataset APIs
– CEP, Graph, ML
• Run in Memory
• https://flink.apache.org/
© 2017 MapR Technologies@tgrall 44
Demo
Stream Processing with Flink
© 2017 MapR Technologies@tgrall 45
Streaming Architecture & Formula 1
• Stream & Process data in real time
– Use distributed & scalable processing and storage
– NoSQL Database, Distributed File System
• Decouple the source from the consumer(s)
– Dashboard, Analytics, Machine Learning
– Add new use case….
© 2017 MapR Technologies@tgrall 46
Streaming Architecture & Formula 1
• Stream & Process data in real time
– Use distributed & scalable
processing and storage
– NoSQL Database, Distributed
File System
• Decouple the source from the
consumer(s)
– Dashboard, Analytics, Machine
Learning
– Add new use case….
This is not only about Formula 1!
(Telco, Finance, Retail, Content, IT)
© 2017 MapR Technologies@tgrall 47
One Last Thing…
Events Producers Events Consumers
Web Socket
https://github.com/tgrall/anki-mapr-demo
BLE
MapR-Streams
Apache Kafka
© 2017 MapR Technologies@tgrall
Streaming Architecture
http://mapr.com/ebooks/
Free ebooks & Online training
http://mapr.com/training/
© 2017 MapR TechnologiesMapR Confidential 49
Fast Cars, Big Data
How Streaming Can Help Formula1 ?
Tugdual Grall
@tgrall

More Related Content

PDF
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...
Carol McDonald
 
PDF
Tugdual Grall - Real World Use Cases: Hadoop and NoSQL in Production
Codemotion
 
PDF
Streaming Machine learning Distributed Pipeline for Real-Time Uber Data Using...
Carol McDonald
 
PDF
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...
Carol McDonald
 
PDF
Streaming Architecture to Connect Everything (Including Hybrid Cloud) - Strat...
Mathieu Dumoulin
 
PDF
Fast Cars, Big Data How Streaming can help Formula 1
Carol McDonald
 
PDF
Spark and MapR Streams: A Motivating Example
Ian Downard
 
PDF
Structured Streaming Data Pipeline Using Kafka, Spark, and MapR-DB
Carol McDonald
 
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...
Carol McDonald
 
Tugdual Grall - Real World Use Cases: Hadoop and NoSQL in Production
Codemotion
 
Streaming Machine learning Distributed Pipeline for Real-Time Uber Data Using...
Carol McDonald
 
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...
Carol McDonald
 
Streaming Architecture to Connect Everything (Including Hybrid Cloud) - Strat...
Mathieu Dumoulin
 
Fast Cars, Big Data How Streaming can help Formula 1
Carol McDonald
 
Spark and MapR Streams: A Motivating Example
Ian Downard
 
Structured Streaming Data Pipeline Using Kafka, Spark, and MapR-DB
Carol McDonald
 

What's hot (20)

PDF
Applying Machine Learning to Live Patient Data
Carol McDonald
 
PPTX
Sharing Sensitive Data Securely
Ted Dunning
 
PDF
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real- T...
Carol McDonald
 
PPTX
State of the Art Robot Predictive Maintenance with Real-time Sensor Data
Mathieu Dumoulin
 
PPTX
CEP - simplified streaming architecture - Strata Singapore 2016
Mathieu Dumoulin
 
PPTX
Where is Data Going? - RMDC Keynote
Ted Dunning
 
PPTX
MapR and Machine Learning Primer
Mathieu Dumoulin
 
PPTX
Apache Spark Machine Learning Decision Trees
Carol McDonald
 
PPTX
The Hive Think Tank: "Stream Processing Systems" by M.C. Srivas of MapR
The Hive
 
PDF
Streaming in the Extreme
Julius Remigio, CBIP
 
PDF
Predicting Flight Delays with Spark Machine Learning
Carol McDonald
 
PDF
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...
MapR Technologies
 
PDF
TraceMON - a new RIPE Atlas tool
AFRINIC
 
PDF
Analyzing Flight Delays with Apache Spark, DataFrames, GraphFrames, and MapR-DB
Carol McDonald
 
PDF
Analysis of Popular Uber Locations using Apache APIs: Spark Machine Learning...
Carol McDonald
 
PDF
Demystifying AI, Machine Learning and Deep Learning
Carol McDonald
 
PDF
Advanced Threat Detection on Streaming Data
Carol McDonald
 
PDF
Streaming patterns revolutionary architectures
Carol McDonald
 
PPTX
Finding Changes in Real Data
Ted Dunning
 
PDF
Live Machine Learning Tutorial: Churn Prediction
MapR Technologies
 
Applying Machine Learning to Live Patient Data
Carol McDonald
 
Sharing Sensitive Data Securely
Ted Dunning
 
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real- T...
Carol McDonald
 
State of the Art Robot Predictive Maintenance with Real-time Sensor Data
Mathieu Dumoulin
 
CEP - simplified streaming architecture - Strata Singapore 2016
Mathieu Dumoulin
 
Where is Data Going? - RMDC Keynote
Ted Dunning
 
MapR and Machine Learning Primer
Mathieu Dumoulin
 
Apache Spark Machine Learning Decision Trees
Carol McDonald
 
The Hive Think Tank: "Stream Processing Systems" by M.C. Srivas of MapR
The Hive
 
Streaming in the Extreme
Julius Remigio, CBIP
 
Predicting Flight Delays with Spark Machine Learning
Carol McDonald
 
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...
MapR Technologies
 
TraceMON - a new RIPE Atlas tool
AFRINIC
 
Analyzing Flight Delays with Apache Spark, DataFrames, GraphFrames, and MapR-DB
Carol McDonald
 
Analysis of Popular Uber Locations using Apache APIs: Spark Machine Learning...
Carol McDonald
 
Demystifying AI, Machine Learning and Deep Learning
Carol McDonald
 
Advanced Threat Detection on Streaming Data
Carol McDonald
 
Streaming patterns revolutionary architectures
Carol McDonald
 
Finding Changes in Real Data
Ted Dunning
 
Live Machine Learning Tutorial: Churn Prediction
MapR Technologies
 
Ad

Similar to Fast Cars, Big Data - How Streaming Can Help Formula 1 - Tugdual Grall - Codemotion Amsterdam 2017 (20)

PDF
Fast Cars, Big Data - How Streaming Can Help Formula 1 - Tugdual Grall - Code...
Codemotion
 
PDF
Fast Cars, Big Data - How Streaming Can Help Formula 1
Tugdual Grall
 
PDF
Real World Use Cases: Hadoop and NoSQL in Production
Codemotion
 
PPTX
How Spark is Enabling the New Wave of Converged Cloud Applications
MapR Technologies
 
PPTX
MapR 5.2: Getting More Value from the MapR Converged Community Edition
MapR Technologies
 
PDF
Live Tutorial – Streaming Real-Time Events Using Apache APIs
MapR Technologies
 
PDF
An Introduction to the MapR Converged Data Platform
MapR Technologies
 
PPTX
Streaming Architecture including Rendezvous for Machine Learning
Ted Dunning
 
PDF
Anomaly Detection in Telecom with Spark - Tugdual Grall - Codemotion Amsterda...
Codemotion
 
PPTX
MapR Edge : Act Locally Learn Globally
ridhav
 
PDF
MapR 5.2: Getting More Value from the MapR Converged Data Platform
MapR Technologies
 
PDF
Introduction to Streaming with Apache Flink
Tugdual Grall
 
PDF
Container and Kubernetes without limits
Antje Barth
 
PPTX
Geo-Distributed Big Data and Analytics
MapR Technologies
 
PPTX
How Spark is Enabling the New Wave of Converged Applications
MapR Technologies
 
PDF
CWIN17 Frankfurt / Cloudera
Capgemini
 
PDF
Predictive Maintenance Using Recurrent Neural Networks
Justin Brandenburg
 
PPTX
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
MapR Technologies
 
PDF
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Mathieu Dumoulin
 
PPTX
Map r seattle streams meetup oct 2016
Nitin Kumar
 
Fast Cars, Big Data - How Streaming Can Help Formula 1 - Tugdual Grall - Code...
Codemotion
 
Fast Cars, Big Data - How Streaming Can Help Formula 1
Tugdual Grall
 
Real World Use Cases: Hadoop and NoSQL in Production
Codemotion
 
How Spark is Enabling the New Wave of Converged Cloud Applications
MapR Technologies
 
MapR 5.2: Getting More Value from the MapR Converged Community Edition
MapR Technologies
 
Live Tutorial – Streaming Real-Time Events Using Apache APIs
MapR Technologies
 
An Introduction to the MapR Converged Data Platform
MapR Technologies
 
Streaming Architecture including Rendezvous for Machine Learning
Ted Dunning
 
Anomaly Detection in Telecom with Spark - Tugdual Grall - Codemotion Amsterda...
Codemotion
 
MapR Edge : Act Locally Learn Globally
ridhav
 
MapR 5.2: Getting More Value from the MapR Converged Data Platform
MapR Technologies
 
Introduction to Streaming with Apache Flink
Tugdual Grall
 
Container and Kubernetes without limits
Antje Barth
 
Geo-Distributed Big Data and Analytics
MapR Technologies
 
How Spark is Enabling the New Wave of Converged Applications
MapR Technologies
 
CWIN17 Frankfurt / Cloudera
Capgemini
 
Predictive Maintenance Using Recurrent Neural Networks
Justin Brandenburg
 
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
MapR Technologies
 
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Mathieu Dumoulin
 
Map r seattle streams meetup oct 2016
Nitin Kumar
 
Ad

More from Codemotion (20)

PDF
Fuzz-testing: A hacker's approach to making your code more secure | Pascal Ze...
Codemotion
 
PDF
Pompili - From hero to_zero: The FatalNoise neverending story
Codemotion
 
PPTX
Pastore - Commodore 65 - La storia
Codemotion
 
PPTX
Pennisi - Essere Richard Altwasser
Codemotion
 
PPTX
Michel Schudel - Let's build a blockchain... in 40 minutes! - Codemotion Amst...
Codemotion
 
PPTX
Richard Süselbeck - Building your own ride share app - Codemotion Amsterdam 2019
Codemotion
 
PPTX
Eward Driehuis - What we learned from 20.000 attacks - Codemotion Amsterdam 2019
Codemotion
 
PPTX
Francesco Baldassarri - Deliver Data at Scale - Codemotion Amsterdam 2019 -
Codemotion
 
PDF
Martin Förtsch, Thomas Endres - Stereoscopic Style Transfer AI - Codemotion A...
Codemotion
 
PDF
Melanie Rieback, Klaus Kursawe - Blockchain Security: Melting the "Silver Bul...
Codemotion
 
PDF
Angelo van der Sijpt - How well do you know your network stack? - Codemotion ...
Codemotion
 
PDF
Lars Wolff - Performance Testing for DevOps in the Cloud - Codemotion Amsterd...
Codemotion
 
PDF
Sascha Wolter - Conversational AI Demystified - Codemotion Amsterdam 2019
Codemotion
 
PDF
Michele Tonutti - Scaling is caring - Codemotion Amsterdam 2019
Codemotion
 
PPTX
Pat Hermens - From 100 to 1,000+ deployments a day - Codemotion Amsterdam 2019
Codemotion
 
PPTX
James Birnie - Using Many Worlds of Compute Power with Quantum - Codemotion A...
Codemotion
 
PDF
Don Goodman-Wilson - Chinese food, motor scooters, and open source developmen...
Codemotion
 
PDF
Pieter Omvlee - The story behind Sketch - Codemotion Amsterdam 2019
Codemotion
 
PDF
Dave Farley - Taking Back “Software Engineering” - Codemotion Amsterdam 2019
Codemotion
 
PDF
Joshua Hoffman - Should the CTO be Coding? - Codemotion Amsterdam 2019
Codemotion
 
Fuzz-testing: A hacker's approach to making your code more secure | Pascal Ze...
Codemotion
 
Pompili - From hero to_zero: The FatalNoise neverending story
Codemotion
 
Pastore - Commodore 65 - La storia
Codemotion
 
Pennisi - Essere Richard Altwasser
Codemotion
 
Michel Schudel - Let's build a blockchain... in 40 minutes! - Codemotion Amst...
Codemotion
 
Richard Süselbeck - Building your own ride share app - Codemotion Amsterdam 2019
Codemotion
 
Eward Driehuis - What we learned from 20.000 attacks - Codemotion Amsterdam 2019
Codemotion
 
Francesco Baldassarri - Deliver Data at Scale - Codemotion Amsterdam 2019 -
Codemotion
 
Martin Förtsch, Thomas Endres - Stereoscopic Style Transfer AI - Codemotion A...
Codemotion
 
Melanie Rieback, Klaus Kursawe - Blockchain Security: Melting the "Silver Bul...
Codemotion
 
Angelo van der Sijpt - How well do you know your network stack? - Codemotion ...
Codemotion
 
Lars Wolff - Performance Testing for DevOps in the Cloud - Codemotion Amsterd...
Codemotion
 
Sascha Wolter - Conversational AI Demystified - Codemotion Amsterdam 2019
Codemotion
 
Michele Tonutti - Scaling is caring - Codemotion Amsterdam 2019
Codemotion
 
Pat Hermens - From 100 to 1,000+ deployments a day - Codemotion Amsterdam 2019
Codemotion
 
James Birnie - Using Many Worlds of Compute Power with Quantum - Codemotion A...
Codemotion
 
Don Goodman-Wilson - Chinese food, motor scooters, and open source developmen...
Codemotion
 
Pieter Omvlee - The story behind Sketch - Codemotion Amsterdam 2019
Codemotion
 
Dave Farley - Taking Back “Software Engineering” - Codemotion Amsterdam 2019
Codemotion
 
Joshua Hoffman - Should the CTO be Coding? - Codemotion Amsterdam 2019
Codemotion
 

Recently uploaded (20)

PDF
A Day in the Life of Location Data - Turning Where into How.pdf
Precisely
 
PPTX
Comunidade Salesforce São Paulo - Desmistificando o Omnistudio (Vlocity)
Francisco Vieira Júnior
 
PDF
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
PDF
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
PDF
BLW VOCATIONAL TRAINING SUMMER INTERNSHIP REPORT
codernjn73
 
PDF
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
PPTX
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
PDF
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
PDF
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
PDF
Security features in Dell, HP, and Lenovo PC systems: A research-based compar...
Principled Technologies
 
PDF
Architecture of the Future (09152021)
EdwardMeyman
 
PDF
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
PPTX
ChatGPT's Deck on The Enduring Legacy of Fax Machines
Greg Swan
 
PDF
Advances in Ultra High Voltage (UHV) Transmission and Distribution Systems.pdf
Nabajyoti Banik
 
PDF
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PDF
A Strategic Analysis of the MVNO Wave in Emerging Markets.pdf
IPLOOK Networks
 
PDF
CIFDAQ's Market Wrap : Bears Back in Control?
CIFDAQ
 
PDF
Beyond Automation: The Role of IoT Sensor Integration in Next-Gen Industries
Rejig Digital
 
PPTX
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
A Day in the Life of Location Data - Turning Where into How.pdf
Precisely
 
Comunidade Salesforce São Paulo - Desmistificando o Omnistudio (Vlocity)
Francisco Vieira Júnior
 
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
BLW VOCATIONAL TRAINING SUMMER INTERNSHIP REPORT
codernjn73
 
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
Security features in Dell, HP, and Lenovo PC systems: A research-based compar...
Principled Technologies
 
Architecture of the Future (09152021)
EdwardMeyman
 
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
ChatGPT's Deck on The Enduring Legacy of Fax Machines
Greg Swan
 
Advances in Ultra High Voltage (UHV) Transmission and Distribution Systems.pdf
Nabajyoti Banik
 
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
A Strategic Analysis of the MVNO Wave in Emerging Markets.pdf
IPLOOK Networks
 
CIFDAQ's Market Wrap : Bears Back in Control?
CIFDAQ
 
Beyond Automation: The Role of IoT Sensor Integration in Next-Gen Industries
Rejig Digital
 
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 

Fast Cars, Big Data - How Streaming Can Help Formula 1 - Tugdual Grall - Codemotion Amsterdam 2017

  • 1. © 2017 MapR TechnologiesMapR Confidential 1 Fast Cars, Big Data How Streaming Can Help Formula1 ? Tugdual Grall @tgrall
  • 2. © 2017 MapR Technologies@tgrall {“about” : “me”} Tugdual “Tug” Grall • MapR : Technical Evangelist • MongoDB, Couchbase, eXo, Oracle • NantesJUG co-founder
 • @tgrall • http://tgrall.github.io • [email protected] / [email protected]
  • 3. © 2017 MapR Technologies@tgrall 3 Open Source Engines & Tools Commercial Engines & Applications Utility-Grade Platform Services DataProcessing Web-Scale Storage MapR-FS MapR-DB Search and Others Real Time Unified Security Multi-tenancy Disaster Recovery Global NamespaceHigh Availability MapR Streams Cloud and Managed Services Search and Others UnifiedManagementandMonitoring Search and Others Event StreamingDatabase Custom Apps MapR Converged Data Platform
  • 4. © 2017 MapR Technologies@tgrall 4 Agenda • What’s the point of data in motorsports? • Demo • Architecture • What’s next? • Q&A
  • 5. © 2017 MapR Technologies@tgrall 5 What’s the point of data in motorsports?
  • 6. © 2017 MapR Technologies@tgrall 6
  • 7. © 2017 MapR Technologies@tgrall 7 Data in Motorsports F1 Framework - http://f1framework.blogspot.de/2013/08/short-guide-to-f1-telemetry-spa-circuit.html
  • 8. © 2017 MapR Technologies@tgrall 8 Race Strategy F1 Framework - http://f1framework.blogspot.be/2013/08/race-strategy-explained.html
  • 9. © 2017 MapR Technologies@tgrall 9 Got examples?
  • 10. © 2017 MapR Technologies@tgrall 10 Got examples? • Up to 300 sensors per car
  • 11. © 2017 MapR Technologies@tgrall 11 Got examples? • Up to 300 sensors per car • Up to 2000 channels
  • 12. © 2017 MapR Technologies@tgrall 12 Got examples? • Up to 300 sensors per car • Up to 2000 channels • Sensor data are sent to the paddock in 2ms
  • 13. © 2017 MapR Technologies@tgrall 13 Got examples? • Up to 300 sensors per car • Up to 2000 channels • Sensor data are sent to the paddock in 2ms • 1.5 billions of data points for a race
  • 14. © 2017 MapR Technologies@tgrall 14 Got examples? • Up to 300 sensors per car • Up to 2000 channels • Sensor data are sent to the paddock in 2ms • 1.5 billions of data points for a race • 5 billions for a full race weekend
  • 15. © 2017 MapR Technologies@tgrall 15 Got examples? • Up to 300 sensors per car • Up to 2000 channels • Sensor data are sent to the paddock in 2ms • 1.5 billions of data points for a race • 5 billions for a full race weekend • 5/6Gb of compressed data per car for 90mn
  • 16. © 2017 MapR Technologies@tgrall 16 Got examples? • Up to 300 sensors per car • Up to 2000 channels • Sensor data are sent to the paddock in 2ms • 1.5 billions of data points for a race • 5 billions for a full race weekend • 5/6Gb of compressed data per car for 90mn US Grand Prix 2014 : 243 Tb (race teams combined)
  • 17. © 2017 MapR Technologies@tgrall 17 How does that work?
  • 18. © 2017 MapR Technologies@tgrall 18 Production System Outline
  • 19. © 2017 MapR Technologies@tgrall 19 Simplified Demo System Outline
  • 20. © 2017 MapR Technologies@tgrall 20 TORCS for Cars, Physics & Drivers • The Open Racing Car Simulator – http://torcs.sourceforge.net/ • Car racing game • AI & Research Platform
  • 21. © 2017 MapR Technologies@tgrall 21 Demo
  • 22. © 2017 MapR Technologies@tgrall 22 Architecture Events Producers Events Consumers sensors data Real Time Analytics Analytics with SQL https://github.com/mapr-demos/racing-time-series
  • 23. © 2017 MapR Technologies@tgrall 23 Architecture Events Producers Events Consumers sensors data Real Time Analytics https://github.com/mapr-demos/racing-time-series JSONProducer Consumer
  • 24. © 2017 MapR Technologies@tgrall 24 Where/How to store data ?
  • 25. © 2017 MapR Technologies@tgrall 25 Big Datastore Distributed File System HDFS/MapR-FS NoSQL Database HBase/MapR-DB ….
  • 26. © 2017 MapR Technologies@tgrall 26 Data Streaming
  • 27. © 2017 MapR Technologies@tgrall 27 Data Streaming
 Moving millions of events per h|mn|s
  • 28. © 2017 MapR Technologies@tgrall 28 What is Kafka? • http://kafka.apache.org/ • Created at LinkedIn, open sourced in 2011 • Implemented in Scala / Java • Distributed messaging system built to scale
  • 29. © 2017 MapR Technologies@tgrall 29 Big Picture Producer Producer Producer Consumer Consumer Consumer
  • 30. © 2017 MapR Technologies@tgrall 30 Topic & Partitions 0 1 2 3 4 5 6 7 8 0 1 2 3 4 5 0 1 2 3 4 5 6 7 Partition 0 Partition 1 Partition 2 Writes
  • 31. © 2017 MapR Technologies@tgrall 31 Consumer Groups
  • 32. © 2017 MapR Technologies@tgrall 32 More real life Kafka … Zookeeper Broker 1 Topic A Topic B Broker 2 Topic A Topic B Broker 3 Topic A Topic B Producer Producer Producer Consumer Consumer Consumer
  • 33. © 2017 MapR Technologies@tgrall 33 MapR Streams • Distributed messaging system built to scale • Use Apache Kafka API 0.9.0 • No code change • Does not use the same “broker” architecture – Log stored in MapR Storage (Scalable, Secured, Fast, Multi DC) – No Zookeeper
  • 34. © 2017 MapR Technologies@tgrall 34 Produce Messages 
 ProducerRecord<String, byte[]> rec = new ProducerRecord<>(
 “/apps/racing/stream:sensors_data“,
 eventName,
 value.toString().getBytes());
 producer.send(rec, (recordMetadata, e) -> {
 if (e != null) { … }); producer.flush();
  • 35. © 2017 MapR Technologies@tgrall 35 Consume Messages long pollTimeOut = 800;
 while(true) {
 ConsumerRecords<String, String> records = consumer.poll(pollTimeOut); if (!records.isEmpty()) { Iterable<ConsumerRecord<String, String>> iterable = records::iterator;
 StreamSupport .stream(iterable.spliterator(), false) .forEach((record) -> { // work with record object record.value();
 … 
 });
 consumer.commitAsync();
 }
 }
  • 36. © 2017 MapR Technologies@tgrall 36 What’s next?
  • 37. © 2017 MapR Technologies@tgrall 37 Capture more data
  • 38. © 2017 MapR Technologies@tgrall 38 Sensor Data V1 • 3 main data points: – Speed (m/s) – RPM – Distance (m) • Buffered { "_id":"1.458141858E9/0.324", "car" = "car1", "timestamp":1458141858, "racetime”:0.324, "records": [ { "sensors":{ "Speed":3.588583, "Distance":2003.023071, "RPM":1896.575806 }, "racetime":0.324, "timestamp":1458141858 }, { "sensors":{ "Speed":6.755624, "Distance":2004.084717, "RPM":1673.264526 }, "racetime":0.556, "timestamp":1458141858 },
  • 39. © 2017 MapR Technologies@tgrall 39 Sensor Data V2 • 3 main data points: – Speed (m/s) – RPM – Distance (m) – Throttle – Gears – … • Buffered { "_id":"1.458141858E9/0.324", "car" = "car1", "timestamp":1458141858, "racetime”:0.324, "records": [ { "sensors":{ "Speed":3.588583, "Distance":2003.023071, "RPM":1896.575806, "Throttle" : 33, "Gear" : 2 }, "racetime":0.324, "timestamp":1458141858 }, { "sensors":{ "Speed":6.755624, "Distance":2004.084717, “RPM”:1673.264526, "Throttle" : 37, "Gear" : 2 }, "racetime":0.556, "timestamp":1458141858 },
  • 40. © 2017 MapR Technologies@tgrall 40 Add new services
  • 41. © 2017 MapR Technologies@tgrall 41 New Data Service sensors data Alerts Stream Processing
  • 42. © 2017 MapR Technologies@tgrall 42 • Cluster Computing Platform • Extends “MapReduce” with extensions – Streaming – Interactive Analytics • Run in Memory • http://spark.apache.org/
  • 43. © 2017 MapR Technologies@tgrall 43 • Streaming Dataflow Engine – Datastream/Dataset APIs – CEP, Graph, ML • Run in Memory • https://flink.apache.org/
  • 44. © 2017 MapR Technologies@tgrall 44 Demo Stream Processing with Flink
  • 45. © 2017 MapR Technologies@tgrall 45 Streaming Architecture & Formula 1 • Stream & Process data in real time – Use distributed & scalable processing and storage – NoSQL Database, Distributed File System • Decouple the source from the consumer(s) – Dashboard, Analytics, Machine Learning – Add new use case….
  • 46. © 2017 MapR Technologies@tgrall 46 Streaming Architecture & Formula 1 • Stream & Process data in real time – Use distributed & scalable processing and storage – NoSQL Database, Distributed File System • Decouple the source from the consumer(s) – Dashboard, Analytics, Machine Learning – Add new use case…. This is not only about Formula 1! (Telco, Finance, Retail, Content, IT)
  • 47. © 2017 MapR Technologies@tgrall 47 One Last Thing… Events Producers Events Consumers Web Socket https://github.com/tgrall/anki-mapr-demo BLE MapR-Streams Apache Kafka
  • 48. © 2017 MapR Technologies@tgrall Streaming Architecture http://mapr.com/ebooks/ Free ebooks & Online training http://mapr.com/training/
  • 49. © 2017 MapR TechnologiesMapR Confidential 49 Fast Cars, Big Data How Streaming Can Help Formula1 ? Tugdual Grall @tgrall