SlideShare a Scribd company logo
Apache Apex as
YARN Application
Chinmay Kolhatkar (chinmay@apache.org)
Mar 22, 2016
Apache Apex Meetup
Agenda
• Directed Acyclic Graph
• Apex as a YARN Application
• Application Components of Apex
• Lifecycle of Apex as a YARN Application
Apache Apex Meetup
Directed Acyclic Graph (DAG)
• Defines compute stages of streaming application
• Defines tuple flow across Operators via Stream
Compute
1
Apache Apex Meetup
Compute
3
Compute
2
Compute
4
DAG Components
• Tuple
● Atomic data that flows over a stream
• Operator
● Basic compute unit per tuple
• Stream
● Connector abstraction between operators
● Tuples flow over this
Operator
1
Operator
2
Apache Apex Meetup
Stream
tuple
3
tuple
1
tuple
2
DAG Types
O1 O2
O3
O4
Physical DAG
Apache Apex Meetup
O5
Logical DAG
• Logical Plan
● Logical representation of computation
● Defines operators, streams and dataflow
• Physical Plan
● Deployable plan on cluster
● Contains partition information
of operators
● Has ready-to-deploy serialized operator
instances
O1
P1
O1
P2
O1
P3
O2
P1
O2
P2
O2
P3
U
O3
O4
O5
Apex as YARN application
Node
ResourceManager
(AsM + Scheduler)
NM Node NM Node NM
YarnClient
AppMaster
YarnContainer
YarnContainer
YarnContainer
StrAM
(AppMaster)
YarnContainer
StrAMChild
O1 O2
YarnContainer
StrAMChild
O3
DTCLI
StrAMClient
YarnClient
Apache Apex Meetup
ClientRM
Protocol
AMRM
Protocol
ContainerManager
Protocol
ContainerManager
Protocol
ClientRM
Protocol
AMRM
Protocol
ContainerManager
Protocol
Application Components of Apex - StrAMClient
• Part of dtcli client interface
• Invoked by “launch” command of dtcli
• Tasks:
● Copy required the application package files into HDFS
● Validate Logical Plan
● Serialize Logical plan to HDFS
● Launch Application Master i.e. StrAM
Apache Apex Meetup
Application Components of Apex - StrAM
• Streaming Application Master
• Started by StrAMClient on a YarnContainer
• Tasks:
● Convert logical plan to physical plan
● Serialize operators to HDFS
● Request for resources to ResourceManager
● Start StrAMChild in YarnContainer(s)
● Monitor StrAMChild using ContainerManager protocol
● Generate Application statistics
● Host results on WebService (dtManage)
● Fault Tolerance
● Checkpointing/Committing Application States
● Support Security
● Shutdown Application
Apache Apex Meetup
Application Components of Apex - StrAMChild
• Deployed on YarnContainer
• Started by NodeManager as instructed by StrAM
• Instance of StreamingContainer
• Contains Operators (compute-related)
• Contains BufferServer (stream-related)
• Tasks:
● Regularly send heartbeat to StrAM
● Execute commands from StrAM
● Shutdown or Kill self if instructed
● Manage lifecycle of an Operator
● Network communication using BufferServer
Apache Apex Meetup
Lifecycle of Apex/YARN Application - Start
Node
ResourceManager
(AsM + Scheduler)
NM Node NM Node NM
DTCLI/
StrAMClient
(YarnClient)
1) Access cluster information
HDFS
3) Submit Application to RM
StrAM
(AppMaster)
4) StrAM Registers with RM
5) StrAM sends heartbeats regularly
6) StrAM request containers with specifications
7) StrAMChild reads
serialized operator
from HDFS
8) StrAMChild starts
operator lifecycle
Apache Apex Meetup
2) Copies files from HDFS
ClientRMProtocol
AMRMProtocol
YarnContainer
StrAMChild
O2
O1
YarnContainer
StrAMChild
O3
YarnContainer
StrAMChild
O4ContainerManager
Protocol
ContainerManager
Protocol
Lifecycle of Apex/YARN Application - Running
Node
ResourceManager
(AsM + Scheduler)
NM Node NM Node NM
DTCLI/
StrAMClient
(YarnClient)
HDFS
StrAM
(AppMaster)
Apache Apex Meetup
ClientRMProtocol
AMRMProtocol
YarnContainer
StrAMChild
O2
O1
YarnContainer
StrAMChild
O3
YarnContainer
StrAMChild
O4ContainerManager
Protocol
ContainerManager
Protocol
1) StrAMChild sends
heartbeats
2) StrAMChild sends operator
data
3) StrAM send regular
heartbeats to RM
4) Query status of application
Lifecycle of Apex/YARN Application - Shutdown
Node
ResourceManager
(AsM + Scheduler)
NM Node NM Node NM
DTCLI/
StrAMClient
(YarnClient)
HDFS
StrAM
(AppMaster)
Apache Apex Meetup
ClientRMProtocol
AMRMProtocol
YarnContainer
StrAMChild
O2
O1
YarnContainer
StrAMChild
O3
YarnContainer
StrAMChild
O4ContainerManager
Protocol
ContainerManager
Protocol
1) Connect on WebService
REST API
3) Send shutdown signal to
StrAMChild
4) StrAMChild finishes
operator lifecycle
5) Check if all containers are freed
6) StrAM unregisters itself
7) StrAM exits
8) Check if application has
shutdown
2) Send command to StrAM
Lifecycle of Apex/YARN Application - Kill
Node
ResourceManager
(AsM + Scheduler)
NM Node NM Node NM
DTCLI/
StrAMClient
(YarnClient)
HDFS
StrAM
(AppMaster)
Apache Apex Meetup
ClientRMProtocol
AMRMProtocol
YarnContainer
StrAMChild
O2
O1
YarnContainer
StrAMChild
O3
YarnContainer
StrAMChild
O4ContainerManager
Protocol
ContainerManager
Protocol
1) Send kill-app command to YARN
2) RM kills all containers
Summary – Apex platform
• Enables YARN to be used for Streaming Applications
• Takes care of YARN specific work
• User can focus on business logic defined in Operators
Apache Apex Meetup
15
Apache Apex Meetup
Resources
Apache Apex Meetup
• Apache Apex website - http://apex.incubator.apache.org/
• Subscribe - http://apex.incubator.apache.org/community.html
• Download - http://apex.incubator.apache.org/downloads.html
• Twitter - @ApacheApex; Follow - https://twitter.com/apacheapex
• Facebook - https://www.facebook.com/ApacheApex/
• Meetup - http://www.meetup.com/topics/apache-apex
• Startup Program – Free Enterprise License for startups, Universities, Non-Profits
Upcoming events...
Apache Apex Meetup
• March 24th 9am PST - Fault Tolerance and Processing Semantics with Apache
Apex
• March 28th 6pm PST - Low-latency ingestion and analytics with Apache Kafka
and Apache Apex (Hadoop)
• ...

More Related Content

PDF
Apache Apex as YARN Application
Chinmay Kolhatkar
 
PDF
Flink Forward Berlin 2017: Pramod Bhatotia, Do Le Quoc - StreamApprox: Approx...
Flink Forward
 
PPTX
Till Rohrmann – Fault Tolerance and Job Recovery in Apache Flink
Flink Forward
 
PPTX
University program - writing an apache apex application
Akshay Gore
 
PDF
Finding OOMS in Legacy Systems with the Syslog Telegraf Plugin
InfluxData
 
PDF
Spark Summit EU talk by Nimbus Goehausen
Spark Summit
 
PPTX
Frossie Economou & Angelo Fausti [Vera C. Rubin Observatory] | How InfluxDB H...
InfluxData
 
PDF
PSUG #52 Dataflow and simplified reactive programming with Akka-streams
Stephane Manciot
 
Apache Apex as YARN Application
Chinmay Kolhatkar
 
Flink Forward Berlin 2017: Pramod Bhatotia, Do Le Quoc - StreamApprox: Approx...
Flink Forward
 
Till Rohrmann – Fault Tolerance and Job Recovery in Apache Flink
Flink Forward
 
University program - writing an apache apex application
Akshay Gore
 
Finding OOMS in Legacy Systems with the Syslog Telegraf Plugin
InfluxData
 
Spark Summit EU talk by Nimbus Goehausen
Spark Summit
 
Frossie Economou & Angelo Fausti [Vera C. Rubin Observatory] | How InfluxDB H...
InfluxData
 
PSUG #52 Dataflow and simplified reactive programming with Akka-streams
Stephane Manciot
 

What's hot (20)

PPTX
Flink Forward SF 2017: Till Rohrmann - Redesigning Apache Flink’s Distributed...
Flink Forward
 
PDF
Apache Flink internals
Kostas Tzoumas
 
PPTX
How Texas Instruments Uses InfluxDB to Uphold Product Standards and to Improv...
InfluxData
 
PDF
SignalFx: Making Cassandra Perform as a Time Series Database
DataStax Academy
 
PDF
Akka streams
mircodotta
 
ODP
Akka streams
Knoldus Inc.
 
PDF
Virtual Flink Forward 2020: Autoscaling Flink at Netflix - Timothy Farkas
Flink Forward
 
PDF
Flink Gelly - Karlsruhe - June 2015
Andra Lungu
 
PPTX
Apache Flink Training: System Overview
Flink Forward
 
PDF
Self-managed and automatically reconfigurable stream processing
Vasia Kalavri
 
PDF
Scaling ingest pipelines with high performance computing principles - Rajiv K...
SignalFx
 
PPTX
Javantura v3 - Going Reactive with RxJava – Hrvoje Crnjak
HUJAK - Hrvatska udruga Java korisnika / Croatian Java User Association
 
PDF
Flink Forward Berlin 2017: Robert Metzger - Keep it going - How to reliably a...
Flink Forward
 
PDF
Introducing Exactly Once Semantics in Apache Kafka with Matthias J. Sax
Databricks
 
PDF
Flink Forward Berlin 2017: Jörg Schad, Till Rohrmann - Apache Flink meets Apa...
Flink Forward
 
ODP
Introduction to ScalaZ
Knoldus Inc.
 
PDF
Virtual Flink Forward 2020: Build your next-generation stream platform based ...
Flink Forward
 
PDF
Flink Forward Berlin 2017: Aljoscha Krettek - Talk Python to me: Stream Proce...
Flink Forward
 
PDF
Flink Forward SF 2017: Joe Olson - Using Flink and Queryable State to Buffer ...
Flink Forward
 
PDF
Processing Big Data in Real-Time - Yanai Franchi, Tikal
Codemotion Tel Aviv
 
Flink Forward SF 2017: Till Rohrmann - Redesigning Apache Flink’s Distributed...
Flink Forward
 
Apache Flink internals
Kostas Tzoumas
 
How Texas Instruments Uses InfluxDB to Uphold Product Standards and to Improv...
InfluxData
 
SignalFx: Making Cassandra Perform as a Time Series Database
DataStax Academy
 
Akka streams
mircodotta
 
Akka streams
Knoldus Inc.
 
Virtual Flink Forward 2020: Autoscaling Flink at Netflix - Timothy Farkas
Flink Forward
 
Flink Gelly - Karlsruhe - June 2015
Andra Lungu
 
Apache Flink Training: System Overview
Flink Forward
 
Self-managed and automatically reconfigurable stream processing
Vasia Kalavri
 
Scaling ingest pipelines with high performance computing principles - Rajiv K...
SignalFx
 
Javantura v3 - Going Reactive with RxJava – Hrvoje Crnjak
HUJAK - Hrvatska udruga Java korisnika / Croatian Java User Association
 
Flink Forward Berlin 2017: Robert Metzger - Keep it going - How to reliably a...
Flink Forward
 
Introducing Exactly Once Semantics in Apache Kafka with Matthias J. Sax
Databricks
 
Flink Forward Berlin 2017: Jörg Schad, Till Rohrmann - Apache Flink meets Apa...
Flink Forward
 
Introduction to ScalaZ
Knoldus Inc.
 
Virtual Flink Forward 2020: Build your next-generation stream platform based ...
Flink Forward
 
Flink Forward Berlin 2017: Aljoscha Krettek - Talk Python to me: Stream Proce...
Flink Forward
 
Flink Forward SF 2017: Joe Olson - Using Flink and Queryable State to Buffer ...
Flink Forward
 
Processing Big Data in Real-Time - Yanai Franchi, Tikal
Codemotion Tel Aviv
 
Ad

Similar to Apex as yarn application (20)

PPTX
Intro to YARN (Hadoop 2.0) & Apex as YARN App (Next Gen Big Data)
Apache Apex
 
PDF
Spark on yarn
datamantra
 
PPTX
Ingestion and Dimensions Compute and Enrich using Apache Apex
Apache Apex
 
PDF
Introduction to Apache Apex
Chinmay Kolhatkar
 
PPTX
Intro to Apache Apex - Next Gen Native Hadoop Platform - Hackac
Apache Apex
 
PPTX
Introduction to Apache Apex and writing a big data streaming application
Apache Apex
 
PDF
BigDataSpain 2016: Stream Processing Applications with Apache Apex
Thomas Weise
 
PPTX
Apache Apex: Stream Processing Architecture and Applications
Thomas Weise
 
PPTX
Apache Apex: Stream Processing Architecture and Applications
Comsysto Reply GmbH
 
PPTX
Flink Streaming @BudapestData
Gyula Fóra
 
PDF
SF-TAP: Scalable and Flexible Traffic Analysis Platform (USENIX LISA 2015)
Yuuki Takano
 
PPTX
Introduction to Apache Apex
Apache Apex
 
PPTX
End to End Processing of 3.7 Million Telemetry Events per Second using Lambda...
DataWorks Summit/Hadoop Summit
 
PDF
Stream Processing use cases and applications with Apache Apex by Thomas Weise
Big Data Spain
 
PPTX
Apache Apex - BufferServer
Pradeep Dalvi
 
PDF
YOW2018 Cloud Performance Root Cause Analysis at Netflix
Brendan Gregg
 
PPTX
Apache Arrow Flight Overview
Jacques Nadeau
 
PDF
Porting a Streaming Pipeline from Scala to Rust
Evan Chan
 
PPT
Acl Tcam
amit_monty
 
PDF
BigDataSpain 2016: Introduction to Apache Apex
Thomas Weise
 
Intro to YARN (Hadoop 2.0) & Apex as YARN App (Next Gen Big Data)
Apache Apex
 
Spark on yarn
datamantra
 
Ingestion and Dimensions Compute and Enrich using Apache Apex
Apache Apex
 
Introduction to Apache Apex
Chinmay Kolhatkar
 
Intro to Apache Apex - Next Gen Native Hadoop Platform - Hackac
Apache Apex
 
Introduction to Apache Apex and writing a big data streaming application
Apache Apex
 
BigDataSpain 2016: Stream Processing Applications with Apache Apex
Thomas Weise
 
Apache Apex: Stream Processing Architecture and Applications
Thomas Weise
 
Apache Apex: Stream Processing Architecture and Applications
Comsysto Reply GmbH
 
Flink Streaming @BudapestData
Gyula Fóra
 
SF-TAP: Scalable and Flexible Traffic Analysis Platform (USENIX LISA 2015)
Yuuki Takano
 
Introduction to Apache Apex
Apache Apex
 
End to End Processing of 3.7 Million Telemetry Events per Second using Lambda...
DataWorks Summit/Hadoop Summit
 
Stream Processing use cases and applications with Apache Apex by Thomas Weise
Big Data Spain
 
Apache Apex - BufferServer
Pradeep Dalvi
 
YOW2018 Cloud Performance Root Cause Analysis at Netflix
Brendan Gregg
 
Apache Arrow Flight Overview
Jacques Nadeau
 
Porting a Streaming Pipeline from Scala to Rust
Evan Chan
 
Acl Tcam
amit_monty
 
BigDataSpain 2016: Introduction to Apache Apex
Thomas Weise
 
Ad

Recently uploaded (20)

PDF
Software Testing Tools - names and explanation
shruti533256
 
PPTX
AgentX UiPath Community Webinar series - Delhi
RohitRadhakrishnan8
 
PPTX
EE3303-EM-I 25.7.25 electrical machines.pptx
Nagen87
 
PDF
Introduction to Ship Engine Room Systems.pdf
Mahmoud Moghtaderi
 
PDF
2010_Book_EnvironmentalBioengineering (1).pdf
EmilianoRodriguezTll
 
PPTX
Module2 Data Base Design- ER and NF.pptx
gomathisankariv2
 
PDF
top-5-use-cases-for-splunk-security-analytics.pdf
yaghutialireza
 
PPTX
Victory Precisions_Supplier Profile.pptx
victoryprecisions199
 
PPTX
unit 3a.pptx material management. Chapter of operational management
atisht0104
 
PDF
Principles of Food Science and Nutritions
Dr. Yogesh Kumar Kosariya
 
PDF
67243-Cooling and Heating & Calculation.pdf
DHAKA POLYTECHNIC
 
PDF
Cryptography and Information :Security Fundamentals
Dr. Madhuri Jawale
 
PDF
Queuing formulas to evaluate throughputs and servers
gptshubham
 
PDF
Introduction to Data Science: data science process
ShivarkarSandip
 
PPTX
ternal cell structure: leadership, steering
hodeeesite4
 
PDF
FLEX-LNG-Company-Presentation-Nov-2017.pdf
jbloggzs
 
DOCX
SAR - EEEfdfdsdasdsdasdasdasdasdasdasdasda.docx
Kanimozhi676285
 
PPT
1. SYSTEMS, ROLES, AND DEVELOPMENT METHODOLOGIES.ppt
zilow058
 
PPTX
Chapter_Seven_Construction_Reliability_Elective_III_Msc CM
SubashKumarBhattarai
 
PDF
Top 10 read articles In Managing Information Technology.pdf
IJMIT JOURNAL
 
Software Testing Tools - names and explanation
shruti533256
 
AgentX UiPath Community Webinar series - Delhi
RohitRadhakrishnan8
 
EE3303-EM-I 25.7.25 electrical machines.pptx
Nagen87
 
Introduction to Ship Engine Room Systems.pdf
Mahmoud Moghtaderi
 
2010_Book_EnvironmentalBioengineering (1).pdf
EmilianoRodriguezTll
 
Module2 Data Base Design- ER and NF.pptx
gomathisankariv2
 
top-5-use-cases-for-splunk-security-analytics.pdf
yaghutialireza
 
Victory Precisions_Supplier Profile.pptx
victoryprecisions199
 
unit 3a.pptx material management. Chapter of operational management
atisht0104
 
Principles of Food Science and Nutritions
Dr. Yogesh Kumar Kosariya
 
67243-Cooling and Heating & Calculation.pdf
DHAKA POLYTECHNIC
 
Cryptography and Information :Security Fundamentals
Dr. Madhuri Jawale
 
Queuing formulas to evaluate throughputs and servers
gptshubham
 
Introduction to Data Science: data science process
ShivarkarSandip
 
ternal cell structure: leadership, steering
hodeeesite4
 
FLEX-LNG-Company-Presentation-Nov-2017.pdf
jbloggzs
 
SAR - EEEfdfdsdasdsdasdasdasdasdasdasdasda.docx
Kanimozhi676285
 
1. SYSTEMS, ROLES, AND DEVELOPMENT METHODOLOGIES.ppt
zilow058
 
Chapter_Seven_Construction_Reliability_Elective_III_Msc CM
SubashKumarBhattarai
 
Top 10 read articles In Managing Information Technology.pdf
IJMIT JOURNAL
 

Apex as yarn application

  • 1. Apache Apex as YARN Application Chinmay Kolhatkar ([email protected]) Mar 22, 2016 Apache Apex Meetup
  • 2. Agenda • Directed Acyclic Graph • Apex as a YARN Application • Application Components of Apex • Lifecycle of Apex as a YARN Application Apache Apex Meetup
  • 3. Directed Acyclic Graph (DAG) • Defines compute stages of streaming application • Defines tuple flow across Operators via Stream Compute 1 Apache Apex Meetup Compute 3 Compute 2 Compute 4
  • 4. DAG Components • Tuple ● Atomic data that flows over a stream • Operator ● Basic compute unit per tuple • Stream ● Connector abstraction between operators ● Tuples flow over this Operator 1 Operator 2 Apache Apex Meetup Stream tuple 3 tuple 1 tuple 2
  • 5. DAG Types O1 O2 O3 O4 Physical DAG Apache Apex Meetup O5 Logical DAG • Logical Plan ● Logical representation of computation ● Defines operators, streams and dataflow • Physical Plan ● Deployable plan on cluster ● Contains partition information of operators ● Has ready-to-deploy serialized operator instances O1 P1 O1 P2 O1 P3 O2 P1 O2 P2 O2 P3 U O3 O4 O5
  • 6. Apex as YARN application Node ResourceManager (AsM + Scheduler) NM Node NM Node NM YarnClient AppMaster YarnContainer YarnContainer YarnContainer StrAM (AppMaster) YarnContainer StrAMChild O1 O2 YarnContainer StrAMChild O3 DTCLI StrAMClient YarnClient Apache Apex Meetup ClientRM Protocol AMRM Protocol ContainerManager Protocol ContainerManager Protocol ClientRM Protocol AMRM Protocol ContainerManager Protocol
  • 7. Application Components of Apex - StrAMClient • Part of dtcli client interface • Invoked by “launch” command of dtcli • Tasks: ● Copy required the application package files into HDFS ● Validate Logical Plan ● Serialize Logical plan to HDFS ● Launch Application Master i.e. StrAM Apache Apex Meetup
  • 8. Application Components of Apex - StrAM • Streaming Application Master • Started by StrAMClient on a YarnContainer • Tasks: ● Convert logical plan to physical plan ● Serialize operators to HDFS ● Request for resources to ResourceManager ● Start StrAMChild in YarnContainer(s) ● Monitor StrAMChild using ContainerManager protocol ● Generate Application statistics ● Host results on WebService (dtManage) ● Fault Tolerance ● Checkpointing/Committing Application States ● Support Security ● Shutdown Application Apache Apex Meetup
  • 9. Application Components of Apex - StrAMChild • Deployed on YarnContainer • Started by NodeManager as instructed by StrAM • Instance of StreamingContainer • Contains Operators (compute-related) • Contains BufferServer (stream-related) • Tasks: ● Regularly send heartbeat to StrAM ● Execute commands from StrAM ● Shutdown or Kill self if instructed ● Manage lifecycle of an Operator ● Network communication using BufferServer Apache Apex Meetup
  • 10. Lifecycle of Apex/YARN Application - Start Node ResourceManager (AsM + Scheduler) NM Node NM Node NM DTCLI/ StrAMClient (YarnClient) 1) Access cluster information HDFS 3) Submit Application to RM StrAM (AppMaster) 4) StrAM Registers with RM 5) StrAM sends heartbeats regularly 6) StrAM request containers with specifications 7) StrAMChild reads serialized operator from HDFS 8) StrAMChild starts operator lifecycle Apache Apex Meetup 2) Copies files from HDFS ClientRMProtocol AMRMProtocol YarnContainer StrAMChild O2 O1 YarnContainer StrAMChild O3 YarnContainer StrAMChild O4ContainerManager Protocol ContainerManager Protocol
  • 11. Lifecycle of Apex/YARN Application - Running Node ResourceManager (AsM + Scheduler) NM Node NM Node NM DTCLI/ StrAMClient (YarnClient) HDFS StrAM (AppMaster) Apache Apex Meetup ClientRMProtocol AMRMProtocol YarnContainer StrAMChild O2 O1 YarnContainer StrAMChild O3 YarnContainer StrAMChild O4ContainerManager Protocol ContainerManager Protocol 1) StrAMChild sends heartbeats 2) StrAMChild sends operator data 3) StrAM send regular heartbeats to RM 4) Query status of application
  • 12. Lifecycle of Apex/YARN Application - Shutdown Node ResourceManager (AsM + Scheduler) NM Node NM Node NM DTCLI/ StrAMClient (YarnClient) HDFS StrAM (AppMaster) Apache Apex Meetup ClientRMProtocol AMRMProtocol YarnContainer StrAMChild O2 O1 YarnContainer StrAMChild O3 YarnContainer StrAMChild O4ContainerManager Protocol ContainerManager Protocol 1) Connect on WebService REST API 3) Send shutdown signal to StrAMChild 4) StrAMChild finishes operator lifecycle 5) Check if all containers are freed 6) StrAM unregisters itself 7) StrAM exits 8) Check if application has shutdown 2) Send command to StrAM
  • 13. Lifecycle of Apex/YARN Application - Kill Node ResourceManager (AsM + Scheduler) NM Node NM Node NM DTCLI/ StrAMClient (YarnClient) HDFS StrAM (AppMaster) Apache Apex Meetup ClientRMProtocol AMRMProtocol YarnContainer StrAMChild O2 O1 YarnContainer StrAMChild O3 YarnContainer StrAMChild O4ContainerManager Protocol ContainerManager Protocol 1) Send kill-app command to YARN 2) RM kills all containers
  • 14. Summary – Apex platform • Enables YARN to be used for Streaming Applications • Takes care of YARN specific work • User can focus on business logic defined in Operators Apache Apex Meetup
  • 16. Resources Apache Apex Meetup • Apache Apex website - http://apex.incubator.apache.org/ • Subscribe - http://apex.incubator.apache.org/community.html • Download - http://apex.incubator.apache.org/downloads.html • Twitter - @ApacheApex; Follow - https://twitter.com/apacheapex • Facebook - https://www.facebook.com/ApacheApex/ • Meetup - http://www.meetup.com/topics/apache-apex • Startup Program – Free Enterprise License for startups, Universities, Non-Profits
  • 17. Upcoming events... Apache Apex Meetup • March 24th 9am PST - Fault Tolerance and Processing Semantics with Apache Apex • March 28th 6pm PST - Low-latency ingestion and analytics with Apache Kafka and Apache Apex (Hadoop) • ...