SlideShare a Scribd company logo
An Adaptive Replication Scheme For Elastic
Data Stream Processing
Thomas Heinze, Mariam Zia, Robert Krahn, Zbigniew Jerzak, Christof Fetzer
July 02, 2015
© 2015 SAP SE or an SAP affiliate company. All rights reserved. 2InternalPublic
Elasticity
 Utilization below 30% in most cloud data centers
 Users needs to reserve required resources
 Limited understanding of the performance of the system
 Limited knowledge of characteristics of the workload
Workload/
Resources
Load
Static Provisioning
Elastic Provisioning
time
Underprovisioning
Overprovisioning
© 2015 SAP SE or an SAP affiliate company. All rights reserved. 3InternalPublic
Configuring an Elastic Scaling System
 Data Stream Processing highly suited for elasticity due to highly variable
load and small state size (e.g. StreamCloud[1] or SEEP[2])
 Key challenge: Minimize the overprovisioning and number of SLA
violations by optimizing scaling decisions
© 2015 SAP SE or an SAP affiliate company. All rights reserved. 4InternalPublic
Enabling Fault Tolerance
 Elasticity requires horizontal scaling → we need fault tolerance
 Two mechanisms: Active Replication vs. Upstream Backup
User-defined
Threshold
Financial
0.0
2.5
5.0
7.5
10.0
12.5
0.0 0.5 1.0 1.5 2.0
Monetary Cost($)
RecoveryTime(insec.)
Active Upstream
© 2015 SAP SE or an SAP affiliate company. All rights reserved. 5InternalPublic
Outline
1. Introduction
2. An Adaptive Replication Scheme
3. Evaluation
4. Conclusion and Future Work
© 2015 SAP SE or an SAP affiliate company. All rights reserved. 6InternalPublic
Related Work
a) Improving Upstream Backup:
 Sweeping checkpoints, …
 Faster Recovery by using Micro Batch Processing (D-Stream [3],
TimeStream [4])
 But: no user-configurable recovery time threshold
b) Combination of both mechanisms:
 Already proposed for Borealis by Hwang et al. [5]
 Static Optimizer proposed by Updahyaya et al. [6]
 Dynamic switching to handle overload/fault case by Martin et al. [7]/
Zhang et al.[8]
 But: static or without user-configurable recovery time threshold
© 2015 SAP SE or an SAP affiliate company. All rights reserved. 7InternalPublic
Adaptive Replication Scheme
 Dynamically switch between upstream backup and active replication
during runtime
 Replication Scheme describes current replication mode for all operators
Active Replication
Upstream BackupUpstream Backup
process
process
passive
reserved
Switch
Roles
Switch
Instance 2
Switch
Instance 1
process
process
reserved
Key Questions:
1) When we need to switch replication mode?
→ Estimation Model for Upstream Backup
2) How to integrate with our elastic scaling system?
→ Adaption Algorithm
© 2015 SAP SE or an SAP affiliate company. All rights reserved. 8InternalPublic
Recovery Time Estimation
 Many factors influence the recovery time:
 Operator type and Checkpointing time (static)
 State Size and Queue Length (changing with the current workload)
 Our solution: estimation based on historical samples
 Accurancy: 0.3 sec. error for 10 sec. recovery time (sample size: 1000)
Clustering EstimationHistorical
Samples
Clustered
Samples
Estimated
Recovery Time
Current workload
characteristics
© 2015 SAP SE or an SAP affiliate company. All rights reserved. 9InternalPublic
Single Operator Scenario
 Observe current state size and queue length of all operators
 Adapt replication scheme if user threshold is not met
t
Operating
interval
Recovery Time
Threshold
Estimated
Recovery
Time
Estimated
Recovery Time
1
2Active Replication
Upstream Backup
© 2015 SAP SE or an SAP affiliate company. All rights reserved. 10InternalPublic
Integration with Elastic Scaling System
 Architecture of an elastic scaling system
 Process many queries in parallel
 Places operators on a varying number of hosts based CPU + network
consumption
 Scaling requires moving operators between hosts
 Integration
 Replication-aware operator placement
 System recovery time = max(recovery time per query)
 Monitor the recovery time for the crash of host h
© 2015 SAP SE or an SAP affiliate company. All rights reserved. 11InternalPublic
Example: Multi Query Scenario
System Recovery
Time:
max(trec(q1) , trec(q2))
F1 A1S
A1‘F1‘
q1:
trec(q1)= max(trec(F1), trec(A1) , trec(D1))
D1
D1‘
F2 A1S
A2‘F2‘
q2:
trec(q2)= max(trec(F2), trec(A2) , trec(D2))
D1
D1‘
© 2015 SAP SE or an SAP affiliate company. All rights reserved. 12InternalPublic
Example: Operator Placement
F1 A1S
A1‘F1‘
D1
D1‘
F2 A2S
A2‘F2‘
q2:
D2
D2‘
Placement:
Host 1
F1
F2
Host 2
A2
D1‘
F1‘
Host 4
A2‘
D2
F1‘
A1
q1:
Host 3
D2‘
D1
A1‘
Recovery
Time (max):
trec(F1), trec(F2),
trec(A1)
trec(A2) trec(D1) trec(A2), trec(D2)
© 2015 SAP SE or an SAP affiliate company. All rights reserved. 13InternalPublic
Example: Too High Recovery Time
F1 A1S
A1‘F1‘
D1
D1‘
F2 A2S
A2‘F2‘
q2:
D2
D2‘
Placement:
Host 1
F1
F2
Host 2
A2
D1‘
F1‘
Host 4
A2‘
D2
F1‘
A1
trec(F1), trec(F2),
trec(A1)
trec(A2) trec(D1)
q1:
Host 3
D2‘
D1
A1‘
trec(A2), trec(D2)Recovery
Time (max):
© 2015 SAP SE or an SAP affiliate company. All rights reserved. 14InternalPublic
Example: Too High Recovery Time
F1 A1S
A1‘F1‘
D1
D1‘
F2 A2S
A2‘F2‘
q2:
D2
D2‘
Placement:
Host 1
F1
F2
Host 2
A2
D1‘
F1‘
Host 4
A2‘
D2
F1‘
A1
trec(F1), trec(F2),
trec(A1)
trec(A2), trec (D1‘) trec(D1), trec(A1)
q1:
Host 3
D2‘
D1
A1‘
trec(A2), trec(D2)Recovery
Time (max):
© 2015 SAP SE or an SAP affiliate company. All rights reserved. 15InternalPublic
Example: Too Low Recovery Time
F1 A1S
A1‘F1‘
D1
D1‘
F2 A2S
A2‘F2‘
q2:
D2
D2‘
Placement:
Host 1
F1
F2
Host 2
A2
D1‘
F1‘
Host 4
A2‘
D2
F1‘
A1
trec(F1), trec(F2),
trec(A1)
trec(A2), trec (D1‘) trec(D1), trec(A1)
q1:
Host 3
D2‘
D1
A1‘
trec(A2), trec(D2)Recovery
Time (max):
© 2015 SAP SE or an SAP affiliate company. All rights reserved. 16InternalPublic
Example: Too Low Recovery Time
F1 A1S
A1‘F1‘
D1
D1‘
F2 A2S
A2‘F2‘
q2:
D2
D2‘
Placement:
Host 1
F1
F2
Host 2
A2
D1‘
F1‘
Host 4
A2‘
D2
F1‘
A1
trec(F1), trec(F2),
trec(A1)
trec(A2), trec (D1‘) trec(D1), trec(A1)
q1:
Host 3
D2‘
D1
A1‘
trec(D2)Recovery
Time (max):
Evaluation
© 2015 SAP SE or an SAP affiliate company. All rights reserved. 18InternalPublic
Setup
 Private cloud environment with up to 12 hosts
 Three Workloads: Financial, Twitter, Energy Sensors
 Measure characteristics like CPU load, latency, etc. in 10 seconds
intervals
 20 crashes of a random host (immediately trigger recovery process)
 Recovery Time measured as maximal latency peak observed after a host
crash
 Two baseline algorithms: Active Replication and Upstream Backup
© 2015 SAP SE or an SAP affiliate company. All rights reserved. 19InternalPublic
Recovery Time For Different Thresholds
© 2015 SAP SE or an SAP affiliate company. All rights reserved. 20InternalPublic
Adaptive Replication Scheme
© 2015 SAP SE or an SAP affiliate company. All rights reserved. 21InternalPublic
Summary
 Active replication/upstream backup forces a hard trade-off between
resource overhead and recovery time
 Our adaptive replication scheme allows to customize trade-off based on
user configuration
Future work
 Formalize approach for replication degree >2
 Network-bound workloads
 Replication Placement
© 2015 SAP SE or an SAP affiliate company. All rights reserved.
Thank you
Contact information:
Thomas Heinze
Research Associate
thomas.heinze@sap.com
Ad

More Related Content

What's hot (20)

A Comparative Study between Honeybee Foraging Behaviour Algorithm and Round ...
A Comparative Study between Honeybee Foraging Behaviour Algorithm and  Round ...A Comparative Study between Honeybee Foraging Behaviour Algorithm and  Round ...
A Comparative Study between Honeybee Foraging Behaviour Algorithm and Round ...
sondhicse
 
Load Balancing In Cloud Computing newppt
Load Balancing In Cloud Computing newpptLoad Balancing In Cloud Computing newppt
Load Balancing In Cloud Computing newppt
Utshab Saha
 
Scheduling of Heterogeneous Tasks in Cloud Computing using Multi Queue (MQ) A...
Scheduling of Heterogeneous Tasks in Cloud Computing using Multi Queue (MQ) A...Scheduling of Heterogeneous Tasks in Cloud Computing using Multi Queue (MQ) A...
Scheduling of Heterogeneous Tasks in Cloud Computing using Multi Queue (MQ) A...
IRJET Journal
 
An Efficient Decentralized Load Balancing Algorithm in Cloud Computing
An Efficient Decentralized Load Balancing Algorithm in Cloud ComputingAn Efficient Decentralized Load Balancing Algorithm in Cloud Computing
An Efficient Decentralized Load Balancing Algorithm in Cloud Computing
Aisha Kalsoom
 
Self-adaptive container monitoring with performance-aware Load-Shedding policies
Self-adaptive container monitoring with performance-aware Load-Shedding policiesSelf-adaptive container monitoring with performance-aware Load-Shedding policies
Self-adaptive container monitoring with performance-aware Load-Shedding policies
NECST Lab @ Politecnico di Milano
 
STUDY ON PROJECT MANAGEMENT THROUGH GENETIC ALGORITHM
STUDY ON PROJECT MANAGEMENT THROUGH GENETIC ALGORITHMSTUDY ON PROJECT MANAGEMENT THROUGH GENETIC ALGORITHM
STUDY ON PROJECT MANAGEMENT THROUGH GENETIC ALGORITHM
Avay Minni
 
Configuration Optimization for Big Data Software
Configuration Optimization for Big Data SoftwareConfiguration Optimization for Big Data Software
Configuration Optimization for Big Data Software
Pooyan Jamshidi
 
Enhancing Performance and Fault Tolerance of Hadoop Cluster
Enhancing Performance and Fault Tolerance of Hadoop ClusterEnhancing Performance and Fault Tolerance of Hadoop Cluster
Enhancing Performance and Fault Tolerance of Hadoop Cluster
IRJET Journal
 
HDFS-HC2: Analysis of Data Placement Strategy based on Computing Power of Nod...
HDFS-HC2: Analysis of Data Placement Strategy based on Computing Power of Nod...HDFS-HC2: Analysis of Data Placement Strategy based on Computing Power of Nod...
HDFS-HC2: Analysis of Data Placement Strategy based on Computing Power of Nod...
Xiao Qin
 
HDFS-HC: A Data Placement Module for Heterogeneous Hadoop Clusters
HDFS-HC: A Data Placement Module for Heterogeneous Hadoop ClustersHDFS-HC: A Data Placement Module for Heterogeneous Hadoop Clusters
HDFS-HC: A Data Placement Module for Heterogeneous Hadoop Clusters
Xiao Qin
 
capacityshifting1
capacityshifting1capacityshifting1
capacityshifting1
Gokul Vasan
 
Self-adaptive container monitoring with performance-aware Load-Shedding policies
Self-adaptive container monitoring with performance-aware Load-Shedding policiesSelf-adaptive container monitoring with performance-aware Load-Shedding policies
Self-adaptive container monitoring with performance-aware Load-Shedding policies
NECST Lab @ Politecnico di Milano
 
Self-adaptive container monitoring with performance-aware Load-Shedding policies
Self-adaptive container monitoring with performance-aware Load-Shedding policiesSelf-adaptive container monitoring with performance-aware Load-Shedding policies
Self-adaptive container monitoring with performance-aware Load-Shedding policies
NECST Lab @ Politecnico di Milano
 
LOAD BALANCING ALGORITHMS
LOAD BALANCING ALGORITHMSLOAD BALANCING ALGORITHMS
LOAD BALANCING ALGORITHMS
tanmayshah95
 
A Scalable Dataflow Implementation of Curran's Approximation Algorithm
A Scalable Dataflow Implementation of Curran's Approximation AlgorithmA Scalable Dataflow Implementation of Curran's Approximation Algorithm
A Scalable Dataflow Implementation of Curran's Approximation Algorithm
NECST Lab @ Politecnico di Milano
 
Buzz Words Dunning Real-Time Learning
Buzz Words Dunning Real-Time LearningBuzz Words Dunning Real-Time Learning
Buzz Words Dunning Real-Time Learning
MapR Technologies
 
Detecting Lateral Movement with a Compute-Intense Graph Kernel
Detecting Lateral Movement with a Compute-Intense Graph KernelDetecting Lateral Movement with a Compute-Intense Graph Kernel
Detecting Lateral Movement with a Compute-Intense Graph Kernel
Data Works MD
 
IEEE CLOUD \'11
IEEE CLOUD \'11IEEE CLOUD \'11
IEEE CLOUD \'11
David Ribeiro Alves
 
load balancing in public cloud ppt
load balancing in public cloud pptload balancing in public cloud ppt
load balancing in public cloud ppt
Krishna Kumar
 
Hadoop fault tolerance
Hadoop  fault toleranceHadoop  fault tolerance
Hadoop fault tolerance
Pallav Jha
 
A Comparative Study between Honeybee Foraging Behaviour Algorithm and Round ...
A Comparative Study between Honeybee Foraging Behaviour Algorithm and  Round ...A Comparative Study between Honeybee Foraging Behaviour Algorithm and  Round ...
A Comparative Study between Honeybee Foraging Behaviour Algorithm and Round ...
sondhicse
 
Load Balancing In Cloud Computing newppt
Load Balancing In Cloud Computing newpptLoad Balancing In Cloud Computing newppt
Load Balancing In Cloud Computing newppt
Utshab Saha
 
Scheduling of Heterogeneous Tasks in Cloud Computing using Multi Queue (MQ) A...
Scheduling of Heterogeneous Tasks in Cloud Computing using Multi Queue (MQ) A...Scheduling of Heterogeneous Tasks in Cloud Computing using Multi Queue (MQ) A...
Scheduling of Heterogeneous Tasks in Cloud Computing using Multi Queue (MQ) A...
IRJET Journal
 
An Efficient Decentralized Load Balancing Algorithm in Cloud Computing
An Efficient Decentralized Load Balancing Algorithm in Cloud ComputingAn Efficient Decentralized Load Balancing Algorithm in Cloud Computing
An Efficient Decentralized Load Balancing Algorithm in Cloud Computing
Aisha Kalsoom
 
Self-adaptive container monitoring with performance-aware Load-Shedding policies
Self-adaptive container monitoring with performance-aware Load-Shedding policiesSelf-adaptive container monitoring with performance-aware Load-Shedding policies
Self-adaptive container monitoring with performance-aware Load-Shedding policies
NECST Lab @ Politecnico di Milano
 
STUDY ON PROJECT MANAGEMENT THROUGH GENETIC ALGORITHM
STUDY ON PROJECT MANAGEMENT THROUGH GENETIC ALGORITHMSTUDY ON PROJECT MANAGEMENT THROUGH GENETIC ALGORITHM
STUDY ON PROJECT MANAGEMENT THROUGH GENETIC ALGORITHM
Avay Minni
 
Configuration Optimization for Big Data Software
Configuration Optimization for Big Data SoftwareConfiguration Optimization for Big Data Software
Configuration Optimization for Big Data Software
Pooyan Jamshidi
 
Enhancing Performance and Fault Tolerance of Hadoop Cluster
Enhancing Performance and Fault Tolerance of Hadoop ClusterEnhancing Performance and Fault Tolerance of Hadoop Cluster
Enhancing Performance and Fault Tolerance of Hadoop Cluster
IRJET Journal
 
HDFS-HC2: Analysis of Data Placement Strategy based on Computing Power of Nod...
HDFS-HC2: Analysis of Data Placement Strategy based on Computing Power of Nod...HDFS-HC2: Analysis of Data Placement Strategy based on Computing Power of Nod...
HDFS-HC2: Analysis of Data Placement Strategy based on Computing Power of Nod...
Xiao Qin
 
HDFS-HC: A Data Placement Module for Heterogeneous Hadoop Clusters
HDFS-HC: A Data Placement Module for Heterogeneous Hadoop ClustersHDFS-HC: A Data Placement Module for Heterogeneous Hadoop Clusters
HDFS-HC: A Data Placement Module for Heterogeneous Hadoop Clusters
Xiao Qin
 
capacityshifting1
capacityshifting1capacityshifting1
capacityshifting1
Gokul Vasan
 
Self-adaptive container monitoring with performance-aware Load-Shedding policies
Self-adaptive container monitoring with performance-aware Load-Shedding policiesSelf-adaptive container monitoring with performance-aware Load-Shedding policies
Self-adaptive container monitoring with performance-aware Load-Shedding policies
NECST Lab @ Politecnico di Milano
 
Self-adaptive container monitoring with performance-aware Load-Shedding policies
Self-adaptive container monitoring with performance-aware Load-Shedding policiesSelf-adaptive container monitoring with performance-aware Load-Shedding policies
Self-adaptive container monitoring with performance-aware Load-Shedding policies
NECST Lab @ Politecnico di Milano
 
LOAD BALANCING ALGORITHMS
LOAD BALANCING ALGORITHMSLOAD BALANCING ALGORITHMS
LOAD BALANCING ALGORITHMS
tanmayshah95
 
A Scalable Dataflow Implementation of Curran's Approximation Algorithm
A Scalable Dataflow Implementation of Curran's Approximation AlgorithmA Scalable Dataflow Implementation of Curran's Approximation Algorithm
A Scalable Dataflow Implementation of Curran's Approximation Algorithm
NECST Lab @ Politecnico di Milano
 
Buzz Words Dunning Real-Time Learning
Buzz Words Dunning Real-Time LearningBuzz Words Dunning Real-Time Learning
Buzz Words Dunning Real-Time Learning
MapR Technologies
 
Detecting Lateral Movement with a Compute-Intense Graph Kernel
Detecting Lateral Movement with a Compute-Intense Graph KernelDetecting Lateral Movement with a Compute-Intense Graph Kernel
Detecting Lateral Movement with a Compute-Intense Graph Kernel
Data Works MD
 
load balancing in public cloud ppt
load balancing in public cloud pptload balancing in public cloud ppt
load balancing in public cloud ppt
Krishna Kumar
 
Hadoop fault tolerance
Hadoop  fault toleranceHadoop  fault tolerance
Hadoop fault tolerance
Pallav Jha
 

Viewers also liked (18)

Visualization-Driven Data Aggregation
Visualization-Driven Data AggregationVisualization-Driven Data Aggregation
Visualization-Driven Data Aggregation
Zbigniew Jerzak
 
High Performance Spatial-Temporal Trajectory Analysis with Spark
High Performance Spatial-Temporal Trajectory Analysis with Spark High Performance Spatial-Temporal Trajectory Analysis with Spark
High Performance Spatial-Temporal Trajectory Analysis with Spark
DataWorks Summit/Hadoop Summit
 
Shn Overview Updated 2009 06 P21 23
Shn Overview   Updated 2009 06 P21 23Shn Overview   Updated 2009 06 P21 23
Shn Overview Updated 2009 06 P21 23
joaovox
 
Latency-aware Elastic Scaling for Distributed Data Stream Processing Systems
Latency-aware Elastic Scaling for Distributed Data Stream Processing SystemsLatency-aware Elastic Scaling for Distributed Data Stream Processing Systems
Latency-aware Elastic Scaling for Distributed Data Stream Processing Systems
Zbigniew Jerzak
 
Cloud-based Data Stream Processing
Cloud-based Data Stream ProcessingCloud-based Data Stream Processing
Cloud-based Data Stream Processing
Zbigniew Jerzak
 
Research Paper Presentation Rubric
Research Paper Presentation RubricResearch Paper Presentation Rubric
Research Paper Presentation Rubric
epfund
 
Dataflow - A Unified Model for Batch and Streaming Data Processing
Dataflow - A Unified Model for Batch and Streaming Data ProcessingDataflow - A Unified Model for Batch and Streaming Data Processing
Dataflow - A Unified Model for Batch and Streaming Data Processing
DoiT International
 
Will it Scale? The Secrets behind Scaling Stream Processing Applications
Will it Scale? The Secrets behind Scaling Stream Processing ApplicationsWill it Scale? The Secrets behind Scaling Stream Processing Applications
Will it Scale? The Secrets behind Scaling Stream Processing Applications
Navina Ramesh
 
The structure of the research paper
The structure of the research paperThe structure of the research paper
The structure of the research paper
Davao Institute for Educational Research Development & Consultancy
 
Ehtsham Elahi, Senior Research Engineer, Personalization Science and Engineer...
Ehtsham Elahi, Senior Research Engineer, Personalization Science and Engineer...Ehtsham Elahi, Senior Research Engineer, Personalization Science and Engineer...
Ehtsham Elahi, Senior Research Engineer, Personalization Science and Engineer...
MLconf
 
Pansig2010 - Cypris Chat
Pansig2010 - Cypris ChatPansig2010 - Cypris Chat
Pansig2010 - Cypris Chat
Mike McKay
 
More amazing photoshop tut
More amazing photoshop tutMore amazing photoshop tut
More amazing photoshop tut
ShdwClaw
 
Git WorkFlow & Best Practice
Git WorkFlow & Best PracticeGit WorkFlow & Best Practice
Git WorkFlow & Best Practice
Hiraq Citra M
 
Shn, permaculture pilot, 2008 april, 21 30
Shn, permaculture pilot, 2008 april, 21 30Shn, permaculture pilot, 2008 april, 21 30
Shn, permaculture pilot, 2008 april, 21 30
joaovox
 
Doug Altman 15 Jan09 V4
Doug Altman 15 Jan09 V4Doug Altman 15 Jan09 V4
Doug Altman 15 Jan09 V4
US Cochrane Center
 
Ddd part 2 modelling qiscus
Ddd part 2   modelling qiscusDdd part 2   modelling qiscus
Ddd part 2 modelling qiscus
Hiraq Citra M
 
Чести проблеми в сигурността на уеб проектите
Чести проблеми в сигурността на уеб проектитеЧести проблеми в сигурността на уеб проектите
Чести проблеми в сигурността на уеб проектите
Veselin Nikolov
 
Moodle and Second Life Registration
Moodle and Second Life RegistrationMoodle and Second Life Registration
Moodle and Second Life Registration
Mike McKay
 
Visualization-Driven Data Aggregation
Visualization-Driven Data AggregationVisualization-Driven Data Aggregation
Visualization-Driven Data Aggregation
Zbigniew Jerzak
 
High Performance Spatial-Temporal Trajectory Analysis with Spark
High Performance Spatial-Temporal Trajectory Analysis with Spark High Performance Spatial-Temporal Trajectory Analysis with Spark
High Performance Spatial-Temporal Trajectory Analysis with Spark
DataWorks Summit/Hadoop Summit
 
Shn Overview Updated 2009 06 P21 23
Shn Overview   Updated 2009 06 P21 23Shn Overview   Updated 2009 06 P21 23
Shn Overview Updated 2009 06 P21 23
joaovox
 
Latency-aware Elastic Scaling for Distributed Data Stream Processing Systems
Latency-aware Elastic Scaling for Distributed Data Stream Processing SystemsLatency-aware Elastic Scaling for Distributed Data Stream Processing Systems
Latency-aware Elastic Scaling for Distributed Data Stream Processing Systems
Zbigniew Jerzak
 
Cloud-based Data Stream Processing
Cloud-based Data Stream ProcessingCloud-based Data Stream Processing
Cloud-based Data Stream Processing
Zbigniew Jerzak
 
Research Paper Presentation Rubric
Research Paper Presentation RubricResearch Paper Presentation Rubric
Research Paper Presentation Rubric
epfund
 
Dataflow - A Unified Model for Batch and Streaming Data Processing
Dataflow - A Unified Model for Batch and Streaming Data ProcessingDataflow - A Unified Model for Batch and Streaming Data Processing
Dataflow - A Unified Model for Batch and Streaming Data Processing
DoiT International
 
Will it Scale? The Secrets behind Scaling Stream Processing Applications
Will it Scale? The Secrets behind Scaling Stream Processing ApplicationsWill it Scale? The Secrets behind Scaling Stream Processing Applications
Will it Scale? The Secrets behind Scaling Stream Processing Applications
Navina Ramesh
 
Ehtsham Elahi, Senior Research Engineer, Personalization Science and Engineer...
Ehtsham Elahi, Senior Research Engineer, Personalization Science and Engineer...Ehtsham Elahi, Senior Research Engineer, Personalization Science and Engineer...
Ehtsham Elahi, Senior Research Engineer, Personalization Science and Engineer...
MLconf
 
Pansig2010 - Cypris Chat
Pansig2010 - Cypris ChatPansig2010 - Cypris Chat
Pansig2010 - Cypris Chat
Mike McKay
 
More amazing photoshop tut
More amazing photoshop tutMore amazing photoshop tut
More amazing photoshop tut
ShdwClaw
 
Git WorkFlow & Best Practice
Git WorkFlow & Best PracticeGit WorkFlow & Best Practice
Git WorkFlow & Best Practice
Hiraq Citra M
 
Shn, permaculture pilot, 2008 april, 21 30
Shn, permaculture pilot, 2008 april, 21 30Shn, permaculture pilot, 2008 april, 21 30
Shn, permaculture pilot, 2008 april, 21 30
joaovox
 
Ddd part 2 modelling qiscus
Ddd part 2   modelling qiscusDdd part 2   modelling qiscus
Ddd part 2 modelling qiscus
Hiraq Citra M
 
Чести проблеми в сигурността на уеб проектите
Чести проблеми в сигурността на уеб проектитеЧести проблеми в сигурността на уеб проектите
Чести проблеми в сигурността на уеб проектите
Veselin Nikolov
 
Moodle and Second Life Registration
Moodle and Second Life RegistrationMoodle and Second Life Registration
Moodle and Second Life Registration
Mike McKay
 
Ad

Similar to Adaptive Replication for Elastic Data Stream Processing (20)

Sybase ASE 15.7- Two Case Studies of Successful Migration
Sybase ASE 15.7- Two Case Studies of Successful Migration Sybase ASE 15.7- Two Case Studies of Successful Migration
Sybase ASE 15.7- Two Case Studies of Successful Migration
SAP Technology
 
Skytap parasoft webinar new years resolution- accelerate sdlc
Skytap parasoft webinar new years resolution- accelerate sdlcSkytap parasoft webinar new years resolution- accelerate sdlc
Skytap parasoft webinar new years resolution- accelerate sdlc
Skytap Cloud
 
OS-CPU-Scheduling-chap5.pptx
OS-CPU-Scheduling-chap5.pptxOS-CPU-Scheduling-chap5.pptx
OS-CPU-Scheduling-chap5.pptx
DrAmarNathDhebla
 
Operating systems chapter 5 silberschatz
Operating systems chapter 5 silberschatzOperating systems chapter 5 silberschatz
Operating systems chapter 5 silberschatz
GiulianoRanauro
 
CS_10_DR_CFD
CS_10_DR_CFDCS_10_DR_CFD
CS_10_DR_CFD
ajaya gummadi
 
ch5.pptx CUP Scheduling and its details in OS
ch5.pptx CUP Scheduling and its details in OSch5.pptx CUP Scheduling and its details in OS
ch5.pptx CUP Scheduling and its details in OS
23017156038
 
SAP HANA SPS10- Scale-Out, High Availability and Disaster Recovery
SAP HANA SPS10- Scale-Out, High Availability and Disaster RecoverySAP HANA SPS10- Scale-Out, High Availability and Disaster Recovery
SAP HANA SPS10- Scale-Out, High Availability and Disaster Recovery
SAP Technology
 
Public Sector Virtual Town Hall: High Availability for PostgreSQL
Public Sector Virtual Town Hall: High Availability for PostgreSQLPublic Sector Virtual Town Hall: High Availability for PostgreSQL
Public Sector Virtual Town Hall: High Availability for PostgreSQL
EDB
 
Adaptive Computing Using PlateSpin Orchestrate
Adaptive Computing Using PlateSpin OrchestrateAdaptive Computing Using PlateSpin Orchestrate
Adaptive Computing Using PlateSpin Orchestrate
Novell
 
ch5_EN_CPUSched_2022.pdf
ch5_EN_CPUSched_2022.pdfch5_EN_CPUSched_2022.pdf
ch5_EN_CPUSched_2022.pdf
CuracaoJTR
 
Ejecución de sizer para SimpliVity Partners
Ejecución de sizer para SimpliVity PartnersEjecución de sizer para SimpliVity Partners
Ejecución de sizer para SimpliVity Partners
JessMoreno901369
 
Best Practice for Supercharging CA Workload Automation dSeries (DE) for Optim...
Best Practice for Supercharging CA Workload Automation dSeries (DE) for Optim...Best Practice for Supercharging CA Workload Automation dSeries (DE) for Optim...
Best Practice for Supercharging CA Workload Automation dSeries (DE) for Optim...
CA Technologies
 
nZDM.ppt
nZDM.pptnZDM.ppt
nZDM.ppt
Navin Somal
 
CPU SCHEDULINGCPU SCHEDULINGCPU SCHEDULINGCPU SCHEDULING.pptx
CPU SCHEDULINGCPU SCHEDULINGCPU SCHEDULINGCPU SCHEDULING.pptxCPU SCHEDULINGCPU SCHEDULINGCPU SCHEDULINGCPU SCHEDULING.pptx
CPU SCHEDULINGCPU SCHEDULINGCPU SCHEDULINGCPU SCHEDULING.pptx
ridmoon40318
 
Comparison of various streaming technologies
Comparison of various streaming technologiesComparison of various streaming technologies
Comparison of various streaming technologies
Sachin Aggarwal
 
Presentation v mware roi tco calculator
Presentation   v mware roi tco calculatorPresentation   v mware roi tco calculator
Presentation v mware roi tco calculator
solarisyourep
 
Beginner's Guide to High Availability for Postgres
Beginner's Guide to High Availability for PostgresBeginner's Guide to High Availability for Postgres
Beginner's Guide to High Availability for Postgres
EDB
 
cloud computing chapter one in computer science
cloud computing chapter one in computer sciencecloud computing chapter one in computer science
cloud computing chapter one in computer science
TSha7
 
operating system in computer science .pdf
operating system in computer science .pdfoperating system in computer science .pdf
operating system in computer science .pdf
TSha7
 
operating system in computer science ch05.pdf
operating system in computer science ch05.pdfoperating system in computer science ch05.pdf
operating system in computer science ch05.pdf
TSha7
 
Sybase ASE 15.7- Two Case Studies of Successful Migration
Sybase ASE 15.7- Two Case Studies of Successful Migration Sybase ASE 15.7- Two Case Studies of Successful Migration
Sybase ASE 15.7- Two Case Studies of Successful Migration
SAP Technology
 
Skytap parasoft webinar new years resolution- accelerate sdlc
Skytap parasoft webinar new years resolution- accelerate sdlcSkytap parasoft webinar new years resolution- accelerate sdlc
Skytap parasoft webinar new years resolution- accelerate sdlc
Skytap Cloud
 
OS-CPU-Scheduling-chap5.pptx
OS-CPU-Scheduling-chap5.pptxOS-CPU-Scheduling-chap5.pptx
OS-CPU-Scheduling-chap5.pptx
DrAmarNathDhebla
 
Operating systems chapter 5 silberschatz
Operating systems chapter 5 silberschatzOperating systems chapter 5 silberschatz
Operating systems chapter 5 silberschatz
GiulianoRanauro
 
ch5.pptx CUP Scheduling and its details in OS
ch5.pptx CUP Scheduling and its details in OSch5.pptx CUP Scheduling and its details in OS
ch5.pptx CUP Scheduling and its details in OS
23017156038
 
SAP HANA SPS10- Scale-Out, High Availability and Disaster Recovery
SAP HANA SPS10- Scale-Out, High Availability and Disaster RecoverySAP HANA SPS10- Scale-Out, High Availability and Disaster Recovery
SAP HANA SPS10- Scale-Out, High Availability and Disaster Recovery
SAP Technology
 
Public Sector Virtual Town Hall: High Availability for PostgreSQL
Public Sector Virtual Town Hall: High Availability for PostgreSQLPublic Sector Virtual Town Hall: High Availability for PostgreSQL
Public Sector Virtual Town Hall: High Availability for PostgreSQL
EDB
 
Adaptive Computing Using PlateSpin Orchestrate
Adaptive Computing Using PlateSpin OrchestrateAdaptive Computing Using PlateSpin Orchestrate
Adaptive Computing Using PlateSpin Orchestrate
Novell
 
ch5_EN_CPUSched_2022.pdf
ch5_EN_CPUSched_2022.pdfch5_EN_CPUSched_2022.pdf
ch5_EN_CPUSched_2022.pdf
CuracaoJTR
 
Ejecución de sizer para SimpliVity Partners
Ejecución de sizer para SimpliVity PartnersEjecución de sizer para SimpliVity Partners
Ejecución de sizer para SimpliVity Partners
JessMoreno901369
 
Best Practice for Supercharging CA Workload Automation dSeries (DE) for Optim...
Best Practice for Supercharging CA Workload Automation dSeries (DE) for Optim...Best Practice for Supercharging CA Workload Automation dSeries (DE) for Optim...
Best Practice for Supercharging CA Workload Automation dSeries (DE) for Optim...
CA Technologies
 
CPU SCHEDULINGCPU SCHEDULINGCPU SCHEDULINGCPU SCHEDULING.pptx
CPU SCHEDULINGCPU SCHEDULINGCPU SCHEDULINGCPU SCHEDULING.pptxCPU SCHEDULINGCPU SCHEDULINGCPU SCHEDULINGCPU SCHEDULING.pptx
CPU SCHEDULINGCPU SCHEDULINGCPU SCHEDULINGCPU SCHEDULING.pptx
ridmoon40318
 
Comparison of various streaming technologies
Comparison of various streaming technologiesComparison of various streaming technologies
Comparison of various streaming technologies
Sachin Aggarwal
 
Presentation v mware roi tco calculator
Presentation   v mware roi tco calculatorPresentation   v mware roi tco calculator
Presentation v mware roi tco calculator
solarisyourep
 
Beginner's Guide to High Availability for Postgres
Beginner's Guide to High Availability for PostgresBeginner's Guide to High Availability for Postgres
Beginner's Guide to High Availability for Postgres
EDB
 
cloud computing chapter one in computer science
cloud computing chapter one in computer sciencecloud computing chapter one in computer science
cloud computing chapter one in computer science
TSha7
 
operating system in computer science .pdf
operating system in computer science .pdfoperating system in computer science .pdf
operating system in computer science .pdf
TSha7
 
operating system in computer science ch05.pdf
operating system in computer science ch05.pdfoperating system in computer science ch05.pdf
operating system in computer science ch05.pdf
TSha7
 
Ad

More from Zbigniew Jerzak (10)

Elastic Scaling of a High-Throughput Content-Based Publish/Subscribe Engine
Elastic Scaling of a High-Throughput Content-Based Publish/Subscribe EngineElastic Scaling of a High-Throughput Content-Based Publish/Subscribe Engine
Elastic Scaling of a High-Throughput Content-Based Publish/Subscribe Engine
Zbigniew Jerzak
 
ThesisXSiena: The Content-Based Publish/Subscribe System
ThesisXSiena: The Content-Based Publish/Subscribe SystemThesisXSiena: The Content-Based Publish/Subscribe System
ThesisXSiena: The Content-Based Publish/Subscribe System
Zbigniew Jerzak
 
Clock Synchronization in Distributed Systems
Clock Synchronization in Distributed SystemsClock Synchronization in Distributed Systems
Clock Synchronization in Distributed Systems
Zbigniew Jerzak
 
XSiena: The Content-Based Publish/Subscribe System
XSiena: The Content-Based Publish/Subscribe SystemXSiena: The Content-Based Publish/Subscribe System
XSiena: The Content-Based Publish/Subscribe System
Zbigniew Jerzak
 
Soft State in Publish/Subscribe
Soft State in Publish/SubscribeSoft State in Publish/Subscribe
Soft State in Publish/Subscribe
Zbigniew Jerzak
 
Highly Available Publish/Subscribe
Highly Available Publish/SubscribeHighly Available Publish/Subscribe
Highly Available Publish/Subscribe
Zbigniew Jerzak
 
Prefix Forwarding for Publish/Subscribe
Prefix Forwarding for Publish/SubscribePrefix Forwarding for Publish/Subscribe
Prefix Forwarding for Publish/Subscribe
Zbigniew Jerzak
 
Fail-Aware Publish/Subscribe
Fail-Aware Publish/SubscribeFail-Aware Publish/Subscribe
Fail-Aware Publish/Subscribe
Zbigniew Jerzak
 
Bloom Filter Based Routing for Content-Based Publish/Subscribe
Bloom Filter Based Routing for Content-Based Publish/SubscribeBloom Filter Based Routing for Content-Based Publish/Subscribe
Bloom Filter Based Routing for Content-Based Publish/Subscribe
Zbigniew Jerzak
 
Adaptive Internal Clock Synchronization
Adaptive Internal Clock SynchronizationAdaptive Internal Clock Synchronization
Adaptive Internal Clock Synchronization
Zbigniew Jerzak
 
Elastic Scaling of a High-Throughput Content-Based Publish/Subscribe Engine
Elastic Scaling of a High-Throughput Content-Based Publish/Subscribe EngineElastic Scaling of a High-Throughput Content-Based Publish/Subscribe Engine
Elastic Scaling of a High-Throughput Content-Based Publish/Subscribe Engine
Zbigniew Jerzak
 
ThesisXSiena: The Content-Based Publish/Subscribe System
ThesisXSiena: The Content-Based Publish/Subscribe SystemThesisXSiena: The Content-Based Publish/Subscribe System
ThesisXSiena: The Content-Based Publish/Subscribe System
Zbigniew Jerzak
 
Clock Synchronization in Distributed Systems
Clock Synchronization in Distributed SystemsClock Synchronization in Distributed Systems
Clock Synchronization in Distributed Systems
Zbigniew Jerzak
 
XSiena: The Content-Based Publish/Subscribe System
XSiena: The Content-Based Publish/Subscribe SystemXSiena: The Content-Based Publish/Subscribe System
XSiena: The Content-Based Publish/Subscribe System
Zbigniew Jerzak
 
Soft State in Publish/Subscribe
Soft State in Publish/SubscribeSoft State in Publish/Subscribe
Soft State in Publish/Subscribe
Zbigniew Jerzak
 
Highly Available Publish/Subscribe
Highly Available Publish/SubscribeHighly Available Publish/Subscribe
Highly Available Publish/Subscribe
Zbigniew Jerzak
 
Prefix Forwarding for Publish/Subscribe
Prefix Forwarding for Publish/SubscribePrefix Forwarding for Publish/Subscribe
Prefix Forwarding for Publish/Subscribe
Zbigniew Jerzak
 
Fail-Aware Publish/Subscribe
Fail-Aware Publish/SubscribeFail-Aware Publish/Subscribe
Fail-Aware Publish/Subscribe
Zbigniew Jerzak
 
Bloom Filter Based Routing for Content-Based Publish/Subscribe
Bloom Filter Based Routing for Content-Based Publish/SubscribeBloom Filter Based Routing for Content-Based Publish/Subscribe
Bloom Filter Based Routing for Content-Based Publish/Subscribe
Zbigniew Jerzak
 
Adaptive Internal Clock Synchronization
Adaptive Internal Clock SynchronizationAdaptive Internal Clock Synchronization
Adaptive Internal Clock Synchronization
Zbigniew Jerzak
 

Recently uploaded (20)

Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..
yuvarajreddy2002
 
Developing Security Orchestration, Automation, and Response Applications
Developing Security Orchestration, Automation, and Response ApplicationsDeveloping Security Orchestration, Automation, and Response Applications
Developing Security Orchestration, Automation, and Response Applications
VICTOR MAESTRE RAMIREZ
 
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
Simran112433
 
Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...
Pixellion
 
Ch3MCT24.pptx measure of central tendency
Ch3MCT24.pptx measure of central tendencyCh3MCT24.pptx measure of central tendency
Ch3MCT24.pptx measure of central tendency
ayeleasefa2
 
GenAI for Quant Analytics: survey-analytics.ai
GenAI for Quant Analytics: survey-analytics.aiGenAI for Quant Analytics: survey-analytics.ai
GenAI for Quant Analytics: survey-analytics.ai
Inspirient
 
How to join illuminati Agent in uganda call+256776963507/0741506136
How to join illuminati Agent in uganda call+256776963507/0741506136How to join illuminati Agent in uganda call+256776963507/0741506136
How to join illuminati Agent in uganda call+256776963507/0741506136
illuminati Agent uganda call+256776963507/0741506136
 
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdfIAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
mcgardenlevi9
 
VKS-Python-FIe Handling text CSV Binary.pptx
VKS-Python-FIe Handling text CSV Binary.pptxVKS-Python-FIe Handling text CSV Binary.pptx
VKS-Python-FIe Handling text CSV Binary.pptx
Vinod Srivastava
 
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Abodahab
 
LLM finetuning for multiple choice google bert
LLM finetuning for multiple choice google bertLLM finetuning for multiple choice google bert
LLM finetuning for multiple choice google bert
ChadapornK
 
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
Molecular methods diagnostic and monitoring of infection  -  Repaired.pptxMolecular methods diagnostic and monitoring of infection  -  Repaired.pptx
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
7tzn7x5kky
 
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
James Francis Paradigm Asset Management
 
Deloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit contextDeloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit context
Process mining Evangelist
 
Cleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdfCleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdf
alcinialbob1234
 
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTemplate_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
cegiver630
 
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptxPerencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
PareaRusan
 
Minions Want to eat presentacion muy linda
Minions Want to eat presentacion muy lindaMinions Want to eat presentacion muy linda
Minions Want to eat presentacion muy linda
CarlaAndradesSoler1
 
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjks
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjksPpt. Nikhil.pptxnshwuudgcudisisshvehsjks
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjks
panchariyasahil
 
Digilocker under workingProcess Flow.pptx
Digilocker  under workingProcess Flow.pptxDigilocker  under workingProcess Flow.pptx
Digilocker under workingProcess Flow.pptx
satnamsadguru491
 
Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..
yuvarajreddy2002
 
Developing Security Orchestration, Automation, and Response Applications
Developing Security Orchestration, Automation, and Response ApplicationsDeveloping Security Orchestration, Automation, and Response Applications
Developing Security Orchestration, Automation, and Response Applications
VICTOR MAESTRE RAMIREZ
 
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
Simran112433
 
Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...
Pixellion
 
Ch3MCT24.pptx measure of central tendency
Ch3MCT24.pptx measure of central tendencyCh3MCT24.pptx measure of central tendency
Ch3MCT24.pptx measure of central tendency
ayeleasefa2
 
GenAI for Quant Analytics: survey-analytics.ai
GenAI for Quant Analytics: survey-analytics.aiGenAI for Quant Analytics: survey-analytics.ai
GenAI for Quant Analytics: survey-analytics.ai
Inspirient
 
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdfIAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
mcgardenlevi9
 
VKS-Python-FIe Handling text CSV Binary.pptx
VKS-Python-FIe Handling text CSV Binary.pptxVKS-Python-FIe Handling text CSV Binary.pptx
VKS-Python-FIe Handling text CSV Binary.pptx
Vinod Srivastava
 
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Abodahab
 
LLM finetuning for multiple choice google bert
LLM finetuning for multiple choice google bertLLM finetuning for multiple choice google bert
LLM finetuning for multiple choice google bert
ChadapornK
 
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
Molecular methods diagnostic and monitoring of infection  -  Repaired.pptxMolecular methods diagnostic and monitoring of infection  -  Repaired.pptx
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
7tzn7x5kky
 
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
James Francis Paradigm Asset Management
 
Deloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit contextDeloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit context
Process mining Evangelist
 
Cleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdfCleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdf
alcinialbob1234
 
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTemplate_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
Template_A3nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
cegiver630
 
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptxPerencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
PareaRusan
 
Minions Want to eat presentacion muy linda
Minions Want to eat presentacion muy lindaMinions Want to eat presentacion muy linda
Minions Want to eat presentacion muy linda
CarlaAndradesSoler1
 
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjks
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjksPpt. Nikhil.pptxnshwuudgcudisisshvehsjks
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjks
panchariyasahil
 
Digilocker under workingProcess Flow.pptx
Digilocker  under workingProcess Flow.pptxDigilocker  under workingProcess Flow.pptx
Digilocker under workingProcess Flow.pptx
satnamsadguru491
 

Adaptive Replication for Elastic Data Stream Processing

  • 1. An Adaptive Replication Scheme For Elastic Data Stream Processing Thomas Heinze, Mariam Zia, Robert Krahn, Zbigniew Jerzak, Christof Fetzer July 02, 2015
  • 2. © 2015 SAP SE or an SAP affiliate company. All rights reserved. 2InternalPublic Elasticity  Utilization below 30% in most cloud data centers  Users needs to reserve required resources  Limited understanding of the performance of the system  Limited knowledge of characteristics of the workload Workload/ Resources Load Static Provisioning Elastic Provisioning time Underprovisioning Overprovisioning
  • 3. © 2015 SAP SE or an SAP affiliate company. All rights reserved. 3InternalPublic Configuring an Elastic Scaling System  Data Stream Processing highly suited for elasticity due to highly variable load and small state size (e.g. StreamCloud[1] or SEEP[2])  Key challenge: Minimize the overprovisioning and number of SLA violations by optimizing scaling decisions
  • 4. © 2015 SAP SE or an SAP affiliate company. All rights reserved. 4InternalPublic Enabling Fault Tolerance  Elasticity requires horizontal scaling → we need fault tolerance  Two mechanisms: Active Replication vs. Upstream Backup User-defined Threshold Financial 0.0 2.5 5.0 7.5 10.0 12.5 0.0 0.5 1.0 1.5 2.0 Monetary Cost($) RecoveryTime(insec.) Active Upstream
  • 5. © 2015 SAP SE or an SAP affiliate company. All rights reserved. 5InternalPublic Outline 1. Introduction 2. An Adaptive Replication Scheme 3. Evaluation 4. Conclusion and Future Work
  • 6. © 2015 SAP SE or an SAP affiliate company. All rights reserved. 6InternalPublic Related Work a) Improving Upstream Backup:  Sweeping checkpoints, …  Faster Recovery by using Micro Batch Processing (D-Stream [3], TimeStream [4])  But: no user-configurable recovery time threshold b) Combination of both mechanisms:  Already proposed for Borealis by Hwang et al. [5]  Static Optimizer proposed by Updahyaya et al. [6]  Dynamic switching to handle overload/fault case by Martin et al. [7]/ Zhang et al.[8]  But: static or without user-configurable recovery time threshold
  • 7. © 2015 SAP SE or an SAP affiliate company. All rights reserved. 7InternalPublic Adaptive Replication Scheme  Dynamically switch between upstream backup and active replication during runtime  Replication Scheme describes current replication mode for all operators Active Replication Upstream BackupUpstream Backup process process passive reserved Switch Roles Switch Instance 2 Switch Instance 1 process process reserved Key Questions: 1) When we need to switch replication mode? → Estimation Model for Upstream Backup 2) How to integrate with our elastic scaling system? → Adaption Algorithm
  • 8. © 2015 SAP SE or an SAP affiliate company. All rights reserved. 8InternalPublic Recovery Time Estimation  Many factors influence the recovery time:  Operator type and Checkpointing time (static)  State Size and Queue Length (changing with the current workload)  Our solution: estimation based on historical samples  Accurancy: 0.3 sec. error for 10 sec. recovery time (sample size: 1000) Clustering EstimationHistorical Samples Clustered Samples Estimated Recovery Time Current workload characteristics
  • 9. © 2015 SAP SE or an SAP affiliate company. All rights reserved. 9InternalPublic Single Operator Scenario  Observe current state size and queue length of all operators  Adapt replication scheme if user threshold is not met t Operating interval Recovery Time Threshold Estimated Recovery Time Estimated Recovery Time 1 2Active Replication Upstream Backup
  • 10. © 2015 SAP SE or an SAP affiliate company. All rights reserved. 10InternalPublic Integration with Elastic Scaling System  Architecture of an elastic scaling system  Process many queries in parallel  Places operators on a varying number of hosts based CPU + network consumption  Scaling requires moving operators between hosts  Integration  Replication-aware operator placement  System recovery time = max(recovery time per query)  Monitor the recovery time for the crash of host h
  • 11. © 2015 SAP SE or an SAP affiliate company. All rights reserved. 11InternalPublic Example: Multi Query Scenario System Recovery Time: max(trec(q1) , trec(q2)) F1 A1S A1‘F1‘ q1: trec(q1)= max(trec(F1), trec(A1) , trec(D1)) D1 D1‘ F2 A1S A2‘F2‘ q2: trec(q2)= max(trec(F2), trec(A2) , trec(D2)) D1 D1‘
  • 12. © 2015 SAP SE or an SAP affiliate company. All rights reserved. 12InternalPublic Example: Operator Placement F1 A1S A1‘F1‘ D1 D1‘ F2 A2S A2‘F2‘ q2: D2 D2‘ Placement: Host 1 F1 F2 Host 2 A2 D1‘ F1‘ Host 4 A2‘ D2 F1‘ A1 q1: Host 3 D2‘ D1 A1‘ Recovery Time (max): trec(F1), trec(F2), trec(A1) trec(A2) trec(D1) trec(A2), trec(D2)
  • 13. © 2015 SAP SE or an SAP affiliate company. All rights reserved. 13InternalPublic Example: Too High Recovery Time F1 A1S A1‘F1‘ D1 D1‘ F2 A2S A2‘F2‘ q2: D2 D2‘ Placement: Host 1 F1 F2 Host 2 A2 D1‘ F1‘ Host 4 A2‘ D2 F1‘ A1 trec(F1), trec(F2), trec(A1) trec(A2) trec(D1) q1: Host 3 D2‘ D1 A1‘ trec(A2), trec(D2)Recovery Time (max):
  • 14. © 2015 SAP SE or an SAP affiliate company. All rights reserved. 14InternalPublic Example: Too High Recovery Time F1 A1S A1‘F1‘ D1 D1‘ F2 A2S A2‘F2‘ q2: D2 D2‘ Placement: Host 1 F1 F2 Host 2 A2 D1‘ F1‘ Host 4 A2‘ D2 F1‘ A1 trec(F1), trec(F2), trec(A1) trec(A2), trec (D1‘) trec(D1), trec(A1) q1: Host 3 D2‘ D1 A1‘ trec(A2), trec(D2)Recovery Time (max):
  • 15. © 2015 SAP SE or an SAP affiliate company. All rights reserved. 15InternalPublic Example: Too Low Recovery Time F1 A1S A1‘F1‘ D1 D1‘ F2 A2S A2‘F2‘ q2: D2 D2‘ Placement: Host 1 F1 F2 Host 2 A2 D1‘ F1‘ Host 4 A2‘ D2 F1‘ A1 trec(F1), trec(F2), trec(A1) trec(A2), trec (D1‘) trec(D1), trec(A1) q1: Host 3 D2‘ D1 A1‘ trec(A2), trec(D2)Recovery Time (max):
  • 16. © 2015 SAP SE or an SAP affiliate company. All rights reserved. 16InternalPublic Example: Too Low Recovery Time F1 A1S A1‘F1‘ D1 D1‘ F2 A2S A2‘F2‘ q2: D2 D2‘ Placement: Host 1 F1 F2 Host 2 A2 D1‘ F1‘ Host 4 A2‘ D2 F1‘ A1 trec(F1), trec(F2), trec(A1) trec(A2), trec (D1‘) trec(D1), trec(A1) q1: Host 3 D2‘ D1 A1‘ trec(D2)Recovery Time (max):
  • 18. © 2015 SAP SE or an SAP affiliate company. All rights reserved. 18InternalPublic Setup  Private cloud environment with up to 12 hosts  Three Workloads: Financial, Twitter, Energy Sensors  Measure characteristics like CPU load, latency, etc. in 10 seconds intervals  20 crashes of a random host (immediately trigger recovery process)  Recovery Time measured as maximal latency peak observed after a host crash  Two baseline algorithms: Active Replication and Upstream Backup
  • 19. © 2015 SAP SE or an SAP affiliate company. All rights reserved. 19InternalPublic Recovery Time For Different Thresholds
  • 20. © 2015 SAP SE or an SAP affiliate company. All rights reserved. 20InternalPublic Adaptive Replication Scheme
  • 21. © 2015 SAP SE or an SAP affiliate company. All rights reserved. 21InternalPublic Summary  Active replication/upstream backup forces a hard trade-off between resource overhead and recovery time  Our adaptive replication scheme allows to customize trade-off based on user configuration Future work  Formalize approach for replication degree >2  Network-bound workloads  Replication Placement
  • 22. © 2015 SAP SE or an SAP affiliate company. All rights reserved. Thank you Contact information: Thomas Heinze Research Associate [email protected]