SlideShare a Scribd company logo
Learning Accurate Business Process Simulation
Models from Event Logs via Automated Process
Discovery and Deep Learning
Manuel Camargo
Marlon Dumas
Oscar González-Rojas
z
2
Process Credit Card
Accept Cash or Check
Identify payment method
18 / 8 min
20 / 5 min
5 / 2 min
Prepare package for customer
10 / 5 min
Cycle time
Processing / Waiting times
Costs x activity x instance x resource …
Resource utilization
1h
5 min
38 min
10 min
How to assess the impact of a
business process change on temporal
& cost measures?
BUSINESS PROCESS SIMULATION
z
DESIGNING BUSINESS PROCESS SIMULATION (BPS) MODELS
3
• Interviews
• Expert knowledge
• Observations
• Sampling
Time consuming (hard to tune)
Execution paths left aside
Identify
payment
method
Prepare
package for
customer
Accept
Cash or
Check
X
Process
Credit
Card
X
Payment
Method?
Process
Virtual
Currencies
X
Credit Card
Accepted?
Prone to human
errors
Unrealistic parameters
z
Data to the Rescue!
EnterpriseSystem
(CRM, ERP,…)
Event Log
Simulation
Model
z
PROBLEM STATEMENT
5
How to automatically create accurate business process
simulation models based on data extracted from enterprise
information systems?
z
OBSERVATION
6
A generative model of business processes is a statistical
model constructed from an event log that can generate
traces that resemble those observed in the log and other
traces of the process (not observed in the log).
A Business Process Simulation (BPS) model is a generative
model of a business process.
z
LEARNING GENERATIVE BUSINESS PROCESS MODELS
7
Data-Driven Discrete Event
Simulation
• Use process mining &
data mining techniques to
discover branching
probabilities, resource
pools, resource calendars,
etc.
Deep Learning
• Use deep learning
methods to discover a
neural network (e.g. an
LSTM network) to predict
successive events and to
generate event sequences
z
8
Data-Driven Simulation
• May take as input a process specification (helps with
interpretability)
• Requires specifications of resource constraints
• Models the case creation process via a probability
distribution
• Assumes undifferentiated resources with robotic
behavior
• Models resource availability as calendars (possibly
discovered from historical data)
• Relies on branching probabilities to local conditional
choice
• Provides a natural mechanism for capturing the
effect of changes to the process
Deep Learning (DL) Sequence Generation Methods
• No interpretable process specification
• Does not explicitly consider resource constraints
• Learns the case arrival process from data
• May capture differentiated resources and robotic
behavior
• Captures resource availability via non-linear
functions
• Branching behavior modeled via neural networks
 may capture complex relations
• Does not have a mechanism for capturing the
effect of changes to the process
LEARNING GENERATIVE BUSINESS PROCESS MODELS
M. Camargo, M. Dumas, O. González Rojas: Discovering generative models from
event logs: data-driven simulation vs deep learning. PeerJ Computer Science vol.
7, e577, 2021
z
HYPOTHESIS
9
By combining data-driven simulation and deep learning, we can
learn generative business process models that are:
1. more accurate than those we can learn by using these methods
in isolation
2. Suitable both for “as is” and for “what if” analysis use cases
z
DeepSimulator: HYBRID LEARNING OF BUSINESS PROCESS SIMULATION MODELS
10
{T1 -> T2 -> T3}
{T1 -> T3 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T2}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T3 -> T2}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
Event log
Phase 1
• Stochastic process
model Discovery &
optimization
• Generation of activity
sequences
Activity sequence
generation
Accuracy assessment
Phase 2
• Time-series analysis &
optimization
• Enrichment of traces
with start-times
Case start-times
generation
Phase 3
• Deep-learning models
training & optimization
• Enrichment of traces with
timestamps
Activity timestamps
generation
z
11
PHASE 1 - ACTIVITY SEQUENCE GENERATION
A1 A2 A3 A5 A6
A4
Ꝺ1:
A2 A3 A4 A5
Ꝺ2:
• Control-flow discovery (BPMN model):
• Split Miner algorithm
• Discover the branching probabilities:
I. Assign equal values to each
conditional branch
II. Compute the branching probabilities
by replaying the aligned event against
the
process model
DeepSimulator: HYBRID LEARNING OF BUSINESS PROCESS SIMULATION MODELS
z
12
PHASE 2 - CASE START-TIMES GENERATION
A1 A2 A3 A5 A6
A4
A2 A3 A4 A5
Time series decomposition (trend, seasonality, and holidays):
y(t) = g(t) + s(t) + h(t) + Ɛt
Trend g(t): models nonperiodic changes in the value of the time
series: Saturating growth model (logistic growth model)
Seasonality s(t): represents periodic changes (e.g., weekly and
yearly seasonality): Fourier series
Holidays h(t): represents the effects of holidays that occur on
potentially irregular schedules over one or more days: Introduced
by the analyst
Error Ɛt: idiosyncratic changes which are not accommodated by
the model
DeepSimulator: HYBRID LEARNING OF BUSINESS PROCESS SIMULATION MODELS
z
13
PHASE 3 - ACTIVITY TIMESTAMPS GENERATION
e1- start e1- complete
e2- start
𝜎1 Ac1
Ac2
e2- complete
Waiting time predictive model
Features: Wait+Ac2+Cx+WIP+RO
Processing time predictive model
Features: Proc+Ac1+Cx+WIP+RO
A1 A2 A3 A5 A6
A4
A2 A3 A4 A5
DeepSimulator: HYBRID LEARNING OF BUSINESS PROCESS SIMULATION MODELS
z
Size Source log #Traces #Events #Act. Avg. activities
per trace
Avg. duration Max. duration Description
LARGE
R POC 70512 415261 8 5.89 15.21 days 269.23 days Undisclosed banking process*
R BPI17W 30276 240854 8 7.96 12.66 days 286.07 days Dutch financial institution updated
R BPI12W 8616 59302 6 6.88 8.91 days 85.87 days Dutch financial institution
S CVS 10000 103906 15 10.39 7.58 days 21.0 days CVS retail pharmacy**
S CFM 1670 44373 29 26.57 0.76 days 5.83 days Anonymized confidential process**
SMALL
R INS 1182 23141 9 19.58 70.93 days 599.9 days Insurance claims process*
R ACR 954 4962 16 5.2 14.89 days 135.84 days Academic Credential Recognition
R MP 225 4503 24 20.01 20.63 days 87.5 days Manufacturing Production
S CFS 800 21221 29 26.53 0.83 days 4.09 days Anonymized confidential process**
S P2P 608 9119 21 15 21.46 days 108.31 days Purchase-to-Pay process
DATASETS (EVENT LOGS)
14
(*) Private logs, (**) Generated from simulation models of real processes
z
EXP1 - AS-IS ANALYSIS USING DEEPSIMULATOR
15
Partition 2
(30%)
Testing
Partition
1
(70%) Training
Testing
Deep Learning
Trainer
BEST DL MODEL
Trace generator
Time
splitting
Evaluator
ELS/DL/MAE/EMD
Simod
Parameter extraction
BEST SIM MODEL
Simulator
Training
(80%)
Validation
(20%)
Event-log 1
2
3
DeepSimulator
BEST DEEP SIM
MODEL
Deep Simulator
z
EXP1 – EVALUATION RESULTS
16
DeepSimulator generally outperforms classical DDS w.r.t. temporal measures
z
EXP2 - WHAT-IF ANALYSIS (ARRIVAL INTENSITY)
17
Scenario 1
Batch 1 Batch 2 Batch 3 Batch 4 Batch 5 Batch 6
BPI17W
BPI12W
CVS
Version1 : Demand modified (D)
Version2 : Demand + waiting times Modified (TD)
z
EXP2 - WHAT-IF ANALYSIS (ADDING A NEVER-BEFORE-OBSERVED ACTIVITY)
18
Remove
activity
Train DSIM
baseline model
List of changes
Update models
Update
embeddings
Replace embeddings of
generative models
Evaluator
MAE/RMSE/SMAPE
Generate log Generate log
Simulation
model
Train DSIM modified
model
BASELINE MODEL UPDATED MODEL
Simulation
model modified
log
Modified log
Partition 2 (30%)
Testing
Partition
1
(70%)
Time
splitting
Training (80%)
Validation (20%)
Event-log
Scenario 2
z
EXP2 - EVALUATION RESULTS
19
• DeepSimulator can better estimate the impact of changes in the demand in settings where such
changes have been previously observed in the data.
• The accuracy of DeepSimulator degraded when evaluated in a previously unobserved scenario (a
new activity is added to the process)
SIMOD DSIM SIMOD DSIM SIMOD DSIM
Version 1
BPI17W 971151 417572 0.02222 0.03593 3185 3647
BPI12W 660211 534341 0.11295 0.04853 515 458
CVS 1489252 467572 0.03213 0.00001 3380 849
Version 2
BPI17W 895524 290980 0.06438 0.03218 4528 3431
BPT12W 550266 524995 0.25888 0.22003 726 507
CVS 540112 246159 0.15674 0.05708 2453 1967
AS-IS WHAT-IF AS-IS WHAT-IF AS-IS WHAT-IF
CFM 7155 17546 22006 33137 0.15629 0.28762
CVS 283061 1040344 357717 1052255 0.31972 1.84601
Log
MAE RMSE SMAPE
Scenario
1
Scenario
2 Log
MAE EMD DTW
z
CONCLUSION
20
The DeepSimulator method combines data-driven simulation to capture the control-flow
perspective of a process with deep learning techniques to capture the temporal perspective.
The evaluation in the AS-IS setting shows that DeepSimulator outperforms a pure Data-Driven
Discrete Event Simulation method and a pure Deep Learning method.
The evaluation on WHAT-IF analysis scenarios shows that DeepSimulator can better
estimate the impact of changes on the arrival rate of cases (the demand) in settings
where such changes have been previously observed in the data.
However, the accuracy of DeepSimulator degrades when applied to a previously
unobserved scenario, specifically a scenario where a completely new activity is added to
the process.
z
FUTURE WORK
21
Explore other mechanisms for modeling the activities of the
process via embeddings (e.g., word2vec, transformer models).
Generate events that include resource and domain-specific
attributes.
Support a broader range of changes, such as changes in the
resource perspective
QUESTIONS?
22
Ph.D Manuel Camargo
University of Tartu
Universidad de los Andes
manuel.camargo@ut.ee
Ad

More Related Content

What's hot (20)

Introduction to Data Management
Introduction to Data ManagementIntroduction to Data Management
Introduction to Data Management
Amanda Whitmire
 
Process Mining - Chapter 9 - Operational Support
Process Mining - Chapter 9 - Operational SupportProcess Mining - Chapter 9 - Operational Support
Process Mining - Chapter 9 - Operational Support
Wil van der Aalst
 
Data-Driven Analysis of Batch Processing Inefficiencies in Business Processes
Data-Driven Analysis of  Batch Processing Inefficiencies  in Business ProcessesData-Driven Analysis of  Batch Processing Inefficiencies  in Business Processes
Data-Driven Analysis of Batch Processing Inefficiencies in Business Processes
Marlon Dumas
 
ITIL Demand Management: why August is a bad time for a presentation
ITIL Demand Management: why August is a bad time for a presentationITIL Demand Management: why August is a bad time for a presentation
ITIL Demand Management: why August is a bad time for a presentation
HigherEdITMgt
 
Introduction to Machine Learning with Azure & Databricks
Introduction to Machine Learning with Azure & DatabricksIntroduction to Machine Learning with Azure & Databricks
Introduction to Machine Learning with Azure & Databricks
CCG
 
Process Mining - Chapter 1 - Introduction
Process Mining - Chapter 1 - IntroductionProcess Mining - Chapter 1 - Introduction
Process Mining - Chapter 1 - Introduction
Wil van der Aalst
 
Types of Mathematical Model.
Types of Mathematical Model.Types of Mathematical Model.
Types of Mathematical Model.
Megha Sharma
 
Big Query Basics
Big Query BasicsBig Query Basics
Big Query Basics
Ido Green
 
Dimensionality Reduction
Dimensionality ReductionDimensionality Reduction
Dimensionality Reduction
mrizwan969
 
Active Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationActive Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with Alation
Databricks
 
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data Architecture
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data ArchitectureADV Slides: Strategies for Fitting a Data Lake into a Modern Data Architecture
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data Architecture
DATAVERSITY
 
Data Visualization in Exploratory Data Analysis
Data Visualization in Exploratory Data AnalysisData Visualization in Exploratory Data Analysis
Data Visualization in Exploratory Data Analysis
Eva Durall
 
Data visualization
Data visualizationData visualization
Data visualization
Jan Willem Tulp
 
Monitoring AI with AI
Monitoring AI with AIMonitoring AI with AI
Monitoring AI with AI
Stepan Pushkarev
 
Desicion tree and neural networks
Desicion tree and neural networksDesicion tree and neural networks
Desicion tree and neural networks
jaskarankaur21
 
Presentation on K-Means Clustering
Presentation on K-Means ClusteringPresentation on K-Means Clustering
Presentation on K-Means Clustering
Pabna University of Science & Technology
 
Data visualization
Data visualizationData visualization
Data visualization
Moushmi Dasgupta
 
Python Seaborn Data Visualization
Python Seaborn Data Visualization Python Seaborn Data Visualization
Python Seaborn Data Visualization
Sourabh Sahu
 
Apache Flink: Real-World Use Cases for Streaming Analytics
Apache Flink: Real-World Use Cases for Streaming AnalyticsApache Flink: Real-World Use Cases for Streaming Analytics
Apache Flink: Real-World Use Cases for Streaming Analytics
Slim Baltagi
 
Ai and machine learning help detect, predict and prevent fraud - IBM Watson ...
Ai and machine learning help detect, predict and prevent fraud -  IBM Watson ...Ai and machine learning help detect, predict and prevent fraud -  IBM Watson ...
Ai and machine learning help detect, predict and prevent fraud - IBM Watson ...
Institute of Contemporary Sciences
 
Introduction to Data Management
Introduction to Data ManagementIntroduction to Data Management
Introduction to Data Management
Amanda Whitmire
 
Process Mining - Chapter 9 - Operational Support
Process Mining - Chapter 9 - Operational SupportProcess Mining - Chapter 9 - Operational Support
Process Mining - Chapter 9 - Operational Support
Wil van der Aalst
 
Data-Driven Analysis of Batch Processing Inefficiencies in Business Processes
Data-Driven Analysis of  Batch Processing Inefficiencies  in Business ProcessesData-Driven Analysis of  Batch Processing Inefficiencies  in Business Processes
Data-Driven Analysis of Batch Processing Inefficiencies in Business Processes
Marlon Dumas
 
ITIL Demand Management: why August is a bad time for a presentation
ITIL Demand Management: why August is a bad time for a presentationITIL Demand Management: why August is a bad time for a presentation
ITIL Demand Management: why August is a bad time for a presentation
HigherEdITMgt
 
Introduction to Machine Learning with Azure & Databricks
Introduction to Machine Learning with Azure & DatabricksIntroduction to Machine Learning with Azure & Databricks
Introduction to Machine Learning with Azure & Databricks
CCG
 
Process Mining - Chapter 1 - Introduction
Process Mining - Chapter 1 - IntroductionProcess Mining - Chapter 1 - Introduction
Process Mining - Chapter 1 - Introduction
Wil van der Aalst
 
Types of Mathematical Model.
Types of Mathematical Model.Types of Mathematical Model.
Types of Mathematical Model.
Megha Sharma
 
Big Query Basics
Big Query BasicsBig Query Basics
Big Query Basics
Ido Green
 
Dimensionality Reduction
Dimensionality ReductionDimensionality Reduction
Dimensionality Reduction
mrizwan969
 
Active Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationActive Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with Alation
Databricks
 
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data Architecture
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data ArchitectureADV Slides: Strategies for Fitting a Data Lake into a Modern Data Architecture
ADV Slides: Strategies for Fitting a Data Lake into a Modern Data Architecture
DATAVERSITY
 
Data Visualization in Exploratory Data Analysis
Data Visualization in Exploratory Data AnalysisData Visualization in Exploratory Data Analysis
Data Visualization in Exploratory Data Analysis
Eva Durall
 
Desicion tree and neural networks
Desicion tree and neural networksDesicion tree and neural networks
Desicion tree and neural networks
jaskarankaur21
 
Python Seaborn Data Visualization
Python Seaborn Data Visualization Python Seaborn Data Visualization
Python Seaborn Data Visualization
Sourabh Sahu
 
Apache Flink: Real-World Use Cases for Streaming Analytics
Apache Flink: Real-World Use Cases for Streaming AnalyticsApache Flink: Real-World Use Cases for Streaming Analytics
Apache Flink: Real-World Use Cases for Streaming Analytics
Slim Baltagi
 
Ai and machine learning help detect, predict and prevent fraud - IBM Watson ...
Ai and machine learning help detect, predict and prevent fraud -  IBM Watson ...Ai and machine learning help detect, predict and prevent fraud -  IBM Watson ...
Ai and machine learning help detect, predict and prevent fraud - IBM Watson ...
Institute of Contemporary Sciences
 

Similar to Learning Accurate Business Process Simulation Models from Event Logs via Automated Process Discovery and Deep Learning (20)

Business Process Analytics: From Insights to Predictions
Business Process Analytics: From Insights to PredictionsBusiness Process Analytics: From Insights to Predictions
Business Process Analytics: From Insights to Predictions
Marlon Dumas
 
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?
Marlon Dumas
 
Process Mining and Predictive Process Monitoring
Process Mining and Predictive Process MonitoringProcess Mining and Predictive Process Monitoring
Process Mining and Predictive Process Monitoring
Marlon Dumas
 
Six sigma technique
Six sigma techniqueSix sigma technique
Six sigma technique
ShwetaPatil174
 
Apromore: Advanced Business Process Analytics on the Cloud
Apromore: Advanced Business Process Analytics on the CloudApromore: Advanced Business Process Analytics on the Cloud
Apromore: Advanced Business Process Analytics on the Cloud
Marlon Dumas
 
Winter Simulation Conference 2021 - Process Wind Tunnel Talk
Winter Simulation Conference 2021 - Process Wind Tunnel TalkWinter Simulation Conference 2021 - Process Wind Tunnel Talk
Winter Simulation Conference 2021 - Process Wind Tunnel Talk
Sudhendu Rai
 
CamundaCon 2018: Profile Print and Explorative Data Mining (Duni, Viadee)
CamundaCon 2018: Profile Print and Explorative Data Mining (Duni, Viadee)CamundaCon 2018: Profile Print and Explorative Data Mining (Duni, Viadee)
CamundaCon 2018: Profile Print and Explorative Data Mining (Duni, Viadee)
camunda services GmbH
 
Six Sigma
Six SigmaSix Sigma
Six Sigma
Tushar Makvana
 
Process wind tunnel - A novel capability for data-driven business process imp...
Process wind tunnel - A novel capability for data-driven business process imp...Process wind tunnel - A novel capability for data-driven business process imp...
Process wind tunnel - A novel capability for data-driven business process imp...
Sudhendu Rai
 
Discovery and Simulation of Data-Aware Business Processes
Discovery and Simulation of Data-Aware Business ProcessesDiscovery and Simulation of Data-Aware Business Processes
Discovery and Simulation of Data-Aware Business Processes
Orlenys López Pintado
 
Mtc strategy-briefing-houston-pd m-05212018-3
Mtc strategy-briefing-houston-pd m-05212018-3Mtc strategy-briefing-houston-pd m-05212018-3
Mtc strategy-briefing-houston-pd m-05212018-3
Dania Kodeih
 
itSMF Presentation March 2009
itSMF Presentation March 2009itSMF Presentation March 2009
itSMF Presentation March 2009
jdmoore
 
6sigma
6sigma6sigma
6sigma
Yasser Hamed
 
Sixsigmaintrorerevisedii 13198308548378-phpapp02-111028144148-phpapp02
Sixsigmaintrorerevisedii 13198308548378-phpapp02-111028144148-phpapp02Sixsigmaintrorerevisedii 13198308548378-phpapp02-111028144148-phpapp02
Sixsigmaintrorerevisedii 13198308548378-phpapp02-111028144148-phpapp02
satyajit pandey
 
AI for Business Process Management
AI for Business Process ManagementAI for Business Process Management
AI for Business Process Management
Marlon Dumas
 
Adm graphics-2003
Adm graphics-2003Adm graphics-2003
Adm graphics-2003
John B. Cook, PE, CEO
 
Liquid process model collections
Liquid process model collectionsLiquid process model collections
Liquid process model collections
Marcello La Rosa
 
[Tutorial] building machine learning models for predictive maintenance applic...
[Tutorial] building machine learning models for predictive maintenance applic...[Tutorial] building machine learning models for predictive maintenance applic...
[Tutorial] building machine learning models for predictive maintenance applic...
PAPIs.io
 
Six Sigma Introduction
Six Sigma IntroductionSix Sigma Introduction
Six Sigma Introduction
Abhishek Kumar
 
AI for Business Process Management
AI for Business Process ManagementAI for Business Process Management
AI for Business Process Management
Marlon Dumas
 
Business Process Analytics: From Insights to Predictions
Business Process Analytics: From Insights to PredictionsBusiness Process Analytics: From Insights to Predictions
Business Process Analytics: From Insights to Predictions
Marlon Dumas
 
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?
Marlon Dumas
 
Process Mining and Predictive Process Monitoring
Process Mining and Predictive Process MonitoringProcess Mining and Predictive Process Monitoring
Process Mining and Predictive Process Monitoring
Marlon Dumas
 
Apromore: Advanced Business Process Analytics on the Cloud
Apromore: Advanced Business Process Analytics on the CloudApromore: Advanced Business Process Analytics on the Cloud
Apromore: Advanced Business Process Analytics on the Cloud
Marlon Dumas
 
Winter Simulation Conference 2021 - Process Wind Tunnel Talk
Winter Simulation Conference 2021 - Process Wind Tunnel TalkWinter Simulation Conference 2021 - Process Wind Tunnel Talk
Winter Simulation Conference 2021 - Process Wind Tunnel Talk
Sudhendu Rai
 
CamundaCon 2018: Profile Print and Explorative Data Mining (Duni, Viadee)
CamundaCon 2018: Profile Print and Explorative Data Mining (Duni, Viadee)CamundaCon 2018: Profile Print and Explorative Data Mining (Duni, Viadee)
CamundaCon 2018: Profile Print and Explorative Data Mining (Duni, Viadee)
camunda services GmbH
 
Process wind tunnel - A novel capability for data-driven business process imp...
Process wind tunnel - A novel capability for data-driven business process imp...Process wind tunnel - A novel capability for data-driven business process imp...
Process wind tunnel - A novel capability for data-driven business process imp...
Sudhendu Rai
 
Discovery and Simulation of Data-Aware Business Processes
Discovery and Simulation of Data-Aware Business ProcessesDiscovery and Simulation of Data-Aware Business Processes
Discovery and Simulation of Data-Aware Business Processes
Orlenys López Pintado
 
Mtc strategy-briefing-houston-pd m-05212018-3
Mtc strategy-briefing-houston-pd m-05212018-3Mtc strategy-briefing-houston-pd m-05212018-3
Mtc strategy-briefing-houston-pd m-05212018-3
Dania Kodeih
 
itSMF Presentation March 2009
itSMF Presentation March 2009itSMF Presentation March 2009
itSMF Presentation March 2009
jdmoore
 
Sixsigmaintrorerevisedii 13198308548378-phpapp02-111028144148-phpapp02
Sixsigmaintrorerevisedii 13198308548378-phpapp02-111028144148-phpapp02Sixsigmaintrorerevisedii 13198308548378-phpapp02-111028144148-phpapp02
Sixsigmaintrorerevisedii 13198308548378-phpapp02-111028144148-phpapp02
satyajit pandey
 
AI for Business Process Management
AI for Business Process ManagementAI for Business Process Management
AI for Business Process Management
Marlon Dumas
 
Liquid process model collections
Liquid process model collectionsLiquid process model collections
Liquid process model collections
Marcello La Rosa
 
[Tutorial] building machine learning models for predictive maintenance applic...
[Tutorial] building machine learning models for predictive maintenance applic...[Tutorial] building machine learning models for predictive maintenance applic...
[Tutorial] building machine learning models for predictive maintenance applic...
PAPIs.io
 
Six Sigma Introduction
Six Sigma IntroductionSix Sigma Introduction
Six Sigma Introduction
Abhishek Kumar
 
AI for Business Process Management
AI for Business Process ManagementAI for Business Process Management
AI for Business Process Management
Marlon Dumas
 
Ad

More from Marlon Dumas (20)

LLM-Assisted Optimization of Waiting Time in Business Processes: A Prompting ...
LLM-Assisted Optimization of Waiting Time in Business Processes: A Prompting ...LLM-Assisted Optimization of Waiting Time in Business Processes: A Prompting ...
LLM-Assisted Optimization of Waiting Time in Business Processes: A Prompting ...
Marlon Dumas
 
Explanatory Capabilities of Large Language Models in Prescriptive Process Mon...
Explanatory Capabilities of Large Language Models in Prescriptive Process Mon...Explanatory Capabilities of Large Language Models in Prescriptive Process Mon...
Explanatory Capabilities of Large Language Models in Prescriptive Process Mon...
Marlon Dumas
 
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
Marlon Dumas
 
How GenAI will (not) change your business?
How GenAI will (not)  change your business?How GenAI will (not)  change your business?
How GenAI will (not) change your business?
Marlon Dumas
 
Discovery and Simulation of Business Processes with Probabilistic Resource Av...
Discovery and Simulation of Business Processes with Probabilistic Resource Av...Discovery and Simulation of Business Processes with Probabilistic Resource Av...
Discovery and Simulation of Business Processes with Probabilistic Resource Av...
Marlon Dumas
 
Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...
Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...
Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...
Marlon Dumas
 
Business Process Optimization: Status and Perspectives
Business Process Optimization: Status and PerspectivesBusiness Process Optimization: Status and Perspectives
Business Process Optimization: Status and Perspectives
Marlon Dumas
 
Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...
Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...
Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...
Marlon Dumas
 
Why am I Waiting Data-Driven Analysis of Waiting Times in Business Processes
Why am I Waiting Data-Driven Analysis of Waiting Times in Business ProcessesWhy am I Waiting Data-Driven Analysis of Waiting Times in Business Processes
Why am I Waiting Data-Driven Analysis of Waiting Times in Business Processes
Marlon Dumas
 
Augmented Business Process Management
Augmented Business Process ManagementAugmented Business Process Management
Augmented Business Process Management
Marlon Dumas
 
Process Mining and Data-Driven Process Simulation
Process Mining and Data-Driven Process SimulationProcess Mining and Data-Driven Process Simulation
Process Mining and Data-Driven Process Simulation
Marlon Dumas
 
Modeling Extraneous Activity Delays in Business Process Simulation
Modeling Extraneous Activity Delays in Business Process SimulationModeling Extraneous Activity Delays in Business Process Simulation
Modeling Extraneous Activity Delays in Business Process Simulation
Marlon Dumas
 
Business Process Simulation with Differentiated Resources: Does it Make a Dif...
Business Process Simulation with Differentiated Resources: Does it Make a Dif...Business Process Simulation with Differentiated Resources: Does it Make a Dif...
Business Process Simulation with Differentiated Resources: Does it Make a Dif...
Marlon Dumas
 
Prescriptive Process Monitoring Under Uncertainty and Resource Constraints
Prescriptive Process Monitoring Under Uncertainty and Resource ConstraintsPrescriptive Process Monitoring Under Uncertainty and Resource Constraints
Prescriptive Process Monitoring Under Uncertainty and Resource Constraints
Marlon Dumas
 
Robotic Process Mining
Robotic Process MiningRobotic Process Mining
Robotic Process Mining
Marlon Dumas
 
Process Mining: A Guide for Practitioners
Process Mining: A Guide for PractitionersProcess Mining: A Guide for Practitioners
Process Mining: A Guide for Practitioners
Marlon Dumas
 
Process Mining for Process Improvement.pptx
Process Mining for Process Improvement.pptxProcess Mining for Process Improvement.pptx
Process Mining for Process Improvement.pptx
Marlon Dumas
 
Optimización de procesos basada en datos
Optimización de procesos basada en datosOptimización de procesos basada en datos
Optimización de procesos basada en datos
Marlon Dumas
 
Process Mining and AI for Continuous Process Improvement
Process Mining and AI for Continuous Process ImprovementProcess Mining and AI for Continuous Process Improvement
Process Mining and AI for Continuous Process Improvement
Marlon Dumas
 
Prescriptive Process Monitoring for Cost-Aware Cycle Time Reduction
Prescriptive Process Monitoring for Cost-Aware Cycle Time ReductionPrescriptive Process Monitoring for Cost-Aware Cycle Time Reduction
Prescriptive Process Monitoring for Cost-Aware Cycle Time Reduction
Marlon Dumas
 
LLM-Assisted Optimization of Waiting Time in Business Processes: A Prompting ...
LLM-Assisted Optimization of Waiting Time in Business Processes: A Prompting ...LLM-Assisted Optimization of Waiting Time in Business Processes: A Prompting ...
LLM-Assisted Optimization of Waiting Time in Business Processes: A Prompting ...
Marlon Dumas
 
Explanatory Capabilities of Large Language Models in Prescriptive Process Mon...
Explanatory Capabilities of Large Language Models in Prescriptive Process Mon...Explanatory Capabilities of Large Language Models in Prescriptive Process Mon...
Explanatory Capabilities of Large Language Models in Prescriptive Process Mon...
Marlon Dumas
 
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr...
Marlon Dumas
 
How GenAI will (not) change your business?
How GenAI will (not)  change your business?How GenAI will (not)  change your business?
How GenAI will (not) change your business?
Marlon Dumas
 
Discovery and Simulation of Business Processes with Probabilistic Resource Av...
Discovery and Simulation of Business Processes with Probabilistic Resource Av...Discovery and Simulation of Business Processes with Probabilistic Resource Av...
Discovery and Simulation of Business Processes with Probabilistic Resource Av...
Marlon Dumas
 
Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...
Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...
Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...
Marlon Dumas
 
Business Process Optimization: Status and Perspectives
Business Process Optimization: Status and PerspectivesBusiness Process Optimization: Status and Perspectives
Business Process Optimization: Status and Perspectives
Marlon Dumas
 
Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...
Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...
Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...
Marlon Dumas
 
Why am I Waiting Data-Driven Analysis of Waiting Times in Business Processes
Why am I Waiting Data-Driven Analysis of Waiting Times in Business ProcessesWhy am I Waiting Data-Driven Analysis of Waiting Times in Business Processes
Why am I Waiting Data-Driven Analysis of Waiting Times in Business Processes
Marlon Dumas
 
Augmented Business Process Management
Augmented Business Process ManagementAugmented Business Process Management
Augmented Business Process Management
Marlon Dumas
 
Process Mining and Data-Driven Process Simulation
Process Mining and Data-Driven Process SimulationProcess Mining and Data-Driven Process Simulation
Process Mining and Data-Driven Process Simulation
Marlon Dumas
 
Modeling Extraneous Activity Delays in Business Process Simulation
Modeling Extraneous Activity Delays in Business Process SimulationModeling Extraneous Activity Delays in Business Process Simulation
Modeling Extraneous Activity Delays in Business Process Simulation
Marlon Dumas
 
Business Process Simulation with Differentiated Resources: Does it Make a Dif...
Business Process Simulation with Differentiated Resources: Does it Make a Dif...Business Process Simulation with Differentiated Resources: Does it Make a Dif...
Business Process Simulation with Differentiated Resources: Does it Make a Dif...
Marlon Dumas
 
Prescriptive Process Monitoring Under Uncertainty and Resource Constraints
Prescriptive Process Monitoring Under Uncertainty and Resource ConstraintsPrescriptive Process Monitoring Under Uncertainty and Resource Constraints
Prescriptive Process Monitoring Under Uncertainty and Resource Constraints
Marlon Dumas
 
Robotic Process Mining
Robotic Process MiningRobotic Process Mining
Robotic Process Mining
Marlon Dumas
 
Process Mining: A Guide for Practitioners
Process Mining: A Guide for PractitionersProcess Mining: A Guide for Practitioners
Process Mining: A Guide for Practitioners
Marlon Dumas
 
Process Mining for Process Improvement.pptx
Process Mining for Process Improvement.pptxProcess Mining for Process Improvement.pptx
Process Mining for Process Improvement.pptx
Marlon Dumas
 
Optimización de procesos basada en datos
Optimización de procesos basada en datosOptimización de procesos basada en datos
Optimización de procesos basada en datos
Marlon Dumas
 
Process Mining and AI for Continuous Process Improvement
Process Mining and AI for Continuous Process ImprovementProcess Mining and AI for Continuous Process Improvement
Process Mining and AI for Continuous Process Improvement
Marlon Dumas
 
Prescriptive Process Monitoring for Cost-Aware Cycle Time Reduction
Prescriptive Process Monitoring for Cost-Aware Cycle Time ReductionPrescriptive Process Monitoring for Cost-Aware Cycle Time Reduction
Prescriptive Process Monitoring for Cost-Aware Cycle Time Reduction
Marlon Dumas
 
Ad

Recently uploaded (20)

Process Mining at Dimension Data - Jan vermeulen
Process Mining at Dimension Data - Jan vermeulenProcess Mining at Dimension Data - Jan vermeulen
Process Mining at Dimension Data - Jan vermeulen
Process mining Evangelist
 
AWS Certified Machine Learning Slides.pdf
AWS Certified Machine Learning Slides.pdfAWS Certified Machine Learning Slides.pdf
AWS Certified Machine Learning Slides.pdf
philsparkshome
 
文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询
文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询
文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询
Taqyea
 
AWS RDS Presentation to make concepts easy.pptx
AWS RDS Presentation to make concepts easy.pptxAWS RDS Presentation to make concepts easy.pptx
AWS RDS Presentation to make concepts easy.pptx
bharatkumarbhojwani
 
hersh's midterm project.pdf music retail and distribution
hersh's midterm project.pdf music retail and distributionhersh's midterm project.pdf music retail and distribution
hersh's midterm project.pdf music retail and distribution
hershtara1
 
problem solving.presentation slideshow bsc nursing
problem solving.presentation slideshow bsc nursingproblem solving.presentation slideshow bsc nursing
problem solving.presentation slideshow bsc nursing
vishnudathas123
 
Improving Product Manufacturing Processes
Improving Product Manufacturing ProcessesImproving Product Manufacturing Processes
Improving Product Manufacturing Processes
Process mining Evangelist
 
Fundamentals of Data Analysis, its types, tools, algorithms
Fundamentals of Data Analysis, its types, tools, algorithmsFundamentals of Data Analysis, its types, tools, algorithms
Fundamentals of Data Analysis, its types, tools, algorithms
priyaiyerkbcsc
 
RAG Chatbot using AWS Bedrock and Streamlit Framework
RAG Chatbot using AWS Bedrock and Streamlit FrameworkRAG Chatbot using AWS Bedrock and Streamlit Framework
RAG Chatbot using AWS Bedrock and Streamlit Framework
apanneer
 
Lagos School of Programming Final Project Updated.pdf
Lagos School of Programming Final Project Updated.pdfLagos School of Programming Final Project Updated.pdf
Lagos School of Programming Final Project Updated.pdf
benuju2016
 
CS-404 COA COURSE FILE JAN JUN 2025.docx
CS-404 COA COURSE FILE JAN JUN 2025.docxCS-404 COA COURSE FILE JAN JUN 2025.docx
CS-404 COA COURSE FILE JAN JUN 2025.docx
nidarizvitit
 
HershAggregator (2).pdf musicretaildistribution
HershAggregator (2).pdf musicretaildistributionHershAggregator (2).pdf musicretaildistribution
HershAggregator (2).pdf musicretaildistribution
hershtara1
 
Agricultural_regionalisation_in_India(Final).pptx
Agricultural_regionalisation_in_India(Final).pptxAgricultural_regionalisation_in_India(Final).pptx
Agricultural_regionalisation_in_India(Final).pptx
mostafaahammed38
 
Understanding Complex Development Processes
Understanding Complex Development ProcessesUnderstanding Complex Development Processes
Understanding Complex Development Processes
Process mining Evangelist
 
新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办
新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办
新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办
Taqyea
 
Voice Control robotic arm hggyghghgjgjhgjg
Voice Control robotic arm hggyghghgjgjhgjgVoice Control robotic arm hggyghghgjgjhgjg
Voice Control robotic arm hggyghghgjgjhgjg
4mg22ec401
 
TOAE201-Slides-Chapter 4. Sample theoretical basis (1).pdf
TOAE201-Slides-Chapter 4. Sample theoretical basis (1).pdfTOAE201-Slides-Chapter 4. Sample theoretical basis (1).pdf
TOAE201-Slides-Chapter 4. Sample theoretical basis (1).pdf
NhiV747372
 
录取通知书加拿大TMU毕业证多伦多都会大学电子版毕业证成绩单
录取通知书加拿大TMU毕业证多伦多都会大学电子版毕业证成绩单录取通知书加拿大TMU毕业证多伦多都会大学电子版毕业证成绩单
录取通知书加拿大TMU毕业证多伦多都会大学电子版毕业证成绩单
Taqyea
 
Controlling Financial Processes at a Municipality
Controlling Financial Processes at a MunicipalityControlling Financial Processes at a Municipality
Controlling Financial Processes at a Municipality
Process mining Evangelist
 
Mining a Global Trade Process with Data Science - Microsoft
Mining a Global Trade Process with Data Science - MicrosoftMining a Global Trade Process with Data Science - Microsoft
Mining a Global Trade Process with Data Science - Microsoft
Process mining Evangelist
 
Process Mining at Dimension Data - Jan vermeulen
Process Mining at Dimension Data - Jan vermeulenProcess Mining at Dimension Data - Jan vermeulen
Process Mining at Dimension Data - Jan vermeulen
Process mining Evangelist
 
AWS Certified Machine Learning Slides.pdf
AWS Certified Machine Learning Slides.pdfAWS Certified Machine Learning Slides.pdf
AWS Certified Machine Learning Slides.pdf
philsparkshome
 
文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询
文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询
文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询
Taqyea
 
AWS RDS Presentation to make concepts easy.pptx
AWS RDS Presentation to make concepts easy.pptxAWS RDS Presentation to make concepts easy.pptx
AWS RDS Presentation to make concepts easy.pptx
bharatkumarbhojwani
 
hersh's midterm project.pdf music retail and distribution
hersh's midterm project.pdf music retail and distributionhersh's midterm project.pdf music retail and distribution
hersh's midterm project.pdf music retail and distribution
hershtara1
 
problem solving.presentation slideshow bsc nursing
problem solving.presentation slideshow bsc nursingproblem solving.presentation slideshow bsc nursing
problem solving.presentation slideshow bsc nursing
vishnudathas123
 
Fundamentals of Data Analysis, its types, tools, algorithms
Fundamentals of Data Analysis, its types, tools, algorithmsFundamentals of Data Analysis, its types, tools, algorithms
Fundamentals of Data Analysis, its types, tools, algorithms
priyaiyerkbcsc
 
RAG Chatbot using AWS Bedrock and Streamlit Framework
RAG Chatbot using AWS Bedrock and Streamlit FrameworkRAG Chatbot using AWS Bedrock and Streamlit Framework
RAG Chatbot using AWS Bedrock and Streamlit Framework
apanneer
 
Lagos School of Programming Final Project Updated.pdf
Lagos School of Programming Final Project Updated.pdfLagos School of Programming Final Project Updated.pdf
Lagos School of Programming Final Project Updated.pdf
benuju2016
 
CS-404 COA COURSE FILE JAN JUN 2025.docx
CS-404 COA COURSE FILE JAN JUN 2025.docxCS-404 COA COURSE FILE JAN JUN 2025.docx
CS-404 COA COURSE FILE JAN JUN 2025.docx
nidarizvitit
 
HershAggregator (2).pdf musicretaildistribution
HershAggregator (2).pdf musicretaildistributionHershAggregator (2).pdf musicretaildistribution
HershAggregator (2).pdf musicretaildistribution
hershtara1
 
Agricultural_regionalisation_in_India(Final).pptx
Agricultural_regionalisation_in_India(Final).pptxAgricultural_regionalisation_in_India(Final).pptx
Agricultural_regionalisation_in_India(Final).pptx
mostafaahammed38
 
新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办
新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办
新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办
Taqyea
 
Voice Control robotic arm hggyghghgjgjhgjg
Voice Control robotic arm hggyghghgjgjhgjgVoice Control robotic arm hggyghghgjgjhgjg
Voice Control robotic arm hggyghghgjgjhgjg
4mg22ec401
 
TOAE201-Slides-Chapter 4. Sample theoretical basis (1).pdf
TOAE201-Slides-Chapter 4. Sample theoretical basis (1).pdfTOAE201-Slides-Chapter 4. Sample theoretical basis (1).pdf
TOAE201-Slides-Chapter 4. Sample theoretical basis (1).pdf
NhiV747372
 
录取通知书加拿大TMU毕业证多伦多都会大学电子版毕业证成绩单
录取通知书加拿大TMU毕业证多伦多都会大学电子版毕业证成绩单录取通知书加拿大TMU毕业证多伦多都会大学电子版毕业证成绩单
录取通知书加拿大TMU毕业证多伦多都会大学电子版毕业证成绩单
Taqyea
 
Controlling Financial Processes at a Municipality
Controlling Financial Processes at a MunicipalityControlling Financial Processes at a Municipality
Controlling Financial Processes at a Municipality
Process mining Evangelist
 
Mining a Global Trade Process with Data Science - Microsoft
Mining a Global Trade Process with Data Science - MicrosoftMining a Global Trade Process with Data Science - Microsoft
Mining a Global Trade Process with Data Science - Microsoft
Process mining Evangelist
 

Learning Accurate Business Process Simulation Models from Event Logs via Automated Process Discovery and Deep Learning

  • 1. Learning Accurate Business Process Simulation Models from Event Logs via Automated Process Discovery and Deep Learning Manuel Camargo Marlon Dumas Oscar González-Rojas
  • 2. z 2 Process Credit Card Accept Cash or Check Identify payment method 18 / 8 min 20 / 5 min 5 / 2 min Prepare package for customer 10 / 5 min Cycle time Processing / Waiting times Costs x activity x instance x resource … Resource utilization 1h 5 min 38 min 10 min How to assess the impact of a business process change on temporal & cost measures? BUSINESS PROCESS SIMULATION
  • 3. z DESIGNING BUSINESS PROCESS SIMULATION (BPS) MODELS 3 • Interviews • Expert knowledge • Observations • Sampling Time consuming (hard to tune) Execution paths left aside Identify payment method Prepare package for customer Accept Cash or Check X Process Credit Card X Payment Method? Process Virtual Currencies X Credit Card Accepted? Prone to human errors Unrealistic parameters
  • 4. z Data to the Rescue! EnterpriseSystem (CRM, ERP,…) Event Log Simulation Model
  • 5. z PROBLEM STATEMENT 5 How to automatically create accurate business process simulation models based on data extracted from enterprise information systems?
  • 6. z OBSERVATION 6 A generative model of business processes is a statistical model constructed from an event log that can generate traces that resemble those observed in the log and other traces of the process (not observed in the log). A Business Process Simulation (BPS) model is a generative model of a business process.
  • 7. z LEARNING GENERATIVE BUSINESS PROCESS MODELS 7 Data-Driven Discrete Event Simulation • Use process mining & data mining techniques to discover branching probabilities, resource pools, resource calendars, etc. Deep Learning • Use deep learning methods to discover a neural network (e.g. an LSTM network) to predict successive events and to generate event sequences
  • 8. z 8 Data-Driven Simulation • May take as input a process specification (helps with interpretability) • Requires specifications of resource constraints • Models the case creation process via a probability distribution • Assumes undifferentiated resources with robotic behavior • Models resource availability as calendars (possibly discovered from historical data) • Relies on branching probabilities to local conditional choice • Provides a natural mechanism for capturing the effect of changes to the process Deep Learning (DL) Sequence Generation Methods • No interpretable process specification • Does not explicitly consider resource constraints • Learns the case arrival process from data • May capture differentiated resources and robotic behavior • Captures resource availability via non-linear functions • Branching behavior modeled via neural networks  may capture complex relations • Does not have a mechanism for capturing the effect of changes to the process LEARNING GENERATIVE BUSINESS PROCESS MODELS M. Camargo, M. Dumas, O. González Rojas: Discovering generative models from event logs: data-driven simulation vs deep learning. PeerJ Computer Science vol. 7, e577, 2021
  • 9. z HYPOTHESIS 9 By combining data-driven simulation and deep learning, we can learn generative business process models that are: 1. more accurate than those we can learn by using these methods in isolation 2. Suitable both for “as is” and for “what if” analysis use cases
  • 10. z DeepSimulator: HYBRID LEARNING OF BUSINESS PROCESS SIMULATION MODELS 10 {T1 -> T2 -> T3} {T1 -> T3 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T2} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T3 -> T2} {T1 -> T2 -> T3} {T1 -> T2 -> T3} Event log Phase 1 • Stochastic process model Discovery & optimization • Generation of activity sequences Activity sequence generation Accuracy assessment Phase 2 • Time-series analysis & optimization • Enrichment of traces with start-times Case start-times generation Phase 3 • Deep-learning models training & optimization • Enrichment of traces with timestamps Activity timestamps generation
  • 11. z 11 PHASE 1 - ACTIVITY SEQUENCE GENERATION A1 A2 A3 A5 A6 A4 Ꝺ1: A2 A3 A4 A5 Ꝺ2: • Control-flow discovery (BPMN model): • Split Miner algorithm • Discover the branching probabilities: I. Assign equal values to each conditional branch II. Compute the branching probabilities by replaying the aligned event against the process model DeepSimulator: HYBRID LEARNING OF BUSINESS PROCESS SIMULATION MODELS
  • 12. z 12 PHASE 2 - CASE START-TIMES GENERATION A1 A2 A3 A5 A6 A4 A2 A3 A4 A5 Time series decomposition (trend, seasonality, and holidays): y(t) = g(t) + s(t) + h(t) + Ɛt Trend g(t): models nonperiodic changes in the value of the time series: Saturating growth model (logistic growth model) Seasonality s(t): represents periodic changes (e.g., weekly and yearly seasonality): Fourier series Holidays h(t): represents the effects of holidays that occur on potentially irregular schedules over one or more days: Introduced by the analyst Error Ɛt: idiosyncratic changes which are not accommodated by the model DeepSimulator: HYBRID LEARNING OF BUSINESS PROCESS SIMULATION MODELS
  • 13. z 13 PHASE 3 - ACTIVITY TIMESTAMPS GENERATION e1- start e1- complete e2- start 𝜎1 Ac1 Ac2 e2- complete Waiting time predictive model Features: Wait+Ac2+Cx+WIP+RO Processing time predictive model Features: Proc+Ac1+Cx+WIP+RO A1 A2 A3 A5 A6 A4 A2 A3 A4 A5 DeepSimulator: HYBRID LEARNING OF BUSINESS PROCESS SIMULATION MODELS
  • 14. z Size Source log #Traces #Events #Act. Avg. activities per trace Avg. duration Max. duration Description LARGE R POC 70512 415261 8 5.89 15.21 days 269.23 days Undisclosed banking process* R BPI17W 30276 240854 8 7.96 12.66 days 286.07 days Dutch financial institution updated R BPI12W 8616 59302 6 6.88 8.91 days 85.87 days Dutch financial institution S CVS 10000 103906 15 10.39 7.58 days 21.0 days CVS retail pharmacy** S CFM 1670 44373 29 26.57 0.76 days 5.83 days Anonymized confidential process** SMALL R INS 1182 23141 9 19.58 70.93 days 599.9 days Insurance claims process* R ACR 954 4962 16 5.2 14.89 days 135.84 days Academic Credential Recognition R MP 225 4503 24 20.01 20.63 days 87.5 days Manufacturing Production S CFS 800 21221 29 26.53 0.83 days 4.09 days Anonymized confidential process** S P2P 608 9119 21 15 21.46 days 108.31 days Purchase-to-Pay process DATASETS (EVENT LOGS) 14 (*) Private logs, (**) Generated from simulation models of real processes
  • 15. z EXP1 - AS-IS ANALYSIS USING DEEPSIMULATOR 15 Partition 2 (30%) Testing Partition 1 (70%) Training Testing Deep Learning Trainer BEST DL MODEL Trace generator Time splitting Evaluator ELS/DL/MAE/EMD Simod Parameter extraction BEST SIM MODEL Simulator Training (80%) Validation (20%) Event-log 1 2 3 DeepSimulator BEST DEEP SIM MODEL Deep Simulator
  • 16. z EXP1 – EVALUATION RESULTS 16 DeepSimulator generally outperforms classical DDS w.r.t. temporal measures
  • 17. z EXP2 - WHAT-IF ANALYSIS (ARRIVAL INTENSITY) 17 Scenario 1 Batch 1 Batch 2 Batch 3 Batch 4 Batch 5 Batch 6 BPI17W BPI12W CVS Version1 : Demand modified (D) Version2 : Demand + waiting times Modified (TD)
  • 18. z EXP2 - WHAT-IF ANALYSIS (ADDING A NEVER-BEFORE-OBSERVED ACTIVITY) 18 Remove activity Train DSIM baseline model List of changes Update models Update embeddings Replace embeddings of generative models Evaluator MAE/RMSE/SMAPE Generate log Generate log Simulation model Train DSIM modified model BASELINE MODEL UPDATED MODEL Simulation model modified log Modified log Partition 2 (30%) Testing Partition 1 (70%) Time splitting Training (80%) Validation (20%) Event-log Scenario 2
  • 19. z EXP2 - EVALUATION RESULTS 19 • DeepSimulator can better estimate the impact of changes in the demand in settings where such changes have been previously observed in the data. • The accuracy of DeepSimulator degraded when evaluated in a previously unobserved scenario (a new activity is added to the process) SIMOD DSIM SIMOD DSIM SIMOD DSIM Version 1 BPI17W 971151 417572 0.02222 0.03593 3185 3647 BPI12W 660211 534341 0.11295 0.04853 515 458 CVS 1489252 467572 0.03213 0.00001 3380 849 Version 2 BPI17W 895524 290980 0.06438 0.03218 4528 3431 BPT12W 550266 524995 0.25888 0.22003 726 507 CVS 540112 246159 0.15674 0.05708 2453 1967 AS-IS WHAT-IF AS-IS WHAT-IF AS-IS WHAT-IF CFM 7155 17546 22006 33137 0.15629 0.28762 CVS 283061 1040344 357717 1052255 0.31972 1.84601 Log MAE RMSE SMAPE Scenario 1 Scenario 2 Log MAE EMD DTW
  • 20. z CONCLUSION 20 The DeepSimulator method combines data-driven simulation to capture the control-flow perspective of a process with deep learning techniques to capture the temporal perspective. The evaluation in the AS-IS setting shows that DeepSimulator outperforms a pure Data-Driven Discrete Event Simulation method and a pure Deep Learning method. The evaluation on WHAT-IF analysis scenarios shows that DeepSimulator can better estimate the impact of changes on the arrival rate of cases (the demand) in settings where such changes have been previously observed in the data. However, the accuracy of DeepSimulator degrades when applied to a previously unobserved scenario, specifically a scenario where a completely new activity is added to the process.
  • 21. z FUTURE WORK 21 Explore other mechanisms for modeling the activities of the process via embeddings (e.g., word2vec, transformer models). Generate events that include resource and domain-specific attributes. Support a broader range of changes, such as changes in the resource perspective