0% found this document useful (0 votes)

3 views

SM_Lect_07 (1)

The document outlines the process of analyzing simulation data, focusing on input modeling, data collection, and the identification of statistical distributions. It details methods for data collection, types of data, and techniques for testing the goodness of fit of distributions. The four key steps in developing input data models are collecting raw data, identifying statistical distributions, estimating parameters, and testing for goodness of fit.

Uploaded by

ifexplora

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views

SM_Lect_07 (1)

Uploaded by

ifexplora

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 25

Simulation And Modeling

CS-805: SIMULATION and MODELING

Analysis of Simulation Data

Lecture 13, Unit-4
Contents

Analysis of Simulation Data

• Input Modelling:
• Data collection,
• Identification and
• distribution with data
• Parameter estimation, Goodness of fit tests,
• Selection of input models without data, Multivariate and time
series analysis,
• Verification and Validation of Model – Model Building, Verification,
• Calibration and Validation of Models.
Introduction

Input Modelling : Why ?

Input Data

• The ultimate use of input data is to drive the

simulation.
• The process involves
• Collection of Input Data
• Analysis of the input data
• Use the analysis of input data in the simulation model
Collection of input data

• The data may not exist

• eg. A project involves analysis of new capital equipment

• Collection of historical data

• E.g. sales data

• The data may be collected in real time

• E.g. changes in traffic patterns
Sources for input data
• Historical records
• Old data may not be of much use
• Reliability factor
• Complete information is not available

• Manufacturer specifications
• Whether or not these claims can actually be achieved in a real environment has to be proven

• Vendor claims
• The vendor or distributor should already have some experience with the type of system that is being considered.

• Operator claims
• If the operator is knowledgeable about the system, it may be possible to obtain some performance estimates that can
be used as input data

• Management estimates
• their input maybe helpful when an experienced operator is not available for input

• Automatic data capture

• This is analogous to the traffic volume monitors that are frequently encountered on the road.

• Direct observations
• The most physically and mentally demanding form of data collection
• This approach can be particularly grueling and costly when a large amount of data on infrequently occurring events
must be captured.
Data Collection Mechanisms

• Data Collection Devices

• Wit equipment's
• With the video
• With the help of programs

• Time collection mode and units

• Event advance system vs fixed time interval
• Time metric: nonSec/Msec,Sec,Min,Hr,Week…

• Other data collection consideration

• Unbiased Data
• Data collection wit out disruption
Types of Data

• Identify the data type

• Deterministic vs Probabilistic Data
• Deterministics: Conveyor velocities, Preventive maintenance schedule
• Probablistic: Interarrival Time, Customer service processes, Repair times
• Discrete Vs Continuous Data
• Discrete: No. of people arrive in system as a group or a batch, number of
jobs processed before a machine experiences a breakdown
• Continuous: Time between arrivals, Service times, route times
Common Data Distributions

Already covered
• Bernoulli
• Uniform
• Exponential
• Normal
• Triangular
• Weibull
• Erlang
Selecting the Family of Distributions
Use the physical basis of the distribution as a guide, e.g.:
• Binomial: Number of successes in n trials
• Negative binomial and geometric: Number of trials to achieve k successes
• Poisson: Number of independent events that occur in a fix amount of time or space
• Normal: Distribution of a process that is the sum of a number of component processes
• Lognormal: Distribution of a process that is the product of a number of component
processes
• Exponential: Time between independent events, or a process time that is memoryless
• Weibull: Time to failure for components
• Discrete or continuous uniform: Models complete uncertainty
• Triangular: A process for which only the minimum, most likely, and maximum values are
known
• Empirical: Re-samples from the actual data collected
Analysis of input data
• The process of determining the underlying theoretical distribution for
a set of data usually involves what is known as a goodness of fit test.
• these tests are based on some sort of comparison between the
observed data distribution and a corresponding theoretical
distribution.
• If the difference between the observed data distribution and the
corresponding theoretical distribution is small, then it may be stated
with some level of certainty that the input data could have come from
a set of data with the same parameters as the theoretical distribution.
• Methods:
• Graphic approach
• Chi-square test
• Kolmogorov–Smirnov test
• Square error
Analysis of input data: Graphic Approach
• This approach consists of a visual qualitative comparison between
the actual data distribution and a theoretical distribution from which
the observed data may have come.
• Steps
• Create a histogram of observed data
• Create a histogram for the theoretical distribution
• Visually compare the two histograms for similarity
• Make a qualitative decision as to the similarity of the two data sets

• The practitioner must first decide

• how wide a data range each bar in the histogram covers and how many bars to graph.
• The number of observations in each data cell is used to represent the height of the
histogram bars.

• There are two common approaches for determining how to handle

the cell issue:
• Equal-interval approach
• Equal-probability approach
Data Collection
• Suggestions that may enhance and facilitate data
collection:
• Analyze the data as it is being collected: check
adequacy
• Combine homogeneous data sets: successive time
periods, during the same time period on successive days
• Be aware of data censoring: the quantity is not
observed in its entirety, danger of leaving out long
process times
• Check for relationship between variables (scatter
diagram)
• Check for autocorrelation
Identifying the Distribution
Histograms
• A frequency distribution or histogram is useful in determining
the shape of a distribution
• The number of class intervals depends on:
• The number of observations
• The dispersion of the data
• Suggested number of intervals: the square root of the sample
size
• For continuous data:
• Corresponds to the probability density function (pdf) of a theoretical distribution
• For discrete data:
• Corresponds to the probability mass function (pmf)
• If few data points are available
• combine adjacent cells to eliminate the ragged appearance of the histogram
Histograms

Same data with different interval

sizes
Histograms
Example
• Vehicle Arrival Example: Number of vehicles arriving at
an intersection between 7 am and 7:05 am was
monitored for 100 random workdays.
• There are ample data, so the histogram may have a
cell for each possible value in the data range
Histograms: Example

• Sample size 10000

• with different numbers of bins
Identifying the Distribution
Scatter diagrams
A scatter diagram is a quality tool that can show
the relationship between paired data
• Random Variable X = Data 1
• Random Variable Y = Data 2
• Draw random variable X on the x-axis and Y on
the y-axis
Scatter diagrams
➢ Linear relationship
➢ • Correlation: Measures how well data line up
• Slope: Measures the steepness of the data
• Direction
➢ • Y intercept
Identifying the Distribution
Selecting the Family of Distributions
Selecting the Family of Distributions
A family of distributions is selected based on:
• The context of the input variable
• Shape of the histogram
• Frequently encountered distributions:
Selecting the Family of Distributions
Use the physical basis of the distribution as a guide, e.g.:
• Binomial: Number of successes in n trials
• Negative binomial and geometric: Number of trials to achieve k successes
• Poisson: Number of independent events that occur in a fix amount of time or space
• Normal: Distribution of a process that is the sum of a number of component processes
• Lognormal: Distribution of a process that is the product of a number of component
processes
• Exponential: Time between independent events, or a process time that is memoryless
• Weibull: Time to failure for components
• Discrete or continuous uniform: Models complete uncertainty
• Triangular: A process for which only the minimum, most likely, and maximum values are
known
• Empirical: Re-samples from the actual data collected
Selecting the Family of Distributions
Remember the physical characteristics of the process
• Is the process naturally discrete or continuous valued?
• Is it bound?
• Value range?
• Only positive values
• Only negative values
• Interval of [-a:b]
• No “true” distribution for any stochastic input process
• Goal: obtain a good approximation
Summary
• In this Unit,
we described the 4 steps in developing input data
models:
(1) Collecting the raw data
(2) Identifying the underlying statistical distribution
(3) Estimating the parameters
(4) Testing for goodness of fit
Reference

• Simulation modelling Handbook: a practical

approach: Cristopher A. Chung, CRC Press,
ISBN 0-8493-1241-8, 2004
Thank You

Wish you a fruitful Simulation

DMS2100i Bridge Manoeuvring System: MAN B&W ME/ME-C Engines
No ratings yet
DMS2100i Bridge Manoeuvring System: MAN B&W ME/ME-C Engines
113 pages
Assessment Checklist
100% (1)
Assessment Checklist
5 pages
7 Input Modeling 2024
No ratings yet
7 Input Modeling 2024
90 pages
2
No ratings yet
2
7 pages
Input Modelling: Name: Sohail Shaikh Roll No.: Pa03 Sub: Dess Cad/Cam/Cae
No ratings yet
Input Modelling: Name: Sohail Shaikh Roll No.: Pa03 Sub: Dess Cad/Cam/Cae
14 pages
4.1.1 Input Modeling
No ratings yet
4.1.1 Input Modeling
63 pages
Input Modeling
No ratings yet
Input Modeling
10 pages
lec08-2025
No ratings yet
lec08-2025
43 pages
Input Modeling For Simulation
No ratings yet
Input Modeling For Simulation
48 pages
Input Modelling: Discrete-Event System Simulation
No ratings yet
Input Modelling: Discrete-Event System Simulation
41 pages
WK4 - Input Data v1
No ratings yet
WK4 - Input Data v1
27 pages
Chap 9 Input Modeling - 8-9
No ratings yet
Chap 9 Input Modeling - 8-9
42 pages
Input Modeling: Banks, Carson, Nelson & Nicol
No ratings yet
Input Modeling: Banks, Carson, Nelson & Nicol
7 pages
Cpsc531 Input
No ratings yet
Cpsc531 Input
44 pages
CS30 5 System Modeling and Simulation Prof. Dr. Khaled Mahar
No ratings yet
CS30 5 System Modeling and Simulation Prof. Dr. Khaled Mahar
32 pages
SMS Module 4_Input Modeling
No ratings yet
SMS Module 4_Input Modeling
33 pages
8 CSC446 546 InputModeling
No ratings yet
8 CSC446 546 InputModeling
44 pages
CH 9
No ratings yet
CH 9
13 pages
20250319-Week5-Input Modeling
No ratings yet
20250319-Week5-Input Modeling
71 pages
Chap 06 Slides
No ratings yet
Chap 06 Slides
40 pages
Prop Final 4
No ratings yet
Prop Final 4
119 pages
Chap 05 Promodel Tricks and Data Collection and Analysis
No ratings yet
Chap 05 Promodel Tricks and Data Collection and Analysis
79 pages
Simulation Input Data Analysis
No ratings yet
Simulation Input Data Analysis
43 pages
Concept of Variation 1
No ratings yet
Concept of Variation 1
7 pages
Unit .......
No ratings yet
Unit .......
45 pages
Chapter 9 Input Modeling
No ratings yet
Chapter 9 Input Modeling
10 pages
10.4324 9781003124092-5 Chapterpdf
No ratings yet
10.4324 9781003124092-5 Chapterpdf
16 pages
Simulation Output Data Analysis
No ratings yet
Simulation Output Data Analysis
23 pages
Amit_Khilare_Used_Device_Data_PM_Project
No ratings yet
Amit_Khilare_Used_Device_Data_PM_Project
25 pages
Measure Phase and Data Collection
No ratings yet
Measure Phase and Data Collection
55 pages
MAT 211 Introduction To Business Statistics I Lecture Notes
No ratings yet
MAT 211 Introduction To Business Statistics I Lecture Notes
69 pages
Handouts 1 ENDATA130 Introduction To Data Analysis
No ratings yet
Handouts 1 ENDATA130 Introduction To Data Analysis
52 pages
Chapter 9 Input Modeling
No ratings yet
Chapter 9 Input Modeling
10 pages
Data Science With Python - Lesson 02 - Data Analytics Overview
No ratings yet
Data Science With Python - Lesson 02 - Data Analytics Overview
54 pages
EDA - Unit 1
No ratings yet
EDA - Unit 1
82 pages
Unit 2
No ratings yet
Unit 2
20 pages
Lecture 1 Inferential Statistics
No ratings yet
Lecture 1 Inferential Statistics
32 pages
Bahan Ajar Minggu 11 Simsis
No ratings yet
Bahan Ajar Minggu 11 Simsis
11 pages
Data and Monte Carlo Simulations
No ratings yet
Data and Monte Carlo Simulations
66 pages
ML Unit 1 Part 2
No ratings yet
ML Unit 1 Part 2
56 pages
Chapter Five:: Analyses and Interpretation of Data
No ratings yet
Chapter Five:: Analyses and Interpretation of Data
72 pages
Notes
No ratings yet
Notes
12 pages
Business Statistics - Session 1 - 3
No ratings yet
Business Statistics - Session 1 - 3
63 pages
CBA Lecture 6
No ratings yet
CBA Lecture 6
24 pages
Statistics Traffic Data Analysis
No ratings yet
Statistics Traffic Data Analysis
25 pages
مبادئ الاحصاء
No ratings yet
مبادئ الاحصاء
66 pages
LECT-3-Introduction To Statics-Economics
No ratings yet
LECT-3-Introduction To Statics-Economics
47 pages
Data Management
No ratings yet
Data Management
43 pages
GET305 NOTES
No ratings yet
GET305 NOTES
19 pages
RDA imp
No ratings yet
RDA imp
26 pages
Input Modeling: Discrete-Event System Simulation
No ratings yet
Input Modeling: Discrete-Event System Simulation
14 pages
SSGB_Part 2_1 Crown Secured Measure Phase
No ratings yet
SSGB_Part 2_1 Crown Secured Measure Phase
102 pages
Sim Notes 1
No ratings yet
Sim Notes 1
11 pages
Unit 4 Big Data Complete Notes
No ratings yet
Unit 4 Big Data Complete Notes
32 pages
Data Science and Visualization
No ratings yet
Data Science and Visualization
37 pages
Tutorial - 1-Graphs
No ratings yet
Tutorial - 1-Graphs
40 pages
Chapter One - Introduction
No ratings yet
Chapter One - Introduction
156 pages
Complete Lectures PME
No ratings yet
Complete Lectures PME
330 pages
Complete Lectures PME
0% (1)
Complete Lectures PME
329 pages
Artificial intelligence: AI in the technologies synthesis of creative solutions
From Everand
Artificial intelligence: AI in the technologies synthesis of creative solutions
Alexander V. Andreichikov
No ratings yet
Técnicas Estadísticas para la Ciencia de Datos a través de R. Aprendizaje Supervisado: Análisis Discriminante, Árboles de Decisión, Redes Neuronales y Modelos Lineales Generalizados
From Everand
Técnicas Estadísticas para la Ciencia de Datos a través de R. Aprendizaje Supervisado: Análisis Discriminante, Árboles de Decisión, Redes Neuronales y Modelos Lineales Generalizados
César Pérez López
No ratings yet
Elementary Statistics
From Everand
Elementary Statistics
jay prakash Maheshwari
5/5 (1)
Newton Shell
No ratings yet
Newton Shell
2 pages
AM-IS-ARE and PERSONAL PROMOUNS
No ratings yet
AM-IS-ARE and PERSONAL PROMOUNS
4 pages
Nature and Scope of Economics 2
No ratings yet
Nature and Scope of Economics 2
14 pages
Objective 1) To Configure The Pin 2 To Pin 13 of The Arduino Board. 2) To Create Program That Simulates 10 Lights Sequence
No ratings yet
Objective 1) To Configure The Pin 2 To Pin 13 of The Arduino Board. 2) To Create Program That Simulates 10 Lights Sequence
6 pages
Catalog Pressure Switches Series S Asco en 4687222
No ratings yet
Catalog Pressure Switches Series S Asco en 4687222
8 pages
Assenco Tyre
No ratings yet
Assenco Tyre
3 pages
BogdanVatra Extending QT Android Apps With JNI
No ratings yet
BogdanVatra Extending QT Android Apps With JNI
56 pages
For Ludhiana
No ratings yet
For Ludhiana
4 pages
Immersion of 21 Century Learners On The Kapampangan History Immersion of 21 Century Learners On The Kapampangan History
No ratings yet
Immersion of 21 Century Learners On The Kapampangan History Immersion of 21 Century Learners On The Kapampangan History
10 pages
A Project Report On: ROLL: NO 10R11E0040
No ratings yet
A Project Report On: ROLL: NO 10R11E0040
6 pages
Fernando Sor Etude in B Minor PDF
25% (4)
Fernando Sor Etude in B Minor PDF
2 pages
Akash Varshney: Contact No.
No ratings yet
Akash Varshney: Contact No.
2 pages
Touching Architecture Affective Atmospheres and Embodied Encounters 1st Edition Anthony Brand - The ebook in PDF and DOCX formats is ready for download now
100% (1)
Touching Architecture Affective Atmospheres and Embodied Encounters 1st Edition Anthony Brand - The ebook in PDF and DOCX formats is ready for download now
67 pages
Robert Venturi
0% (1)
Robert Venturi
2 pages
298 Reg Code
No ratings yet
298 Reg Code
4 pages
Spare Parts List: Chain Saws 365SP From 2018-12
No ratings yet
Spare Parts List: Chain Saws 365SP From 2018-12
33 pages
Assignment 1
No ratings yet
Assignment 1
33 pages
Type 10 Multiple Choice Exercises
No ratings yet
Type 10 Multiple Choice Exercises
22 pages
Activity-4-handout-Kath-Fisher-reflection-paper
No ratings yet
Activity-4-handout-Kath-Fisher-reflection-paper
11 pages
Economic Analysis of Organic and Convectional Turmeric Cultivation of Erode District in Tamil Nadu
No ratings yet
Economic Analysis of Organic and Convectional Turmeric Cultivation of Erode District in Tamil Nadu
4 pages
Yusuf Khan - INFINITE SERIES 2019-05-28-1
No ratings yet
Yusuf Khan - INFINITE SERIES 2019-05-28-1
9 pages
III II Ps Lab Manual
100% (1)
III II Ps Lab Manual
87 pages
Classroom Observation Tool
No ratings yet
Classroom Observation Tool
1 page
2. DE ngay 14.11 Co Huong gui- 2023 - Copy
No ratings yet
2. DE ngay 14.11 Co Huong gui- 2023 - Copy
16 pages
Nokia 2000 Series
No ratings yet
Nokia 2000 Series
3 pages
Hydraulic Accumulators SIL Certificate Exida
No ratings yet
Hydraulic Accumulators SIL Certificate Exida
17 pages
BISMCS 343 Syllabus Winter 2022
No ratings yet
BISMCS 343 Syllabus Winter 2022
6 pages
Axis Industries Heat Meters 20 Years of Experience
No ratings yet
Axis Industries Heat Meters 20 Years of Experience
24 pages

SM_Lect_07 (1)

Uploaded by

SM_Lect_07 (1)

Uploaded by

Simulation And Modeling

CS-805: SIMULATION and MODELING

Analysis of Simulation Data

Analysis of Simulation Data

Input Modelling : Why ?

• The ultimate use of input data is to drive the

• The data may not exist

• Collection of historical data

• The data may be collected in real time

• Automatic data capture

• Data Collection Devices

• Time collection mode and units

• Other data collection consideration

• Identify the data type

• The practitioner must first decide

• There are two common approaches for determining how to handle

Same data with different interval

• Sample size 10000

• Simulation modelling Handbook: a practical

Wish you a fruitful Simulation

You might also like