Analysis of Simulation Output
Analysis of Simulation Output
SIMULATION OUTPUT
Durgaraj katuwal
Why analysis of simulation
output ?
■ Many simulation includes some sort of randomness, which can arise in
a variety of ways e.g. in a simulation of manufacturing system, the
processing times required at a station may have random variations or
the arrival times of new jobs may not be known in advance.
■ In a bank, customers arrive at random times & amount of time spent
at a teller is not known beforehand. Because of the randomness of the
components driving a simulation, the o/p from simulation is also
random. So, statistical techniques must be used to analyze the
results.
■ To test different ideas, to learn about the system behavior in new
situation, to learn about simulation model and the corresponding
simulation system.
Nature of the problem
■ Once a stochastic variable has been introduced into simulation model,
almost all the system variables describing the system behavior also
become stochastic.
■ Hence it needs some statistical method to analyze the simulation
output. •
■ A large body of statistical methods has been developed over the years
to analyze results in science, engineering and other fields.
■ It seem natural to attempt applying these methods to analyze the
simulation output but most of them pre-suppose that the results are
mutually independent (IID) and the simulation process almost never
produce raw output that is IID. For example: customer waiting times
from queuing system are not IID. Thus it is difficult to apply classical
statistical techniques to analysis of simulation
Nature of the problem
■ Once a stochastic variable has been introduced into simulation model, almost all the
system variables describing the system behavior also become stochastic.
■ Hence it needs some statistical method to analyze the simulation output.
■ A large body of statistical methods has been developed over the years to analyze
results in science, engineering and other fields.
■ It seem natural to attempt applying these methods to analyze the simulation output
but most of them pre-suppose that the results are mutually independent (IID) and
the simulation process almost never produce raw output that is IID. For example:
customer waiting times from queuing system are not IID. Thus it is difficult to apply
classical statistical techniques to analysis of simulation model.
■ Making a variable stochastic changes the properties of remaining variables to
stochastic since the endogenous events make one variable depend upon another
■ While characterizing the property of a variable, in every time unit the value
changed is compared to predefined intervals known as confidence interval to find
similarity between estimated and simulated value
■ Simulation results are not mutually independent definitions
Estimation Methods
probability distribution with a finite mean μ and finite variance 𝜎 2 .
■ A random variable is drawn from an infinite population that has a stationary
■ Random numbers that meet all these conditions are said to be IID (Independently
and Identically Distributed) variable for which the central limit theorem can be
applied.
The theorem states that the sum of 𝑛 IID variable drawn from a population that
has mean μ and a variance of 𝜎 ^2 is approximately distributed as a normal
■
Solution : 𝜃` = (5.5 + 6.1 + 5.7 + 6.6 + 5.2 + 6.0 + 5.6 + 6.3 + 5.9 +
5.8)/10 = 5.87 Now, bias of estimator = 5.87 – 6.1 = - 0.23
Estimation Methods
Estimation Methods
b) Interval Estimation/Confidence Interval Estimation
Confidence interval is a measure used to analyze the correctness of the
point estimator. In the above example we got point estimation of height
of the people of a city but we don't know whether to accept or reject it.
Confidence intervals are based on the premise that the data being
defined by
Estimation Methods
Here, the quantity tn-1, α/2 is found on the student distribution table.
Example 5.2 Q.N.->
The daily production time of a product in a factory for
120 days is 5.8 hours and sample standard deviation
(S) is 1.6. Calculate confidence interval for 95%
confidence level.
Example 5.3
Q.N.->Following are the random sample of marks of
student of the orchid college. If the population mean
is 51.1 , find the bias of the point estimator.
51, 52, 56 ,50, 48 ,49, 59.8, 45.9, 58, 47
Simulation Run statistics
■ In the estimation method, it is assumed that the observations are
mutually independent and the distribution from which they are draws is
stationary.
■ Unfortunately many statistics of interest in simulation do not meet these
conditions.
■ For example: consider a single server system in which the arrival occurs
with poison distribution and service time has an exponential and queue
discipline is FIFO.
■ Where, Poisson distribution means a discrete frequency distribution which
gives the probability of a number of independent events occurring in a
fixed time.
■ Suppose the study objective is to measure the mean waiting time. In
simulation run, the simplest approach to estimate the mean
waiting time by accumulating the waiting time of n successive
entities and dividing by n. this is the sample mean, denoted by
Waiting time measured in this way is not independent.
Simulation Run statistics
■ Whenever a waiting line forms, the waiting time of each entity on the
line clearly depends upon the waiting time of its predecessors. Such
data are called auto-correlated. Another problem that must be faced
is that distribution is not stationary. TE early arrivals get the service
quickly, so a sample mean that include early arrivals is biased. The
following figure show the mean waiting time for different sample
sizes.
Simulation Run statistics
■ Consider a single server system in which the arrival occurs with a
Poisson distribution and the service time has an Exponential
distribution. Suppose the study objective is to measure the mean
waiting time defined as the time entities spend waiting to receive
𝑀/𝑀/1.
service (excluding service time). This system is commonly denoted by
for 𝑀/𝑀/1 system, starting from an initial empty state with a server
the expected value of sample mean depends upon the sample length
utilization of 0.9.
Replications of Runs
■ One way of obtaining independent result is to repeat simulation.
Repeating the experiment with different random numbers for the