ExpertFit Student Version Overview
ExpertFit Student Version Overview
ExpertFit
Student Version 8
ExpertFit can perform the three main types of analyses given in Table 1.1.
Module Description
Data Analysis Used to determine what probability distribution best
represents a data set. You can either have ExpertFit
determine the best distribution automatically or specify
the distributions for consideration manually. See
Chapter 2 for further discussion and an example.
Task-Time Models Used to specify a probability distribution for a task time
when no data are available. Based on subjective
estimates of the minimum task time, the most-likely task
time, and, say, the 90th percentile of the task time,
ExpertFit specifies a Weibull, lognormal, or triangular
distribution as a model for the task time. This analysis
type is not available in the Student Version.
Machine-Breakdown Used to model the random downtimes of a machine
Models
when no downtime data are available. Based on
subjective estimates of such parameters as machine
efficiency (e.g., 0.90) and mean downtime, ExpertFit
specifies a busy-time distribution and a downtime
distribution. This analysis type is not available in
the Student Version.
In addition, the important features given in Table 1.2 can be accessed from the
Menu Bar at the top of the screen.
Table 1.2. Additional important ExpertFit features.
Module Description
Distribution Viewer Used to display/calculate characteristics (e.g., the
density function or moments) of a distribution without
having to enter a data set.
Batch Mode Used to fit distributions to one or more data sets with
only a few mouse clicks.
There are two modes of operation (see the Mode pull-down menu) for the Data
Analysis module: Standard and Advanced. Standard Mode is sufficient for 95 percent
of all analyses and is easier to use. It focuses the user on those features that are the
most important at a particular point in an analysis. Advanced Mode contains a large
number of additional features for the sophisticated user. A user can switch from one
mode to another at any time. In this brochure we will only discuss the Standard Mode.
There are also two levels of precision when fitting distributions (see the
Precision pull-down menu): Normal and High. Normal Precision provides good
estimates, for many data sets, of the parameters of a distribution and has a small
execution time. High Precision (the default) provides better parameter estimates for
most data sets, but will have a larger execution time for data sets containing many
observations.
The use of the Data Analysis module to determine what probability distribution
best represents a data set is based on the sequential application of the four tabs shown
in Table 2.1.
Table 2.1. Tabs for the Data Analysis module.
Although there are different ways that these four tabs could be used to
determine the best distribution for a data set, the following are the explicit steps that we
recommend for real-valued data:
1. Obtain a data set using the Data tab.
2. View the resulting Data-Summary Table (in Data tab) – provides information on the
shape and range of the true underlying density function.
3. Make a histogram of your data (used in Step 5) using the Data tab – see the
Constructing a Histogram from Your Data tutorial in the Help pull-down menu.
4. Determine the distribution that is the best representation for your data using the
Automated Fitting option in the Models tab.
5. Confirm using the Comparisons tab that the best distribution as determined by
ExpertFit is, in fact, satisfactory in an absolute sense.
6. If you are doing simulation modeling, then either represent the best distribution (if
good in an absolute sense) or an empirical distribution based on your data (if the
best distribution is not satisfactory) in your simulation software using the
Applications tab.
An example of the use of the Data Analysis module is given next.
Example 2.1: Customer Service Times
This example illustrates how you can use the six-step approach given at the
beginning of this chapter to find the probability distribution that best represents a data
set. We discuss both the ExpertFit commands necessary to accomplish a particular
part of the analysis and the actual results of the analysis. This analysis will use High
Precision.
At window: Do:
Project 1 Click on New.
Project-Element Editing Select Fit distributions to data.
In the Project-Element Name edit box, enter
Example 2.1.
Click on OK.
Project 1 Click on Analyze.
Data tab Click on Enter Data.
Enter-Data Options Click on Apply.
Open In the File Name scroll list, select
EXAM21_UsersGuide.EXP.
A: The Data-Summary Table for this set of n = 450 service times (read in the Data
tab) is given in Table 2.2. The positive value of the sample skewness indicates that the
underlying distribution of the data is skewed to the right (i.e., it has a longer right tail
than left tail). This is supported by the sample mean being larger than the sample
median.
7
Data Characteristic Value
8
of the firstfor
Steps interval from 0.064
Action B: to 0:
Histogram Options Click on the equal-sign ("=") button next to 0.06400.
At window: Do:
Change the value to 0.0 in the edit box.
Data tab Click on Histogram.
Click on OK.
HistogramThe
Options
following shows howClick on Apply.
to change the lower endpoint
Perform similar actions to change the interval width to 0.25
and the number of intervals to 13.Examine the Histogram Plot.
Click on Done.
B: In Figure 2.1 we present the default ExpertFit histogram for the service-time data.
Note that the histogram is quite “ragged,” since the interval width is too small. Using a
trial-and-error approach discussed in the Constructing a Histogram from Your Data
tutorial, we determined that a better histogram is obtained by using an interval width of
0.25. The improved “smooth” histogram is shown in Figure 2.2. In general we
recommend that you construct your own histogram rather than rely on the ExpertFit
default. There is no definitive prescription for choosing histogram intervals!
9
Note that the histogram interval width can also be changed by using the
two buttons with arrowheads at the top of the histogram screen. The left (right)
10
Histogram
0.10
0.08
0.06
Proportion
0.04
0.02
0.00
0.13 0.53 0.93 1.33 1.73 2.13 2.52 2.92
Interval Midpoint
23 intervals of w idth 0.133
11
Histogram
0.17
0.13
0.10
Proportion
0.07
0.03
0.00
0.13 0.63 1.13 1.63 2.13 2.63 3.13
Interval Midpoint
13 intervals of w idth 0.25
Figure 2.2. Histogram of the service-time data with an interval width of 0.25.
12
button decreases (increases) the interval width by 5 percent, and can be applied
repeatedly.
13
Steps for Action C:
At window: Do:
Data tab Click on the Models tab.
Models tab Click on Automated Fitting.
Note that the third-best fitting model is a Rayleigh distribution with an estimated
location parameter (denoted by “E”) of 0.062. (If we were to click on View/Delete
Models at the Models tab, we would see that the normal distribution was not
automatically fit to our non-negative service-time data. This is because the normal
distribution can take on negative values. However, the normal distribution could, if
desired, be fit to our data using the Fit Individual Models option at the Models tab.)
14
Table 2.3. Evaluation of the candidate models.
Evaluation: Good
Suggestion: Additional evaluations using Comparisons Tab might be informative.
See Help for more information.
15
Steps for Action D:
At window: Do:
Comparisons tab Click on Graphical Comparisons.
Graphical-Comparisons
Options Select Density-Histogram Plot.
Click on Apply.
Density-Histogram
Plot Click on Done.
16
Density-Histogram Plot
0.17
0.14
0.10
Density/Proportion
0.07
0.03
0.00
0.13 0.63 1.13 1.63 2.13 2.63 3.13
Interval Midpoint
13 intervals of w idth 0.25 1 - Weibull
Figure 2.3. Density-Histogram Plot for the fitted Weibull distribution and the
service-time data.
17
Steps for Action E:
At window: Do:
Graphical-Comparisons
Options Select Distribution-Function-Differences Plot.
Click on Apply.
Distribution-Function-
Differences Plot Click on Done.
Graphical-Comparisons
Options Click on Done.
18
Distribution-Function-Differences Plot
0.20
0.13
0.07
Difference (Proportion)
0.00
-0.07
-0.13
-0.20
0.06 0.50 0.93 1.37 1.81 2.24 2.68 3.11
x
Use caution if plot crosses line 1 - Weibull (mean diff. = 0.00501)
19
Steps for Action F:
At window: Do:
Comparisons tab Click on Goodness-of-Fit Tests.
Options for
Goodness-of-Fit Tests Select Anderson-Darling Test.
Click on Apply.
Options for
Goodness-of-Fit Tests Click on Done.
20
Steps for Action G:
At window: Do:
Comparisons tab Click on Applications tab.
Applications tab Click on Simulation Representation in the
Use a Specified Distribution (Model) section.
Simulation-Representation
Options Select the simulation software of your choice.
Click on Apply.
Simulation-Software
Representation Click on Done.
Simulation-Representation Click on Done.
Options
Applications tab In the File menu, select Close Data Analysis.
G: If you are using ExpertFit in the context of simulation modeling, we now see how
to put the selected Weibull distribution into the proper format for several different
simulation-software products using the Applications tab. In particular, the above
actions show how to represent the Weibull distribution in the software product of your
choice; the actual representations for selected products are shown in Table 2.4.
21
Table 2.4. Simulation-software representations for the Weibull distribution.
22
3. Example Data Sets
Example
23