ECE 3141 Statistical Distributions & Queuing Theory in Traffic Delay Problems
ECE 3141 Statistical Distributions & Queuing Theory in Traffic Delay Problems
5.1 Introduction
Statistical Distributions: In designing new traffic facilities or new control plans, it is necessary to predict
the performance of traffic with respect to some particular characteristic (e.g., the frequency of headways
of a particular size, the number of cars likely to arrive in an interval, and speeds exceeding a certain
value). It is often desirable to be able to make a prediction with a minimum amount of data available or
assumed. For instance, it may be necessary in designing a pedestrian control system to predict the
frequency of headways of greater than 10 sec; in designing a left-turn pocket it may be necessary to
predict how many times per hour the number of cars arriving during one signal cycle will exceed four.
Statistical distribution models enable the traffic engineer to make these predictions with a minimal
amount of information.
Statistical distributions are useful in describing a wide variety of phenomena where there is a high
element of randomness. In traffic the most important distributions are counting distributions-those useful
in describing the occurrence of things that can be counted-and interval distributions-those useful in
describing the occurrence of the (time) intervals between events. Distributions are also used,
however, in describing such phenomena as speeds and gap acceptance
‘Traffic’ in traffic engineering refers to vehicle, people or both that pass a specified point during a
given time period. Traffic can be human or vehicular. Traffic can be motorized or non-motorized.
Knowledge of fundamental flow characteristics and associated analytical techniques is an essential
requirement in planning, design and operation of transportation systems. Traffic flow
characteristics include;
Traffic engineers and planners assess traffic and environmental impacts of proposed system
modifications and thus can be accomplished only through a supply-demand framework that
requires understanding of flow characteristics and their interactions. Designers determine link
sizes and configure systems and must carefully evaluate the trade-off between traffic flow levels
and levels of service. Operators identify locations and causes of existing system defects and
generate operational improvement plans and predict their effects; all these require traffic flow
analysis.
Understanding the basic principles of traffic flow and traffic flow analysis is vital in designing and
operating transport systems with greatest possible efficiency and safety.
The analytical process for transport systems (w.r.t traffic flow) consist of predicting an output as
function of specified inputs. A flow chart is shown below to illustrate this analytical process and
to emphasize the importance of knowing fundamental flow characteristics and basic analytical
techniques.
Shock-wave analysis and queueing analysis are techniques available for the analysis of
oversaturated traffic systems. Traffic stream models serve as a beginning point for shock wave
analysis.
Shock-wave analysis
Flow-speed -density states change overs space and time. When these changes of states occur, a
boundary is established that demarks the time-space domain of the flow state from another. This
boundary is referred to as shockwave.
In some situations, the shockwave can be very mild like platoon of high-speed vehicles catching
up to a slightly slow-moving vehicle. In other situations, the shockwave can be very a significant
change in flow states, as when high speed vehicles approach a queue of stopped vehicles.
Shockwaves are defined as boundary conditions in the time-space domain that demark a
discontinuity in flow-density conditions
Queueing models
When demand exceeds capacity for a period of time or an initial time headway is less than the
service time (at microscopic level) at a specific location, a queue if formed. The queue maybe a
moving or a stopped one. Essentially excess vehicles are stopped upstream of the bottleneck or
service area, and their departure is delayed to a later time period. Queueing or waiting line
phenomena are everyday occurrences. Examples include vehicles waiting to be served at a petrol
station, passengers or vehicles lined up at a transit terminal, aero planes waiting clearance for take-
off or landing, vehicles at an intersection, toll plazas, parking facilities etc.
An arrival pattern,
A service facility,
A queueing discipline.
When all three components are constant, the system can be analyzed by deterministic methods.
Probabilistic systems are however more common.
Two analytical techniques can be applied in studying queueing processes;
i) Shockwave analysis,
ii) Queueing analysis
Shockwave analysis can be applied when the demand-capacity process is deterministic, and is
particularly well suited to evaluating the space occupied by the queueing process and to interacting
queueing processes.
Queuing analysis can be employed for deterministic and stochastic (probabilistic) processes, and
the vehicles in the process are considered as being stored in a vertical queue.
The mean arrival value is expressed as a flow rate, such as vehicles/hour, or as time headway, such
as seconds/vehicle. The arrival distribution can be specified as a deterministic distribution or a
probabilistic distribution. The term demand or input is sometimes substituted for the term arrival.
The mean service value is expressed as a flow rate such as vehicles/hour, or as time headway such
as seconds /vehicle. The service distribution can also be specified as a deterministic or probabilistic
distribution. The term ‘capacity’, ‘departure’ or ‘output’ is sometimes substituted for the term
service.
The fifth and last element to be specified is queueing discipline. The most common queue
discipline encountered is referred to as ‘first in, first out’. That is the vehicles are served in the
order in which they arrive, and the often symbol ‘FIFO’ is used to designate this queue discipline.
Other queue disciplines include ‘first in, last out’(FILO) systems encountered in elevators and
‘served in random order’(SIRO) systems.
A classification scheme in needed to assess the input characteristic in order to select the appropriate
queuing analysis: deterministic or stochastic queueing analysis. If either the arrival and /or the
service distribution is probabilistic, the exact arrival and /or service time of each vehicle is
unknown, and stochastic queueing analysis must be selected. On the other hand, if both arrival and
service distribution are deterministic, the arrival and service times of each vehicle are known and
deterministic queueing analysis is selected. The next two major portions of the unit are directed to
deterministic queuing analysis and stochastic queueing analysis
Queues or waiting lines arise when demand for a service facility exceeds capacity of that facility,
i.e. customers do not get service immediately upon request but must wait, or the service facilities
stand idle and wait for customers. Some customers wait when the total number of customers
requiring service exceeds the number of service facilities, some service facilities stand idle when
the total number of service facilities exceed the number of customers requiring service.
Waiting lines or queues are a common occurrence both in everyday life or in variety of business
and industrial situations. Most waiting line problems are centered about the question of finding the
ideal level of service that a firm should provide.
For example;
Supermarkets must decide how many cash register check out points should be opened.
Gasoline stations must decide how many pumps should be opened and how many
attendants should be on duty
Manufacturing plants must decide the optimal number of mechanics to have on duty in
each shift to repair machines that break down
Banks must decide how many teller windows to keep open to serve customers during
various hours of the day.
Evolution of queueing theory
Queueing theory has its beginning in the research work of a Danish engineer named A. K. Erlang.
In 1909 Erlang experimented with fluctuating demand in telephone traffic. 8 years later he
published a report addressing the delays in automatic dialing equipment. At the end of World War
II, Erlang’s early work was extended to more general and to business applications of waiting lines.
Some examples of waiting lines/queues are given in the table below;
The size represents the total number of potential customers who will require the service.
According to source
The source of the customers can be finite or infinite. For example, all people of a city or a state (or
others) could be potential customers at a supermarket. The number of people being very large, it
can be taken to be infinite. Whereas, there are many situations in businesses and industrial
conditions where we cannot consider the population to be infinite -it is finite.
According to numbers
The customers may arrive for a service individually of in groups. Single arrivals are illustrated by
patients waiting for a doctor, students reaching at a library counter etc. on the other hand, families
visiting restaurants, ships discharging cargo at a dock are examples of bulk or batch arrivals.
According to time
Customers arrive in the system at a service facility according to some known schedule (for example
one patient every 15 minutes or one a candidate for interview (every half an hour) or else they
arrive randomly. Arrivals are considered random when they are independent of one another and
their occurrence cannot be predicted exactly. The queueing models wherein customers’ arrivals
are known with certainty are categorized as deterministic models. (insofar as this characteristic is
concerned) and are easier to handle. On the other hand, a substantial majority of the queueing
models are based on the premise that the customers enter the system stochastically at random pints
in time.
Patterns of arrivals at the system
The arrival process (or pattern) of customers to the service system is classified into two categories;
static and dynamic. These two are further classified based on the nature of the arrival rate and the
control that can be exercised on the arrival process.
In static arrival process, the control depends on the nature of arrival rate
(random or constant). Random arrivals are either at a constant rate or varying with time. Thus, to
analyze the queueing system, it is necessary to attempt to describe the probability distributions of
arrivals. For such distributions we obtain average time between successive arrivals, also called
inter-arrival time (time between two consecutive arrivals), and the average arrival rate (i.e. number
of customers arriving per unit of time at the service system.
Output variables
Utilization rate ρ (server utilization, percentage of the time that a server is busy, where
c=the number of parallel servers)
Probability of n customers in the system Pn
Average number of customers in the system L (service and queue)
Average number of customers in the queue Lq
Average time spent by a customer in the system w (service and queue)
Average time spent by a customer in the queue wq
Behavior of arrivals
Another thing to consider in the queuing structure is the behavior or attitude of the customers
entering the queueing system. On this basis the customers may be classified as being;
a) Patient, or
b) Impatient
If a customer on arriving at the service system stays in the system until served, no matter how long
he has to wait for service is called a patient customer. Machines arriving at a maintenance shop
in a plant are examples of patient customers.
Whereas, the customer who wait for a certain time in the queue and leaves the service without
getting service due to certain reasons such as a long queue Infront of him is called an impatient
customer.
Balking – some customers, even before joining the queue get discouraged by seeing the
number of customers already in the service system or estimating the excessive waiting time
for a desired service, decide to return for service at a later time. In queueing theory this is
known as balking
Reneging – customers after joining the queue, wait for service sometime and leave the
service system due to intolerable delay, so they renege.
For example, a customer who has just arrived at a grocery store and finds that a the salesman are
busy in serving the customers already in the system, will either for the service till his/her patience
is exhausted or estimates that his waiting time maybe excessive and so leaves immediately to seek
service elsewhere.
Jockeying – customers who switch from one queue to another hoping to receive a service
more quickly are said to be jockeying.
SERVICE SYSTEM
The service is provided by aa service facility (or facilities). This maybe a person (a bank teller, a
barber) a machine (an elevator, a gasoline pump) or a space (an airport runway, parking lot, a
hospital bed) to mention just a few. A service facility may include one person or several people
operating as a team.
i) Single Server-Single Queue – the models that involve one queue- one service station
facility are called single serve models where customers wait till the service point is
ready to take him/her for servicing. Students arriving at a library counter is an
example of singe server facility.
ii) Single Server- Several Queues -- in this type of facility there are several queues and
the customer may join any one of them but there is only one service channel as
illustrated in the figure below;
iii) Several (parallel) Servers – Single Queue – in this type of model there is more than
one server and each server provides the same type of service facility. The customers
wait in a single queue until one of the service channels is ready to take them in for
servicing.
iv) Several Servers- Several Queues—This type of model consists of several servers
where each of the serves has a different queue. Different cash counters in an electricity
office where the customers can make payment in respect of their electricity bills
provide an example of this type of model.
v) Several facilities in a series – In this, a customer enters first station and gets a portion
of the service and then moves on to the next station, get some service then again moves
on to the next station …… and so on, and finally leaves the system having received the
compete service. For example, machining a certain steel item may consists of cutting,
turning, knurling, grinding, and packaging operations, each of which is performed by
a single serve in a series arrangement.
b) Speed of service
In a queueing system, the speed with which service is provided can be expressed in either of two
ways- as service rate or as a service time.
The service rate describes the number of customers serviced during a particular time period
The service times indicates the amount of time needed to service a customer
The service rates and times are reciprocal of each other and either of them is sufficient to
indicate the capacity of the facility.
Thus, if a cashier can attend, on an average 5 customers in an hour, the service rate would be
expressed as 5 customers /hour and service time would be equal to 12 minutes /customer.
Generally, we consider the service time only. If these service times are known exactly, the problem
can be handled easily. But as generally happens. If these are different and not known with certainty,
we have to consider the distribution of service times in order to analyze the queueing system.
Generally, the queueing models are based on the assumption that the service times are
exponentially distributed about some average service time.
QUEUE CONFIGURATION
The queueing process refers to the number of queues, and their respective
lengths. The number of queues depend upon the layout of service system. Thus, there may be a
single queue or multiple queues.
Length or size of the queue depends upon the operational situation such
as;
Physical space,
Legal restrictions and,
Attitude of the customers.
In certain cases, a service system is unable to accommodate more than the required number of
customers at a time. No further customers are allowed to enter until space becomes available to
accommodate new customers. Such type of situations is referred to as finite (or limited) source
queue. Examples of finite source queues are cinema halls, restaurants etc.
On the other hand, if a service system is able to accommodate any number of customers at a time,
then it is referred to as infinite (or unlimited) source. Example is in a sale department, here the
customer orders are received, there is no restriction in the number of orders that can come in, so
that a queue of any size can form.
In many other situations, when arriving customers experience long queue(s) Infront of a service
facility, they often do not enter a service facility even though additional waiting space is available.
The queue length is such cases depend upon the attitude of the customers. For example, when a
motorist finds that there are many vehicles waiting at a petrol station, in most of the cases he does
not stop at this station but seeks the service elsewhere.
QUEUE DISCIPLINE
In the queue structure, the important thing to know is the queue discipline.
The queue discipline is the order or manner in which customers from the queue are selected for
service. There are a number of ways in which customers in the queue re served. Some of these are:
a) Static queue disciplines are based on individual customer’s status in the queue. Few of
such disciplines are;
i) If the customers are served in the order of their arrival, then this is known as first-come,
first-served (FCFS) service discipline. Prepaid taxi queue at airport where a taxi is
engaged on first-come, first-served basis is an example of this discipline.
ii) Last-come-first-served (LCFS)—Sometimes, the customers are serviced in the reverse
order of their entry so that the ones who join the last are served first. For example, assume
that letters to be typed, or the order of forms to be processed accumulate in a pile, each new
addition being put on top of them. The typist of the clerk might process these letters or the
orders by taking each new task from the top of the pile. Thus, a just arriving task would be
the next to be serviced provided that no fresh task arrived before ii is picked up. Similarly,
people who join the elevator last are first ones to leave it.
b) Dynamic queue disciplines are based on the individual customer attributes in the queue.
Few of such disciplines are;
i) Service in random order (SIRO)—under this rule customer are selected for
service at a random order, irrespective of their arrivals in the service system. In
every customer in the queue is equally likely to be selected. The time of arrival of
the customers is, therefore, of no relevance in such as a case.
ii) Priority service— Under this rule customer are grouped in priority classes on the
basis if some attributes such as service time or urgency or according to some
identifiable characteristic, and FCFS rule is used to within each class to provide
service. Treatment of VIPs in preference to other patients in a hospital is an example
of priority service.
o SPT (shortest processing time first) is one of the queue disciplines. To which
category, static or dynamic class of disciplines, does it belong?
5.3 DETERMINISTIC QUEUEING MODELS
The fundamentals of queueing theory maybe best introduced by means of an example. Consider
an airport runway being used for takeoffs only. Air traffic rules state that there can only be one
aircraft on the runway at a time. Each aircraft spends about the same time on the runway, so that
runway can process takeoffs at fairly constant rate. In the queuing theory jargon, the runway would
be the server, and the maximum rate at which it can process takeoffs, under given conditions,
would be called at its service rate. For most transportation applications, the service rate of a facility
will be identical to its capacity under a given set of conditions.
Arrivals of aircrafts waiting to take off are not constant. A plot of the cumulative number of aircraft
demanding takeoff versus time might look like figure below.
The step function in figure 1 above is called arrival function, A(t). in most transportation
applications the arrival function (or arrival curve, if the step function is approximated with a
smooth curve) will be identical with the cumulative demand for facility or system for time period
being considered
Now suppose that each aircraft takes fixed amount of time to clear the runway. Given this service
time, it is possible to plot a departure function D*(t), which shows when each aircraft clears the
runway. It is also possible to plot a plot a function D(t), which shows when each aircraft begins its
takeoff run. The departure function D(t) is usually of more interest than D* (t) , since the takeoff
itself is a necessary part of the use of the runway, and should not be considered queueing delay.
Figure below shows the departure functions D*(t) and D(t) in relation to the arrival function A(t)
The difference between A(t) and D(t) (or D*(t)) represents aircraft which have arrived at the
runway but have nit begun or completed their takeoffs. These aircrafts waiting to be served are
said to be in a queue, and the line of the aircraft (or whatever) waiting to be served is called a
queue.
From the queueing diagram shown on the figure above, it is possible to determine a number of
interesting things:
In many applications it is appropriate to approximate step functions A(t) and D(t), with smooth
curves, and to ignore the finite service time – if indeed, a finite service time exists. In other cases,
it’s more convenient to approximate A(t) and D(t) with stepwise linear functions.
Queueing diagrams have several important properties. It is very important that students understand
these properties and be able to apply them in actual queueing problems. These are;
i) The slope of D(t) is the departure rate; the slope of A(t) is the arrival rate.
ii) The departure rate cannot exceed the service rate or capacity of the server. It may be
less.
iii) Cumulative departures can never exceed cumulative arrivals. D(t) can never be above
A(t) in the queueing diagram.
iv) When a queue is present, the departure rate will be equal to service rate. When no queue
is present, the departure rate will be equal to arrival rate. The queue first form when
arrival rate exceeds the service rate.
An excellent starting point is provided by D/D/1 queueing model, which is the simlest waiting line
model. It assumes the following;
i) Deterministic arrival,
ii) Deterministic service time,
iii) Single channel server,
iv) FIFO and
v) Infinite queue length.
For a simple queueing model, an important measure is its traffic intensity whereby;
Examples;
1)
Customers arrive at a movie theatre ticket window at the rate of 10/hour and are serviced
at a constant rate of 12/hour. Describe how this system will perform.
Solution
2)
Customers arrive at the ticket counter of a local movie theatre at the rate of
240 persons /hr. at 5.30 P.M. After 10minutes, the arrival rate declines to 60persons /hr. and
continues at that level for 20 minutes. If the time required to serve each customer is 20 seconds,
describe the performance of the system.
Solution
240 𝑝𝑒𝑟𝑠𝑜𝑛/ℎ𝑟
𝜆1 = = 4 𝑝𝑒𝑟𝑠𝑜𝑛/𝑚𝑖𝑛 t ≤ 10 min
60 𝑚𝑖𝑛/ℎ𝑟
60 𝑝𝑒𝑟𝑠𝑜𝑛/ℎ𝑟
𝜆2 = = 1 𝑝𝑒𝑟𝑠𝑜𝑛/𝑚𝑖𝑛 t > 10 min
60 𝑚𝑖𝑛/ℎ𝑟
60 𝑝𝑒𝑟𝑠𝑜𝑛/ℎ𝑟
𝜇 = = 3 𝑝𝑒𝑟𝑠𝑜𝑛𝑠 /𝑚𝑖𝑛 for all t
20 𝑚𝑖𝑛/ℎ𝑟
Therefore, the number of person arrivals at a time t equals 4t for t ≤ 10 min. also, the number of
persons arrivals [40+1(t-10)] for t > 10 min and the number of persons with tickets (departures) is
3t for all t.
These equations are depicted in the figure below. Notice that when the arrival curve is above the
departure curve a queue will exist. The queue will dissipate at the time when arrival and departure
curves intersect.
⸫ 40+1(t-10) = 3t and t = 15
Thus, the queue which begun to form at 5.30 P.M., will dissipate at 5.45P.M.
The longest queue will occur at time t = 10 of 10 persons, and the longest delay will be 13.33 -
10 = 3.33 minutes.
1 1
𝑇𝑜𝑡𝑎𝑙 𝑑𝑒𝑙𝑎𝑦 = (10 ∗ 10) + (10 ∗ 5) = 75 𝑝𝑒𝑟𝑠𝑜𝑛/𝑚𝑖𝑛
2 2
3)
Vehicles arrive at a tool booth at a rate given by λ(t) = 5.2 – 0.20t, where λ(t)
is in vehicles /min and t is in minutes. The tollbooth operator processes the vehicles at a rate of μ
(t) = 3 where μ (t) is in vehicles /minute. Determine the total delay, maximum queue length and
the time when the queue will dissipate.
The signalized intersection serves as one of the best examples of deterministic queueing analysis
at the macroscopic level because its relatively simple and because of our personal experience with
such queuing processes. Queueing on only one approach to at a signalized intersection having only
two signal phases for that approach is considered.
Only an undersaturated situation is considered in this example. That is in each cycle the arrive
demand is less than the capacity of the approach, no vehicles wait longer than on the cycle, and
there is no overflow from one cycle to the next. A number of simplifying assumptions are included
with this first example of deterministic queueing analysis to emphasize the concept and reduce
confusion with more complicated real-life situations. Later examples demonstrate techniques for
handling more complicated real-life situations.
The figure below provides for all input requirements needed to solve this problem.
The arrival rate ( 𝜆) is specified in vehicles /hour and is constant for the study period. The service
rate (𝜇) has tow states: zero when signal is effectively red and up to saturation flow rate (s) when
the signal is green. Note that the service rate can be equivalent to saturation flow only when a
queue is present. Otherwise, the service rate is equal to arrival rate if the signal is green. The queue
discipline is assumed to be “first in, first out” (FIFO) system.
Directly below the flow rate versus time diagram, accumulative vehicles versus time diagram is
constructed. A horizontal line (in the figure below) such as the arrival rate ( 𝜆) appearts as a sloping
line with the slope equal to the flow rate. Thus, the arrival rate goes through the origin and the
slope up to the right with a slope equal to the arrival rate.
Now the service rate in the former figure is transformed to the latter figure. During the red period
the service rate is zero. So, the service is shown as horizontal line in the lower diagram. At the
start of the green period a queue is present, and the service rate is equal to the situation flow rate
(s). As shown in the latter figure, the cumulative arrival intersects the cumulative service line
during the green period. At this point in time the queue is dissipated and the cumulative service
line overlays the cumulative arrival line until the end of green period. Then the pattern repeats
itself, with the service rate varying again from zero to saturation flow rate to arrival flow rate.
A series of identical triangles are formed, with cumulative arrival line forming the top side of
triangles and the cumulative service line forming the other side of triangles. Each triangle
represents one cycle length and can be analyzed to calculate a set f five measures of performance;
The time duration of the queue is represented by the horizontal projection of the queuing triangle.
It starts at the beginning of the red period and continues until the queue is dissipated. Its value
caries between effective red time and cycle length., and is expressed in seconds. Two measures of
performance are associated with queue time duration: time duration of queue (t Q) and percent time
queue is present (PtQ).
𝜆𝑡𝑄 = 𝜇(𝑡𝑄 − 𝑟)
𝜆𝑡𝑄 = 𝜇𝑡𝑄 − 𝜇𝑟
𝑡𝑄 (𝜇 − 𝜆) = 𝜇𝑟
𝜇𝑟
𝑡𝑄 =
𝜇−𝜆
100𝑡𝑄
𝑃𝑡𝑄 =
𝐶
The time duration of the queue is helpful in understanding the storage and de-storage of vehicles
in the queue and is also in assessing the degree of saturation of the approach to the signalized
intersection. The figure below is an extension of the former figure with additional information
displayed.
Near the middle of the second green phase tQ terminates, and the queue is dissipated. This point in
time is projected on to the figure (a) above, and three areas are denoted. The fist area (A1)
represents the number of vehicles stored during previous red phase. When the signal changes to
green, the second area (A2) begins to enlarge. The second area (A2) denotes the number of vehicles
that are de-stored. When the second area is equal to the first area, the queue is dissipated. The third
area also has some significance, for it represents how many additional vehicles could be served
during this signal cycle.
The number of vehicles experiencing queuing is represented by vertical projection of the queueing
triangle. The first vehicle experiencing the queue is the vehicle that arrives just after the signal turn
red. All vehicles arriving during the red as well as vehicles arriving during the green but before
the queue is dissipated experience the queueing processes and are forced to stop or slow down
significantly. Its value caries between 𝜆𝑟 and 𝜆𝐶 and is expressed in number of vehicles. Three
measures of performance are associated with the number of vehicles experiencing queuing:
𝜆𝑡𝑄
𝑁𝑄 =
3600
𝜆𝐶
𝑁=
3600
100𝑡𝑄
𝑃𝑁𝑄 =
𝐶
The queue length is represented by the vertical distance through the triangle. At the beginning of
the red, the queue length is zero and increases to its maximum value at the end of the red period.
The queue length decreases until the arrival line intersects the service line when the queue length
is equal to zero. The queue length remains equal to zero until the end of the green period when the
pattern repeats itself. Three queue measures of performance are of primary interest;
𝜆𝑟
𝑄𝑚 =
3600
𝑄𝑚 𝜆𝑟
𝑄̅𝑄 = =
2 7200
𝑄𝑚 𝑡𝑄
𝑄̅ =
2𝐶
Individual delay is represented by the horizontal line across the triangle. The first vehicles to arrive
after the beginning of the red encounters the largest individual delay. Each vehicle arriving
thereafter experiences smaller and smaller individual delay until the queue is dissipated. Vehicle
arriving thereafter until the beginning of the next red encounter no individual delay. Three delay
measures of performance are of primary interest:
𝑑𝑀 = 𝑟
𝑟
𝑑̅𝑄 =
2
𝑟𝑡𝑄
𝑑̅ =
2𝐶
The total delay cycle is represented by the cross-sectional area of the queueing diagram triangle
and is expressed in vehicle-seconds. Any of the following equations can be used to calculate the
total delay (TD) in vehicle -seconds;
𝑁𝑄 𝑟
𝑇𝐷 =
2
𝑄𝑀 𝑡𝑄
𝑇𝐷 =
2
𝑇𝐷 = 𝑑̅ 𝑁
Queueing patterns
A variety of queueing patterns can be encountered and a classification of these patterns is proposed
in this section. The classification scheme is based on how arrival and service rates vary over time.
For instance, the signalized intersection example previously presented had the characteristics of a
constant arrival rate over time and a varying service rate over time.
A proposed classification scheme is shown in the figure below, which is a 2 x 2 matrix resulting
in four cells. Each cell can then further be subdivided into subclasses of queueing patterns.
Consider the upper left-hand cell that represents the pattern of a constant arrival rate and a constant
service rate over time. Two rather less interesting sub patterns are encountered. If the arrival rate
is less than the service rate, no queuing is ever encountered. If on the other hand, the arrival rate
is greater than service rate, the queue has a never-ending growth with queue length equal to product
of time and the difference between arrival and service rates.
The upper right-hand cell represents the pattern where the arrival rate is constant over time while
the service rate varies over time. It should be noted that the service rate must be less than the arrival
rate for some period of time, but greater than the arrival rate for other periods of time. Again, this
cell can be further subdivided into subclasses of queueing patterns. Two are shown in the figure
above, but the service rate does not have to be in the form of square-wave. That is, several changes
in service rates of different amounts can be encountered, and the changes do not have to occur
instantaneously but during transitional periods. The signalized intersection example fits the
diagram to the left in this cell, while the occurrence of an incident or accident would result in a
diagram similar to that shown to the right in this cell.
The lower left-cell represents the pattern where the arrival rate varies over time, while the service
rate is constant over time. For queueing to occur and then be dissipated, the arrival rate must be
greater than the service rate for some periods of time and less than service rate during other periods
of time. Two subclasses of queueing patterns have been shown in the figure above. The one on the
left in the cell indicates a square-wave type of arrival rate, while the one on the right in the cell
provides for transitional periods during changes in arrival rates.
The lower right-hand cell represents the more complex situation where both arrival and service
rates vary over time. For queueing to occur then be dissipated, the arrival rate must exceed the
service rate and the later be less than service rate. Two subclasses of queueing patterns are shown
in the figure above. The one on the left cell indicates a square- wave type of arrival and an inverted
square-wave type of service rate. The diagram on the right-hand side is an extension of the first
one with transitional periods during changes in arrival and service rates. Analysis of these more
involved queueing patterns are extensions of queueing patterns that are described in the next two
sections. Simulation is often employed when these more complex queueing patterns are
encountered particularly when sensitivity of parameter values is to be investigated.
The figure above and the accompanying text suggested two possible queueing patterns when
service rate vary over time while arrival rates are constant. Signalized intersections and at-grade
highway-railroad crossings are examples of one queueing pattern, while an incident or an accident
site could cause other queueing pattern. The essential difference between these two cases is
whether the reduced service rate goes to zero or not. Another less obvious difference is whether
the service rate variation is repetitive, such as at a fixed-time traffic signal, or whether it is not
repetitive, such as at a highway-railroad at-grade crossing, incident or accident site, or at a traffic-
responsive controlled traffic signal. Since the signalized intersection was addressed, attention is
now given to queueing analysis at the site of an incident.
The queueing diagram for an incident situation is shown in the figure below;
The figure provides for all input requirements needed to solve this problem. The arrival rate (𝜆 )
is specified in vehicles per hour and is constant for the study period. The normal service rate
(without an incident) is indicated in the diagram as μ, and since it exceeds arrival rate, no queuing
would normally exist. However, an incident occurs that reduces the service rate to μR which is
below the arrival rate, and this lower service rate maintained for t R hours. As in the case for most
highway situations, a “first in, first out” (FIFO) queue discipline is assumed.
In the figure below, a cumulative vehicle versus time diagram is constructed. The arrival is shown
as a straight line passing through the origin with a slope up and to the right equivalent to arrival
rate (𝜆). For the first period of time the service rate becomes equivalent to μR and maintains a flatter
slope until the incident is removed. Then the service rate increases to μ, and the service line has a
steeper slope. This continues until the arrival line and service line intercept, at which the service
line once again overlays the arrival line.
5.4 STOCHASTIC QUEUEING MODELS AND APPROACHES TO DELAY PROBLEMS
5.4.1 INTRODOCTION
So far, the queueing models presented here have assumed that arrival (and possibly) service
functions vary over time in some known fashion. Such queueing models are known as
deterministic queueing models. They are appropriate for many transportation applications because
of the prevalence of distinct traffic peaks in which the capacities of servers are exceeded on a
regular basis. Much of the literature of queuing theory, however, concentrates on a different
situation, in which there are constant long-term average arrival and service rates, but there are also
short-term random fluctuations around these average rates. Under these conditions, arrival rates
will exceed service rates for short time intervals, and queue will form. Models of this type of
queueing are referred to as stochastic queueing models.
Stochastic queueing models are classified as to assumptions made about the distributions of the
arrivals and service times and the number of channels (parallel servers) involved. For instance, an
M/D/1 queueing system is one in which arrivals are exponentially distributed (that is, the number
of arrivals during a given time interval follows a Poisson distribution; the negative exponential
distribution is the distribution of times between arrivals under this assumption), the service rate is
deterministic (that is, no random variation), and there is one channel. Features of interest for
stochastic queuing systems include;
2𝜌 − 𝜌 2
𝑄=
2(1 − 𝜌)
𝜌
𝑤=
2𝜇(1 − 𝜌)
2−𝜌
𝑡=
2𝜇(1 − 𝜌)
The most commonly encountered stochastic queuing model is the M/M/1 model, in which both
arrivals and service times are exponentially distributed and there is one channel. For the M/M/1
model, the average queue length, waiting time and total delay are given by;
𝜌2
𝑄=
1−𝜌
𝜆
𝑤=
𝜇(𝜇 − 𝜆)
1
𝑡=
𝜇−𝜆
Stochastic queuing systems are said to be under-saturated when traffic intensity is less than 1.0
and to become saturated when it reaches or exceeds 1.0. Note that for both of the models presented
above; queue lengths, waiting times, and total delays approach infinity as the traffic intensity
approaches 1.0. This is as a result of the assumptions that the arrival rate continues for an indefinite
period of time. Because of this feature, stochastic queueing models are not appropriate for
situations in which there is a definite nonrandom peak in which the arrival rate exceeds the capacity
of the server. On the other hand, deterministic queuing models ignore the random fluctuations in
arrivals and services times which are almost always present and so underestimate queue lengths
and delays. Neither, model is accurate representation of conditions during the transition from
under-saturated to saturated conditions. For this situation model known as the diffusion
approximation has been proposed.
5.4.2 DERIVATION
Output variables
Utilization rate 𝜌( server utilization, percentage of time that server is busy, where c = the
number of parallel servers).
Probability of n customers in the system Pn.
Average number of customers in the system L (service and queue).
Average number of customers in the queue Lq.
Average time spent by a customer in the system W (service and queue).
Average time spent by a customer in the queue Wq.
Transient versus steady state behavior
If 𝜌 > 1 then the system is unstable; on average more customers arrive than the system can
handle.
A ‘traffic intensity’ a is used for systems with finite population.
Little’s Equation
Average number of customers in the system for M/M/1 in steady state condition
Steady state, so P0 *1 = P1 *m
But also; P0 *1 +P2 *m = P1 * (1+m)
Pn = 1 n/m * P0
For the sum of P, the following holds;
𝑛=∞
∑ 𝑃𝑛 = 1
𝑛=0
𝜆𝑛
If the system in the steady state then 𝜆/ 𝜇 ˂ 1 and: ∑𝑛=∞
𝑛=𝑜 𝜇 can be replaced by;
1/ (1- 𝜆/ 𝜇)
𝜆
Using this, we can calculate tata P0 equals; 𝑃0 = 1 − 𝜇
𝜆 𝜆𝑛
Thus, we get for Pn; 𝑃𝑛 = (1 − 𝜇)(𝜇 )
Fill in and use Little’s equation. Then we get the table with the most important values for
M/M/1 system.
𝜆
𝐿=
𝜇−𝜆
𝐿 1
𝑊= =
𝜆 𝜇−𝜆
1 𝜆
𝑊𝑄 = 𝑊 − =
𝜇 𝜇(𝜇 − 𝜆)
𝜆2
𝐿𝑄 = 𝜆𝑊𝑄 =
𝜇(𝜇 − 𝜆)
𝜆 𝜆𝑛
𝑃𝑛 = (1 − )( )
𝜇 𝜇
Exercise
On average one customer per minute arrives (number of customers per minute Poisson distributed).
Average service time is 40 seconds per customer (exponentially distributed). What is;
M/D/1 system
N/2 𝜆= 𝜇
1-PN {1-aN/1-An+1 𝜆≠ 𝜇
{N/N+1 𝜆= 𝜇
𝝀e 𝜆(1-PN) = 𝜇(1-P0) = 𝜇 e
𝝆 𝜆e/ 𝜇= 1-P0
W L/𝜆e
WQ W-1/ 𝜇
LQ 𝜆eWQ = L-(1-P0)
Pn {(1-a) an/1-aN+1 𝜆≠ 𝜇
𝑛 = 0,1, … , 𝑁
{1/…… 𝜆= 𝜇
Exercise
Small travel agent with one employee, 3 chairs to wait, 𝜆= 5 customers per hour (Poisson), 𝜇= 10
customers per hour (Poisson). Calculate;
More servers,
If the number of customers in the system n ˂ c then new arrival can be served immediately,
Stable system if 𝜆˂ c𝜇,
Utilization rate is not 𝜌 = 𝜆 /𝜇 but 𝜌 = 𝜆/c 𝜇,
If 𝜌 > then the system grows with ( 𝜆 - c 𝜇),
Complex formulas, so often tables or graphs are used.
Exercise
Customers at a post office. 1 queue and more servers. Arrival intensity 2 per minute (Poisson),
service time 40 seconds (exponential). Determine;
A flow chart depicting various queuing situations is shown in figure below. If the arrival
distribution and /or service distribution is probabilistic, the exact arrival and /or service time of
each vehicle is unknown and stochastic queuing analysis rather than deterministic queueing
analysis must be selected.
To use stochastic queuing analysis, the traffic intensity (𝜌) must be less than 1. Traffic intensity
is;
𝜆
𝜌=
𝜇
The arrival and service rates can be expressed in a variety of ways. For example, the time interval
maybe specified in second, hours and so on, but the arrival and service rates must employ the same
time units. In some cases, mean arrival times and /or, mean service times are given and require
conversion to arrival and service rates as shown below;
3600
𝜆=
ℎ̅
3600
𝜇=
𝑠̅
If the traffic intensity is greater than 1, the only possible solution approaches are the convert the
queuing process to a deterministic queuing problem or to introduce multi-time slice, varying mean
arrival rates, and mean service rates and solve using microscopic simulation techniques.
Deterministic queuing analysis was discussed and simulation techniques will be discussed alter.
The reminder of this chapter is devoted to analytical approach of solving stochastic queuing
problems.
There are many types of probability distribution of arrival and service rates. A classification
scheme based on a few of more common distributions is shown in the Table below. If the arrival
and service distributions both have constant mean values, a deterministic queuing approach is
employed. The letter D denotes a constant mean value, while M, E and G represent random,
Earlang, and generalized forms of probability distributions.an unlimited number of probability
distributions can be utilized, but sets of mathematical equations have been derived for only a few
of the more common probability distributions.
Referring to the table, each cell is denoted by two letters (i.e. M/D). This represents a stochastic
queuing distribution situation in which arrivals are randomly distributed while service rimes are
constant. This code identification scheme can be further extended by identifying the number of
service channels, queue limits, and the queuing discipline characteristics. For example;
identifies the same stochastic queuing problem described above and in addition indicates there is
a single service channel with queue discipline characteristics of permitted queue length of infinity
with no diversion and a “first in, first out” service system. If only a code of M/D is given, it is
understood to represent M/D/1 (∞, FIFO).
Three stochastic queueing situations based on the analytical approach are presented in the next
three sections. The first two deal with single-channel services, with random arrival distributions
and with random or constant service distributions [M/M/1(∞, FIFO) and M/D/1 (∞, FIFO)]. The
final section is devoted to Multi-channel systems [M/M/N (∞, FIFO)].
Random service problem
The set of equations used for stochastic queuing problem in which both the arrival and service
times are randomly distributed is shown in the table below. The discussion in this section is
devoted to single-service channel with no limit on queue length and “first in, first out” service.
The traffic intensity, 𝜌, is the ratio if mean arrival rate to the mean service rate and can vary from
zero to 1. As 𝜌 increases the various queuing performance measures increase at an increasing rate
until 𝜌 = 1, the various measures go to infinity. The probability of some or more vehicles being in
the system at any point in time is equal to traffic intensity. Therefore, the probability of an empty
system is equal to (1- 𝜌). If the analyst is untested in more specific information about the number
of vehicles in the system, Gerlough and Huber have shown that;
𝑃 (𝑛) = 𝜌 𝑛 (1 − 𝜌)
𝜌 = traffic intensity
The probabilities of n or fewer vehicles in the system as a function of traffic intensity are shown
the figure below. For example, if the arrival rate is 100 vehicles per hour and service rate is 150
vehicles per hour, the traffic intensity is equal to 0.67. From the figure, the probability of exactly
n vehicles in the system and cumulative probabilities can be obtained. For example, the probability
of exactly 1,1,2,3,4 and 5 vehicles in the system is 0.33, 0.22, 0.15, 0.10, 0.07 and 0.04
respectively. Note that as 𝜌 approaches 1, the probabilities of a large number of vehicles in the
system increases very rapidly.
The queue length characteristics as a faction if traffic intensity is shown the follow figure. Using
the previously calculated traffic intensity value of 0.67 and the equations shown in the previous
table, the queue length characteristics would be;
𝐸 (𝑛) = 𝜌 + 𝐸(𝑚)
Special consideration should also be given to the relationship between the average number waiting
when a queue is present [E (m/m > 0)] and the average number waiting to be served [E(m)].
Contrary to initial intuition, the greatest difference occurs under low traffic intensity, and the
difference approaches zero when traffic intensity approaches 1 because the probability of an empty
system approaches zero.
In reference to the average number in the system [E(n)] and its variance [Var (n)], they are
approximately equal under low traffic intensity, while the ratios becomes 1 to 10 when the traffic
intensity equals 0.9.
The effects of arrival and service rates on average waiting time and total waiting time are shown
figures below. The two figures are constructed in a similar manner, when the mean arrival is plotted
on the vertical axis and the mean service rate on the horizontal scale. Two sets of contour lines are
constructed on the figures. The traffic intensity continue line appear as radial lines extending up
and to the right from the origin. Steeper slopes represent highest traffic intensities. The second set
of contours lines represent average waiting time in first figure and the total system waiting time in
the second figure. The two diagrams clearly show how waiting times increase rapidly as traffic
intensity approached unity.
The set of equations for a stochastic queuing analysis problem in which arrivals are randomly
distributed and the service rates is constant is known shown in the table below. The discission in
this section is devoted to single-channel service with no limit on queue length and ‘first in, first
out” service.
As an example, the queuing performance in single-channel system with random arrivals and
constant service. The figure below demonstrates the effects of arrival and service rates on average
number and average time in the system. The contour lines in first figure represent the average
number of vehicles in the system and extend up to the right from the origin. Traffic intensity (𝜌)
can also be shown as contour lines in the same diagram. The diagram clearly shows the rapid
increase in the average number in the system as traffic intensity approaches a value of 1.
The second figure shows contour lines representing average vehicle time in the system. This
relationship is a little more complex since the average time is a function of the traffic intensity and
mean service rate. The diagram clearly shows the rapid increase in the average time in the system
as traffic intensity approaches a value of 1 and as mean service time approaches zero.
Examples
M/D/1
Question
Customers arrive at a ticket counter of a local movies theater at the rate of 165 persons per hour
over the period until, the movie starts, while the service rate is 195 persons per hour. Compute the
characteristics of the system.
Solution
165
- Mean arrival rate: 𝜆= = 2.75 𝑝𝑒𝑟𝑠𝑜𝑛𝑠 /𝑚𝑖𝑛
6
(exponential)
- Constant service rate: 𝜇 = 3.25 𝑝𝑒𝑟𝑜𝑠𝑛𝑠 /𝑚𝑖𝑛
𝜌 = 0.846
𝜌2 0.8462
- Mean number of units in the system: 𝐿𝑞 = 2(1−𝜌) = 2(1.154) = 2.327 𝑝𝑒𝑟𝑠𝑜𝑛𝑠
1.154 𝑚𝑖𝑛
- Service facility utilization factor: 𝜌 = 0.846 𝑚𝑖𝑛
- Probability of no units in the system: 𝑃0 = 1 − 𝜌 = 0.154 𝑚𝑖𝑛
M/M/1
Question
A movie theatre ticket both has a mean arrival rate 3 person/minute and service rate is 4 persons
/minute. Calculate the characteristics of this queuing system applying the M/M/1 model.
Solution
𝜆
𝜆 = 3; μ = 4; 𝜌 = 𝜇 = 0.75
𝜆 3
𝐿 = 𝜇−𝜆 = 4−3 = 3 𝑝𝑒𝑟𝑠𝑜𝑛𝑠
𝜌2 0.752
𝐿𝑞 = 1−𝜌 = 1−0.75 = 2.25 𝑝𝑒𝑟𝑠𝑜𝑛𝑠
1 1
𝑊 = 𝜇−𝜆 = 1 = 1.00 𝑚𝑖𝑛
𝜆 3
𝑊𝑞 = 𝜇(𝜇−𝜆) = 4(1) = 0.75 𝑚𝑖𝑛
𝜆
𝐼 = 1 − 𝜇 = 1 − 0.75 = 0.25
Engineers and managers making decisions regarding the level of service that needs to be
maintained in handling goods, vehicles and people often have o consider the overall savings that
would accrue if an additional worker were to be put in operation. After all, the savings of
customer’s tike is highly important for a business that intends to increase its clientele. The cost
trade-off relationship between the total cost of the facility and the level -of- service is shown in
the figure below. The objective o to minimize the total cost of service and waiting time to achieve
respectable service level.
Multichannel problem
In the previous discussion a single file of arriving vehicles being served a single channel was
addressed. If the multichannel server is provided (such as several tolls at a toll plaza), two types
of entry control could be implemented. One type would be to introduce as many entry channels as
service channels and divide the arrival demand equally between channels. That is, the arrival rate
of each channel would be equal to the total arrival rate divided by the number of channels. The
service rate for each channel would be equal to the total service rate divided by the number of
channels. The traffic intensity for each individual channel or for the combination of all channels
would be;
𝜆⁄𝑁 𝜆
𝜌= =
𝜇 ⁄𝑁 𝜇
The key feature of this approach is that once the arriving traffic divides equally between channels
serving the various service areas, the traffic cannot change channels even if their channel is queued
and a parallel channel is empty. This causes inefficient use of the service area and results in greeter
delays and loner queues.
Another type of entry control can overcome this inefficient use of service area by combining the
arrival traffic into one channel and then selecting the lead vehicle which is waiting, to go to the
available empty service channel. This will keep the service are busy as long as the queue is present.
The difficulty is the practical implementation of such an entry control scheme. Probably the most
successful examples of such an entry control schemes are at banks and in airport check-in counters.
An example problem is presented to demonstrate the application of the two entry control schemes
and to provide a comparison of their performance. In this example, assume a total arrival rate of
800 vehicles per hour with the arrival randomly distributed. Two toll gates are available for service,
and each toll gate a service time of 6 seconds, which is also randomly distributed. There is no limit
on the length of the queue. The difference is the entry control scheme, that is, how the vehicles are
served.
- Consider the first entry control scheme in which the arriving vehicles are distributed
equally between two toll gates. In concept, there is a barrier between the two toll gates,
and once a vehicle is assigned a toll gate service channel it cannot change to the other
channel. Therefore, each channel is independent of the other, and the arrival rate and
service tomes per channel are calculated and analyzed as a single-channel problem as
discussed earlier. In essence this multichannel problem is converted to a single- channel
problem, M/M/1(∞, FIFO). The arrival rate per channel is 400 vehicles per hour and
the service rate per toll gate is 600 vehicles per hour. The traffic intensity (𝜌) is
calculated as 0.67. the various queueing performance characteristics can be calculated
using previously discussed equations. These results and a comparison of results with
the other entry control scheme are discussed next.
- Now considering the other entry control scheme, where a single arrival queue is
formed and the lead vehicle in the queue is always selected to go to the first available
service channel. This is a multichannel problem that is analyzed as a multichannel
problem and classified as M/M/2(∞, FIFO). The total arrival rate (𝜆) is 800 vehicles
per hour, the service(μ) per channel is 600 vehicles per hour, and the total service
rate(μN) is 1,200 vehicles per hour. Note that while 𝜆 denotes the total arrival rate, μ
denotes the service rate per channel and μN denotes the total service rate. The queueing
performance equations for random arrival-random service distribution system (where
N > 1) are summarized in the table below.
Queuing characteristics M/M/N (∞, FIFO)
Symbol Definition EQUATION
𝜌 Traffic intensity 𝜆
𝜇
Uf Utilization factor 𝜌
𝑁
P(0) Probability of an empty system 𝑁−1
𝜌𝑛 𝜌𝑁
∑( )+
𝑛! 1−𝜌
𝑛=0 𝑁! [ 𝑁 ]
The traffic intensity (𝜌) can be calculated to be 1.33. a new term utilization factor (Uf) is
introduced, which is the ratio of the total arrival ret divided by total service rate, which in this
example is equally to 0.67. While 𝜌 can exceed 1.0, Uf cannot.
The queueing performance results of the two entry control schemes are summarized in the table
below.
In cases where simulation model itself becomes the end product rather the use of the model, or
where the builder concentrates too much effort in small, less significant aspect of the model and
loses sight of the total model and its use and output, it is not worth the cost and time
Steps in Developing a Simulation Model:
A flow chart of procedural elements of simulation are given in Figure 5.5.1 below.
SUMMARY
The reasons for simulation include (but need not be limited to):
1. The need to test the behavior of a new system or operating procedure prior to its actual
construction:
(a) The construction of the new system may be very expensive and/ or time consuming.
(b) Experimentation with the real system may entail considerable risk (such as traffic accidents).
2. The need to test alternate systems under identical conditions. For instance, it is never possible
to exactly reproduce a specific traffic condition in the field; in simulation it is quite routine to
submit the same traffic conditions to several alternative systems.
3. One can experiment with various control policies without disrupting real traffic and making
that traffic part of an experiment.
4. The same can be said of alternative designs; one can experiment with alternative designs for
remedies to existing or future conditions, such lane additions, rerouting of traffic, changes in
direction of links, suitability of junction and intersection designs, etc.
5. Above all, these alternative control implementations and designs can be considered rapidly,
without major capital investment.
6. Whenever desired, various alternatives can be considered with exactly the same traffic load
and conditions, something that is simply not feasible in the real world.
Interpretation of Results
The interpretation of simulation experiment results is similar to the interpretation of any
experimental results. Often the result is the testing of some hypothesis; in other cases, the
interpretation consists of fitting some curve to the output of the experiment.
Conclusion:
In formulating a simulation model, one strives for sufficient realism to adequately describe the
phenomena of interest. However, one should not go "overboard" in including extra details that
will not significantly affect the results. Programs should be properly checked out and validated.
Traffic simulation programs that are properly modeled and validated constitute important
experimental facilities for traffic study.