Download as ppt, pdf, or txt
Download as ppt, pdf, or txt
You are on page 1of 14

Queuing Analysis

Based on noted from Appendix A of


Stallings Operating System text

07/17/24 1
Queuing Model and
Analysis Queue1
• Queuing theory deals
with modeling and
analyzing systems with Queue2
queues of items and
servers that process the
items.

server

Queue3

07/17/24 2
Goals of Queuing Analysis
• Typically used in analysis of networking system; examples,
– increase in disk access time
– Increase in process load
– Increase in rate of arrival of packets, processes
• Especially useful of analysis of performance when either the
load on a system is expected to increase or a design change is
contemplated.
• While it is a popular method in network analysis, it has gained
popularity within a system esp. with the advent of multi-core
processors.

07/17/24 3
Analysis methods
• After the fact analysis: let the system run some
n number times, collect the “real” data and
analyze – problems?
• Predict some simple trends /projections based
on experience – problems?
• Develop analytical model based on queuing
theory – problems?
• Run simulation (not real systems) and collect
data to analyze –problems?
07/17/24 4
Single server queue

queue Dispatching departures


arrivals discipline
server
λ= arrival rate
w = mean # items waiting Ts = mean service time
Tw = mean waiting time ρ = utilization

r mean # items residing in the system


Tr = mean residence time

07/17/24 5
Multi-server /single queue
queue Dispatching
arrivals discipline
server0
λ= arrival rate

server1
……….

Servern-1

07/17/24 6
Multi-server /Multiple queues
queue
server0

arrivals queue
server1
……….
queue

Servern-1

07/17/24 7
Parameters
• Items arrive at the facility at some average
rate (items arriving per second) l.
• At any given time, a certain number of items
will be waiting in the queue (zero or more);
• The average number waiting is w, and the
mean time that an item must wait is Tw.
• The server handles incoming items with an
average service time Ts;

07/17/24 8
More parameters
• Utilization, ρ, is the fraction of time that the
server is busy, measured over some interval of
time.
• Finally, two parameters apply to the system as a
whole.
• The average number of items resident in the
system, including the item being served (if any)
and the items waiting (if any), is r;
• and the average time that an item spends in the
system, waiting and being served, is Tr; we refer
to this as the mean residence time
07/17/24 9
Analysis
• As the arrival rate, which is the rate of traffic passing through the
system, increases, the utilization increases and with it, congestion.
The queue becomes longer, increasing waiting time. At ρ = 1, the
server becomes saturated, working 100% of the time.
• Thus, the theoretical maximum input rate that can be handled by
the system is:
λmax = 1/Ts
• However, queues become very large near system saturation,
growing without bound when ρ = 1. Practical considerations, such
as response time requirements or buffer sizes, usually limit the
input rate for a single server to 70-90% of the theoretical
maximum.
• For multi server queue for N servers:
λmax = N/Ts

07/17/24 10
Specific Metrics
• The fundamental task of a queuing analysis is as follows: Given the
following information as input:
· Arrival rate
· Service time
• Provide as output information concerning:
· Items waiting
· Waiting time
· Items in residence
· Residence time.
• We would like to know their average values (w, Tw, r, Tr) and the
respective variability the σ’s
• We are also interested in some probabilities: what is probability
that items waiting in line < M is 0.99?

07/17/24 11
Important Assumptions
• The arrival rate obeys the Poisson distribution, which is equivalent
to saying that the inter-arrival times are exponential;
• On other words, the arrivals occur randomly and independent of
one another.
• A convenient notation has been developed for summarizing the
principal assumptions that are made in developing a queuing
model.
• The notation is X/Y/N, where X refers to the distribution of the
inter-arrival times, Y refers to the distribution of service times, and
N refers to the number of servers.
• M/M/1 refers to a single-server queuing model with Poisson
arrivals and exponential service times.
• M/G/1 and M/M/1 and M/D/1

07/17/24 12
Little’s Law
• Very simple law that works from a Case Western
Reserve University professor Dr. Little
• Average number of customers in a system =
average arrival rate * average time spent in the
system
• r = Tr * λ
• w = Tw * λ
• Tr = Tw + Ts
• Extend it to the M/M/1 queuing model
07/17/24 13
Examples
• Page 21-22-23
• Database server (can be substituted for any
server).
• Tightly-coupled multi-processor system
• Necessary formulae are in pages: 14, 18 (Table
3 and Table 4)

07/17/24 14

You might also like