M/M/1 queue
   HOME

TheInfoList



OR:

In
queueing theory Queueing theory is the mathematical study of waiting lines, or queues. A queueing model is constructed so that queue lengths and waiting time can be predicted. Queueing theory is generally considered a branch of operations research because the ...
, a discipline within the mathematical
theory of probability Probability theory is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expressing it through a set o ...
, an M/M/1 queue represents the queue length in a system having a single server, where arrivals are determined by a
Poisson process In probability, statistics and related fields, a Poisson point process is a type of random mathematical object that consists of points randomly located on a mathematical space with the essential feature that the points occur independently of one ...
and job service times have an exponential distribution. The model name is written in
Kendall's notation In queueing theory, a discipline within the mathematical theory of probability, Kendall's notation (or sometimes Kendall notation) is the standard system used to describe and classify a queueing node. D. G. Kendall proposed describing queueing mod ...
. The model is the most elementary of queueing models and an attractive object of study as
closed-form expression In mathematics, a closed-form expression is a mathematical expression that uses a finite number of standard operations. It may contain constants, variables, certain well-known operations (e.g., + − × ÷), and functions (e.g., ''n''th ro ...
s can be obtained for many metrics of interest in this model. An extension of this model with more than one server is the M/M/c queue.


Model definition

An M/M/1 queue is a stochastic process whose
state space A state space is the set of all possible configurations of a system. It is a useful abstraction for reasoning about the behavior of a given system and is widely used in the fields of artificial intelligence and game theory. For instance, the to ...
is the set where the value corresponds to the number of customers in the system, including any currently in service. * Arrivals occur at rate λ according to a
Poisson process In probability, statistics and related fields, a Poisson point process is a type of random mathematical object that consists of points randomly located on a mathematical space with the essential feature that the points occur independently of one ...
and move the process from state ''i'' to ''i'' + 1. * Service times have an exponential distribution with rate parameter μ in the M/M/1 queue, where 1/μ is the mean service time. * All arrival times and services times are (usually) assumed to be independent of one another. * A single server serves customers one at a time from the front of the queue, according to a
first-come, first-served Queueing theory is the mathematical study of waiting lines, or queues. A queueing model is constructed so that queue lengths and waiting time can be predicted. Queueing theory is generally considered a branch of operations research because the ...
discipline. When the service is complete the customer leaves the queue and the number of customers in the system reduces by one. * The buffer is of infinite size, so there is no limit on the number of customers it can contain. The model can be described as a
continuous time Markov chain Continuity or continuous may refer to: Mathematics * Continuity (mathematics), the opposing concept to discreteness; common examples include ** Continuous probability distribution or random variable in probability and statistics ** Continuous ...
with
transition rate matrix Transition or transitional may refer to: Mathematics, science, and technology Biology * Transition (genetics), a point mutation that changes a purine nucleotide to another purine (A ↔ G) or a pyrimidine nucleotide to another pyrimidine (C ↔ ...
::Q=\begin -\lambda & \lambda \\ \mu & -(\mu+\lambda) & \lambda \\ &\mu & -(\mu+\lambda) & \lambda \\ &&\mu & -(\mu+\lambda) & \lambda &\\ &&&&\ddots \end on the state space . This is the same continuous time Markov chain as in a birth–death process. The
state space A state space is the set of all possible configurations of a system. It is a useful abstraction for reasoning about the behavior of a given system and is widely used in the fields of artificial intelligence and game theory. For instance, the to ...
diagram for this chain is as below.


Transient solution

We can write a probability mass function dependent on ''t'' to describe the probability that the M/M/1 queue is in a particular state at a given time. We assume that the queue is initially in state ''i'' and write ''p''''k''(''t'') for the probability of being in state ''k'' at time ''t''. Then ::p_k(t)=e^ \left \rho^ I_(at) + \rho^ I_(at) + (1-\rho) \rho^ \sum_^ \rho^I_j(at) \right/math> where i is the initial number of customers in the station at time t=0,\rho=\lambda/\mu, a=2\sqrt and I_k is the
modified Bessel function of the first kind Bessel functions, first defined by the mathematician Daniel Bernoulli and then generalized by Friedrich Bessel, are canonical solutions of Bessel's differential equation x^2 \frac + x \frac + \left(x^2 - \alpha^2 \right)y = 0 for an arbitrar ...
. Moments for the transient solution can be expressed as the sum of two
monotone function In mathematics, a monotonic function (or monotone function) is a function between ordered sets that preserves or reverses the given order. This concept first arose in calculus, and was later generalized to the more abstract setting of order ...
s.


Stationary analysis

The model is considered stable only if λ < μ. If, on average, arrivals happen faster than service completions the queue will grow indefinitely long and the system will not have a stationary distribution. The stationary distribution is the limiting distribution for large values of ''t''. Various performance measures can be computed explicitly for the M/M/1 queue. We write ρ = λ/μ for the utilization of the buffer and require ρ < 1 for the queue to be stable. ρ represents the average proportion of time which the server is occupied.


Average number of customers in the system

The probability that the
stationary process In mathematics and statistics, a stationary process (or a strict/strictly stationary process or strong/strongly stationary process) is a stochastic process whose unconditional joint probability distribution does not change when shifted in time. Con ...
is in state ''i'' (contains ''i'' customers, including those in service) is : \pi_i=(1-\rho)\rho^i.\, We see that the number of customers in the system is geometrically distributed with parameter 1 − ''ρ''. Thus the average number of customers in the system is ''ρ''/(1 − ''ρ'') and the variance of number of customers in the system is ''ρ''/(1 − ''ρ'')2. This result holds for any work conserving service regime, such as processor sharing.


Busy period of server

The busy period is the time period measured between the instant a customer arrives to an empty system until the instant a customer departs leaving behind an empty system. The busy period has probability density function ::f(t)=\begin \frace^I_1(2t\sqrt) & t>0\\ 0 & \text\end where ''I''1 is a
modified Bessel function of the first kind Bessel functions, first defined by the mathematician Daniel Bernoulli and then generalized by Friedrich Bessel, are canonical solutions of Bessel's differential equation x^2 \frac + x \frac + \left(x^2 - \alpha^2 \right)y = 0 for an arbitrar ...
, obtained by using
Laplace transform In mathematics, the Laplace transform, named after its discoverer Pierre-Simon Laplace (), is an integral transform that converts a function of a real variable (usually t, in the '' time domain'') to a function of a complex variable s (in the ...
s and inverting the solution. The Laplace transform of the M/M/1 busy period is given by ::\mathbb E( e^ )= \frac(\lambda + \mu + \theta - \sqrt) which gives the moments of the busy period, in particular the mean is 1/(''μ'' − ''λ'') and variance is given by ::\frac.


Response time

The average response time or sojourn time (total time a customer spends in the system) does not depend on scheduling discipline and can be computed using Little's law as 1/(''μ'' − ''λ''). The average time spent waiting is 1/(''μ'' − ''λ'') − 1/''μ'' = ''ρ''/(''μ'' − ''λ''). The distribution of response times experienced does depend on scheduling discipline.


First-come, first-served discipline

For customers who arrive and find the queue as a stationary process, the response time they experience (the sum of both waiting time and service time) has transform (''μ'' − ''λ'')/(''s'' + ''μ'' − ''λ'') and therefore
probability density function In probability theory, a probability density function (PDF), or density of a continuous random variable, is a function whose value at any given sample (or point) in the sample space (the set of possible values taken by the random variable) ca ...
::f(t)= \begin (\mu-\lambda)e^ & t>0\\ 0 & \text \end


Processor sharing discipline

In an M/M/1-PS queue there is no waiting line and all jobs receive an equal proportion of the service capacity. Suppose the single server serves at rate 16 and there are 4 jobs in the system, each job will experience service at rate 4. The rate at which jobs receive service changes each time a job arrives at or departs from the system. For customers who arrive to find the queue as a stationary process, the
Laplace transform In mathematics, the Laplace transform, named after its discoverer Pierre-Simon Laplace (), is an integral transform that converts a function of a real variable (usually t, in the '' time domain'') to a function of a complex variable s (in the ...
of the distribution of response times experienced by customers was published in 1970, for which an integral representation is known. The waiting time distribution (response time less service time) for a customer requiring ''x'' amount of service has transform :W^\ast(s, x) = \frac where ''r'' is the smaller root of the equation :\lambda r^2 - (\lambda + \mu + s)r + \mu = 0. The mean response time for a job arriving and requiring amount ''x'' of service can therefore be computed as ''x μ''/(''μ'' − ''λ''). An alternative approach computes the same results using a spectral expansion method.


Diffusion approximation

When the utilization ''ρ'' is close to 1 the process can be approximated by a
reflected Brownian motion In probability theory, reflected Brownian motion (or regulated Brownian motion, both with the acronym RBM) is a Wiener process in a space with reflecting boundaries. In the physical literature, this process describes diffusion in a confined spac ...
with drift parameter ''λ'' – ''μ'' and variance parameter ''λ'' + ''μ''. This heavy traffic limit was first introduced by
John Kingman __NOTOC__ Sir John Frank Charles Kingman (born 28 August 1939) is a British mathematician. He served as N. M. Rothschild and Sons Professor of Mathematical Sciences and Director of the Isaac Newton Institute at the University of Cambridge fro ...
.


References

{{DEFAULTSORT:M M 1 queue Single queueing nodes