In
probability theory
Probability theory or probability calculus is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expre ...
and
statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, the Poisson distribution () is a
discrete probability distribution that expresses the probability of a given number of
events occurring in a fixed interval of time if these events occur with a known constant mean rate and
independently of the time since the last event. It can also be used for the number of events in other types of intervals than time, and in dimension greater than 1 (e.g., number of events in a given area or volume).
The Poisson distribution is named after
French mathematician
Siméon Denis Poisson. It plays an important role for
discrete-stable distributions.
Under a Poisson distribution with the
expectation of ''λ'' events in a given interval, the probability of ''k'' events in the same interval is:
:
For instance, consider a call center which receives an average of ''λ ='' 3 calls per minute at all times of day. If the calls are independent, receiving one does not change the probability of when the next one will arrive. Under these assumptions, the number ''k'' of calls received during any minute has a Poisson probability distribution. Receiving ''k ='' 1 to 4 calls then has a probability of about 0.77, while receiving 0 or at least 5 calls has a probability of about 0.23.
A classic example used to motivate the Poisson distribution is the number of
radioactive decay
Radioactive decay (also known as nuclear decay, radioactivity, radioactive disintegration, or nuclear disintegration) is the process by which an unstable atomic nucleus loses energy by radiation. A material containing unstable nuclei is conside ...
events during a fixed observation period.
History
The distribution was first introduced by
Siméon Denis Poisson (1781–1840) and published together with his probability theory in his work ''Recherches sur la probabilité des jugements en matière criminelle et en matière civile'' (1837). The work theorized about the number of wrongful convictions in a given country by focusing on certain
random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...
s that count, among other things, the number of discrete occurrences (sometimes called "events" or "arrivals") that take place during a
time
Time is the continuous progression of existence that occurs in an apparently irreversible process, irreversible succession from the past, through the present, and into the future. It is a component quantity of various measurements used to sequ ...
-interval of given length. The result had already been given in 1711 by
Abraham de Moivre in ''De Mensura Sortis seu; de Probabilitate Eventuum in Ludis a Casu Fortuito Pendentibus'' . This makes it an example of
Stigler's law and it has prompted some authors to argue that the Poisson distribution should bear the name of de Moivre.
In 1860,
Simon Newcomb fitted the Poisson distribution to the number of stars found in a unit of space.
A further practical application was made by
Ladislaus Bortkiewicz in 1898. Bortkiewicz showed that the frequency with which soldiers in the Prussian army were accidentally killed by horse kicks could be well modeled by a Poisson distribution..
Definitions
Probability mass function
A discrete
random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...
is said to have a Poisson distribution with parameter
if it has a
probability mass function
In probability and statistics, a probability mass function (sometimes called ''probability function'' or ''frequency function'') is a function that gives the probability that a discrete random variable is exactly equal to some value. Sometimes i ...
given by:
:
where
* is the number of occurrences (
)
* is
Euler's number (
)
* ''k''! = ''k''(''k–''1) ··· (3)(2)(1) is the
factorial
In mathematics, the factorial of a non-negative denoted is the Product (mathematics), product of all positive integers less than or equal The factorial also equals the product of n with the next smaller factorial:
\begin
n! &= n \times ...
.
The positive
real number
In mathematics, a real number is a number that can be used to measure a continuous one- dimensional quantity such as a duration or temperature. Here, ''continuous'' means that pairs of values can have arbitrarily small differences. Every re ...
is equal to the
expected value
In probability theory, the expected value (also called expectation, expectancy, expectation operator, mathematical expectation, mean, expectation value, or first Moment (mathematics), moment) is a generalization of the weighted average. Informa ...
of and also to its
variance
In probability theory and statistics, variance is the expected value of the squared deviation from the mean of a random variable. The standard deviation (SD) is obtained as the square root of the variance. Variance is a measure of dispersion ...
.
:
The Poisson distribution can be applied to systems with a
large number of possible events, each of which is rare. The number of such events that occur during a fixed time interval is, under the right circumstances, a random number with a Poisson distribution.
The equation can be adapted if, instead of the average number of events
we are given the average rate
at which events occur. Then
and:
:
Examples
The Poisson distribution may be useful to model events such as:
* the number of meteorites greater than 1-meter diameter that strike Earth in a year;
* the number of laser photons hitting a detector in a particular time interval;
* the number of students achieving a low and high mark in an exam; and
* locations of defects and dislocations in materials.
Examples of the occurrence of random points in space are: the locations of asteroid impacts with earth (2-dimensional), the locations of imperfections in a material (3-dimensional), and the locations of trees in a forest (2-dimensional).
Assumptions and validity
The Poisson distribution is an appropriate model if the following assumptions are true:
* , a nonnegative integer, is the number of times an event occurs in an interval.
* The occurrence of one event
does not affect the probability of a second event.
* The average rate at which events occur is independent of any occurrences.
* Two events cannot occur at exactly the same instant.
If these conditions are true, then is a Poisson random variable; the distribution of is a Poisson distribution.
The Poisson distribution is also the
limit of a
binomial distribution
In probability theory and statistics, the binomial distribution with parameters and is the discrete probability distribution of the number of successes in a sequence of statistical independence, independent experiment (probability theory) ...
, for which the probability of success for each trial equals divided by the number of trials, as the number of trials approaches infinity (see
Related distributions).
Examples of probability for Poisson distributions
On a particular river, overflow floods occur once every 100 years on average. Calculate the probability of = 0, 1, 2, 3, 4, 5, or 6 overflow floods in a 100-year interval, assuming the Poisson model is appropriate.
Because the average event rate is one overflow flood per 100 years, = 1
:
:
:
:
:
The probability for 0 to 6 overflow floods in a 100-year period.
In this example, it is reported that the average number of goals in a World Cup soccer match is approximately 2.5 and the Poisson model is appropriate.
Because the average event rate is 2.5 goals per match, = 2.5 .
:
:
:
:
:
The probability for 0 to 7 goals in a match.
Once in an interval events: The special case of = 1 and = 0
Suppose that astronomers estimate that large meteorites (above a certain size) hit the earth on average once every 100 years ( event per 100 years), and that the number of meteorite hits follows a Poisson distribution. What is the probability of meteorite hits in the next 100 years?
:
Under these assumptions, the probability that no large meteorites hit the earth in the next 100 years is roughly 0.37. The remaining is the probability of 1, 2, 3, or more large meteorite hits in the next 100 years.
In an example above, an overflow flood occurred once every 100 years The probability of no overflow floods in 100 years was roughly 0.37, by the same calculation.
In general, if an event occurs on average once per interval ( = 1), and the events follow a Poisson distribution, then In addition, as shown in the table for overflow floods.
Examples that violate the Poisson assumptions
The number of students who arrive at the
student union per minute will likely not follow a Poisson distribution, because the rate is not constant (low rate during class time, high rate between class times) and the arrivals of individual students are not independent (students tend to come in groups). The non-constant arrival rate may be modeled as a
mixed Poisson distribution, and the arrival of groups rather than individual students as a
compound Poisson process.
The number of magnitude 5 earthquakes per year in a country may not follow a Poisson distribution, if one large earthquake increases the probability of aftershocks of similar magnitude.
Examples in which at least one event is guaranteed are not Poisson distributed; but may be modeled using a
zero-truncated Poisson distribution.
Count distributions in which the number of intervals with zero events is higher than predicted by a Poisson model may be modeled using a
zero-inflated model.
Properties
Descriptive statistics
* The
expected value
In probability theory, the expected value (also called expectation, expectancy, expectation operator, mathematical expectation, mean, expectation value, or first Moment (mathematics), moment) is a generalization of the weighted average. Informa ...
of a Poisson random variable is .
* The
variance
In probability theory and statistics, variance is the expected value of the squared deviation from the mean of a random variable. The standard deviation (SD) is obtained as the square root of the variance. Variance is a measure of dispersion ...
of a Poisson random variable is also .
* The
coefficient of variation is
while the
index of dispersion is 1.
* The
mean absolute deviation about the mean is
* The
mode of a Poisson-distributed random variable with non-integer is equal to
which is the largest integer less than or equal to . This is also written as
floor(). When is a positive integer, the modes are and − 1.
* All of the
cumulants of the Poisson distribution are equal to the expected value . The th
factorial moment of the Poisson distribution is .
* The
expected value
In probability theory, the expected value (also called expectation, expectancy, expectation operator, mathematical expectation, mean, expectation value, or first Moment (mathematics), moment) is a generalization of the weighted average. Informa ...
of a
Poisson process is sometimes decomposed into the product of ''intensity'' and ''exposure'' (or more generally expressed as the integral of an "intensity function" over time or space, sometimes described as "exposure").
Median
Bounds for the median (
) of the distribution are known and are
sharp:
Higher moments
The higher non-centered
moments of the Poisson distribution are
Touchard polynomials in :
where the braces denote
Stirling numbers of the second kind. In other words,
When the expected value is set to ''λ ='' 1,
Dobinski's formula implies that the ‑th moment is equal to the number of
partitions of a set of size .
A simple upper bound is:
Sums of Poisson-distributed random variables
If
for
are
independent, then
A converse is
Raikov's theorem, which says that if the sum of two independent random variables is Poisson-distributed, then so are each of those two independent random variables.
Maximum entropy
It is a
maximum-entropy distribution among the set of generalized binomial distributions
with mean
and
, where a generalized binomial distribution is defined as a distribution of the sum of N independent but not identically distributed Bernoulli variables.
Other properties
* The Poisson distributions are
infinitely divisible probability distributions.
* The directed
Kullback–Leibler divergence of
from
is given by
* If
is an integer, then
satisfies
and
* Bounds for the tail probabilities of a Poisson random variable
can be derived using a
Chernoff bound argument.
* The upper tail probability can be tightened (by a factor of at least two) as follows:
where
is the Kullback–Leibler divergence of
from
.
* Inequalities that relate the distribution function of a Poisson random variable
to the
Standard normal distribution function
are as follows:
where
is the Kullback–Leibler divergence of
from
and
is the Kullback–Leibler divergence of
from
.
Poisson races
Let
and
be independent random variables, with
then we have that
The upper bound is proved using a standard Chernoff bound.
The lower bound can be proved by noting that
is the probability that
where
which is bounded below by
where
is
relative entropy (See the entry on
bounds on tails of binomial distributions for details). Further noting that
and computing a lower bound on the unconditional probability gives the result. More details can be found in the appendix of Kamath et al.
Related distributions
As a Binomial distribution with infinitesimal time-steps
The Poisson distribution can be derived as a limiting case to the
binomial distribution
In probability theory and statistics, the binomial distribution with parameters and is the discrete probability distribution of the number of successes in a sequence of statistical independence, independent experiment (probability theory) ...
as the number of trials goes to infinity and the
expected number of successes remains fixed — see
law of rare events below. Therefore, it can be used as an approximation of the binomial distribution if is sufficiently large and ''p'' is sufficiently small. The Poisson distribution is a good approximation of the binomial distribution if is at least 20 and ''p'' is smaller than or equal to 0.05, and an excellent approximation if ≥ 100 and ≤ 10. Letting
and
be the respective
cumulative density functions of the binomial and Poisson distributions, one has:
One derivation of this uses
probability-generating functions. Consider a
Bernoulli trial (coin-flip) whose probability of one success (or expected number of successes) is
within a given interval. Split the interval into ''n'' parts, and perform a trial in each subinterval with probability
. The probability of ''k'' successes out of ''n'' trials over the entire interval is then given by the binomial distribution
whose generating function is:
Taking the limit as ''n'' increases to infinity (with ''x'' fixed) and applying the product limit definition of the
exponential function, this reduces to the generating function of the Poisson distribution:
General
* If
and
are independent, then the difference
follows a
Skellam distribution.
* If
and
are independent, then the distribution of
conditional on
is a
binomial distribution
In probability theory and statistics, the binomial distribution with parameters and is the discrete probability distribution of the number of successes in a sequence of statistical independence, independent experiment (probability theory) ...
. Specifically, if
then
More generally, if ''X''
1, ''X''
2, ..., ''X''
are independent Poisson random variables with parameters
1,
2, ...,
then
*: given
it follows that
In fact,
* If
and the distribution of
conditional on ''X'' = is a
binomial distribution
In probability theory and statistics, the binomial distribution with parameters and is the discrete probability distribution of the number of successes in a sequence of statistical independence, independent experiment (probability theory) ...
,
then the distribution of Y follows a Poisson distribution
In fact, if, conditional on
follows a
multinomial distribution,
then each
follows an independent Poisson distribution
* The Poisson distribution is a
special case of the discrete compound Poisson distribution (or stuttering Poisson distribution) with only a parameter. The discrete compound Poisson distribution can be deduced from the limiting distribution of univariate multinomial distribution. It is also a
special case of a
compound Poisson distribution.
* For sufficiently large values of , (say >1000), the
normal distribution
In probability theory and statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is
f(x) = \frac ...
with mean and variance (standard deviation
) is an excellent approximation to the Poisson distribution. If is greater than about 10, then the normal distribution is a good approximation if an appropriate
continuity correction is performed, i.e., if , where ''x'' is a non-negative integer, is replaced by .
*
Variance-stabilizing transformation: If
then
and
Under this transformation, the convergence to normality (as
increases) is far faster than the untransformed variable. Other, slightly more complicated, variance stabilizing transformations are available, one of which is
Anscombe transform. See
Data transformation (statistics) for more general uses of transformations.
* If for every ''t'' > 0 the number of arrivals in the time interval follows the Poisson distribution with mean ''λt'', then the sequence of inter-arrival times are independent and identically distributed
exponential random variables having mean 1/.
* The
cumulative distribution function
In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x.
Ever ...
s of the Poisson and
chi-squared distributions are related in the following ways:
and
Poisson approximation
Assume
where
then
is
multinomially distributed
conditioned on
This means, among other things, that for any nonnegative function
if
is multinomially distributed, then
where
The factor of
can be replaced by 2 if
is further assumed to be monotonically increasing or decreasing.
Bivariate Poisson distribution
This distribution has been extended to the
bivariate case. The
generating function for this distribution is
with
The marginal distributions are Poisson(''θ''
1) and Poisson(''θ''
2) and the correlation coefficient is limited to the range
A simple way to generate a bivariate Poisson distribution
is to take three independent Poisson distributions
with means
and then set
The probability function of the bivariate Poisson distribution is
Free Poisson distribution
The free Poisson distribution with jump size
and rate
arises in
free probability theory as the limit of repeated
free convolution
as .
In other words, let
be random variables so that
has value
with probability
and value 0 with the remaining probability. Assume also that the family
are
freely independent. Then the limit as
of the law of
is given by the Free Poisson law with parameters
This definition is analogous to one of the ways in which the classical Poisson distribution is obtained from a (classical) Poisson process.
The measure associated to the free Poisson law is given by
where
and has support
This law also arises in
random matrix theory as the
Marchenko–Pastur law. Its
free cumulants are equal to
Some transforms of this law
We give values of some important transforms of the free Poisson law; the computation can be found in e.g. in the book ''Lectures on the Combinatorics of Free Probability'' by A. Nica and R. Speicher
The R-transform of the free Poisson law is given by
The Cauchy transform (which is the negative of the
Stieltjes transformation) is given by
The S-transform is given by
in the case that
Weibull and stable count
Poisson's probability mass function
can be expressed in a form similar to the product distribution of a
Weibull distribution and a variant form of the
stable count distribution.
The variable
can be regarded as inverse of Lévy's stability parameter in the stable count distribution:
where
is a standard stable count distribution of shape
and
is a standard Weibull distribution of shape
Statistical inference
Parameter estimation
Given a sample of measured values
for we wish to estimate the value of the parameter of the Poisson population from which the sample was drawn. The
maximum likelihood estimate is
:
Since each observation has expectation , so does the sample mean. Therefore, the maximum likelihood estimate is an
unbiased estimator
In statistics, the bias of an estimator (or bias function) is the difference between this estimator's expected value and the true value of the parameter being estimated. An estimator or decision rule with zero bias is called ''unbiased''. In stat ...
of . It is also an efficient estimator since its variance achieves the
Cramér–Rao lower bound (CRLB). Hence it is
minimum-variance unbiased. Also it can be proven that the sum (and hence the sample mean as it is a one-to-one function of the sum) is a complete and sufficient statistic for .
To prove sufficiency we may use the
factorization theorem. Consider partitioning the probability mass function of the joint Poisson distribution for the sample into two parts: one that depends solely on the sample
, called
, and one that depends on the parameter
and the sample
only through the function
Then
is a sufficient statistic for
:
The first term
depends only on
. The second term
depends on the sample only through
Thus,
is sufficient.
To find the parameter that maximizes the probability function for the Poisson population, we can use the logarithm of the likelihood function:
:
We take the derivative of
with respect to and compare it to zero:
:
Solving for gives a stationary point.
:
So is the average of the
''i'' values. Obtaining the sign of the second derivative of ''L'' at the stationary point will determine what kind of extreme value is.
:
Evaluating the second derivative ''at the stationary point'' gives:
:
which is the negative of times the reciprocal of the average of the k
i. This expression is negative when the average is positive. If this is satisfied, then the stationary point maximizes the probability function.
For
completeness, a family of distributions is said to be complete if and only if
implies that
for all
If the individual
are iid
then
Knowing the distribution we want to investigate, it is easy to see that the statistic is complete.
:
For this equality to hold,
must be 0. This follows from the fact that none of the other terms will be 0 for all
in the sum and for all possible values of
Hence,
for all
implies that
and the statistic has been shown to be complete.
Confidence interval
The
confidence interval for the mean of a Poisson distribution can be expressed using the relationship between the cumulative distribution functions of the Poisson and
chi-squared distributions. The chi-squared distribution is itself closely related to the
gamma distribution, and this leads to an alternative expression. Given an observation from a Poisson distribution with mean ''μ'', a confidence interval for ''μ'' with confidence level is
:
or equivalently,
:
where
is the
quantile function (corresponding to a lower tail area ''p'') of the chi-squared distribution with degrees of freedom and
is the quantile function of a
gamma distribution with shape parameter n and scale parameter 1. This interval is '
exact' in the sense that its
coverage probability is never less than the nominal .
When quantiles of the gamma distribution are not available, an accurate approximation to this exact interval has been proposed (based on the
Wilson–Hilferty transformation):
:
where
denotes the
standard normal deviate with upper tail area .
For application of these formulae in the same context as above (given a sample of measured values
''i'' each drawn from a Poisson distribution with mean ), one would set
:
calculate an interval for and then derive the interval for .
Bayesian inference
In
Bayesian inference, the
conjugate prior for the rate parameter of the Poisson distribution is the
gamma distribution. Let
:
denote that is distributed according to the gamma
density
Density (volumetric mass density or specific mass) is the ratio of a substance's mass to its volume. The symbol most often used for density is ''ρ'' (the lower case Greek letter rho), although the Latin letter ''D'' (or ''d'') can also be u ...
''g'' parameterized in terms of a
shape parameter ''α'' and an inverse
scale parameter ''β'':
:
Then, given the same sample of measured values
''i'' as before, and a prior of Gamma(''α'', ''β''), the posterior distribution is
:
Note that the posterior mean is linear and is given by
:
It can be shown that gamma distribution is the only prior that induces linearity of the conditional mean. Moreover, a converse result exists which states that if the conditional mean is close to a linear function in the
distance than the prior distribution of must be close to gamma distribution in
Levy distance.
The posterior mean E[] approaches the maximum likelihood estimate
in the limit as
which follows immediately from the general expression of the mean of the
gamma distribution.
The
posterior predictive distribution for a single additional observation is a
negative binomial distribution, sometimes called a gamma–Poisson distribution.
Simultaneous estimation of multiple Poisson means
Suppose
is a set of independent random variables from a set of
Poisson distributions, each with a parameter
and we would like to estimate these parameters. Then, Clevenson and Zidek show that under the normalized squared error loss
when
then, similar as in
Stein's example for the Normal means, the MLE estimator
is
inadmissible.
In this case, a family of
minimax estimators is given for any
and
as
:
Occurrence and applications
Some applications of the Poisson distribution to
count data (number of events):
*
telecommunication
Telecommunication, often used in its plural form or abbreviated as telecom, is the transmission of information over a distance using electronic means, typically through cables, radio waves, or other communication technologies. These means of ...
: telephone calls arriving in a system,
*
astronomy
Astronomy is a natural science that studies celestial objects and the phenomena that occur in the cosmos. It uses mathematics, physics, and chemistry in order to explain their origin and their overall evolution. Objects of interest includ ...
: photons arriving at a telescope,
*
chemistry
Chemistry is the scientific study of the properties and behavior of matter. It is a physical science within the natural sciences that studies the chemical elements that make up matter and chemical compound, compounds made of atoms, molecules a ...
: the
molar mass distribution of a
living polymerization,
*
biology
Biology is the scientific study of life and living organisms. It is a broad natural science that encompasses a wide range of fields and unifying principles that explain the structure, function, growth, History of life, origin, evolution, and ...
: the number of mutations on a strand of
DNA per unit length,
*
management
Management (or managing) is the administration of organizations, whether businesses, nonprofit organizations, or a Government agency, government bodies through business administration, Nonprofit studies, nonprofit management, or the political s ...
: customers arriving at a counter or call centre,
*
finance and insurance: number of losses or claims occurring in a given period of time,
*
seismology
Seismology (; from Ancient Greek σεισμός (''seismós'') meaning "earthquake" and -λογία (''-logía'') meaning "study of") is the scientific study of earthquakes (or generally, quakes) and the generation and propagation of elastic ...
: asymptotic Poisson model of risk for large earthquakes,
*
radioactivity
Radioactive decay (also known as nuclear decay, radioactivity, radioactive disintegration, or nuclear disintegration) is the process by which an unstable atomic nucleus loses energy by radiation. A material containing unstable nuclei is conside ...
: decays in a given time interval in a radioactive sample,
*
optics
Optics is the branch of physics that studies the behaviour and properties of light, including its interactions with matter and the construction of optical instruments, instruments that use or Photodetector, detect it. Optics usually describes t ...
: number of photons emitted in a single laser pulse (a major vulnerability of
quantum key distribution protocols, known as photon number splitting).
More examples of counting events that may be modelled as Poisson processes include:
* soldiers killed by horse-kicks each year in each corps in the
Prussia
Prussia (; ; Old Prussian: ''Prūsija'') was a Germans, German state centred on the North European Plain that originated from the 1525 secularization of the Prussia (region), Prussian part of the State of the Teutonic Order. For centuries, ...
n cavalry. This example was used in a book by
Ladislaus Bortkiewicz (1868–1931),
* yeast cells used when brewing
Guinness beer. This example was used by
William Sealy Gosset (1876–1937),
* phone calls arriving at a
call centre
A call centre ( Commonwealth spelling) or call center ( American spelling; see spelling differences) is a managed capability that can be centralised or remote that is used for receiving or transmitting a large volume of enquiries by telephone ...
within a minute. This example was described by
A.K. Erlang (1878–1929),
* goals in sports involving two competing teams,
* deaths per year in a given age group,
* jumps in a stock price in a given time interval,
* times a
web server
A web server is computer software and underlying Computer hardware, hardware that accepts requests via Hypertext Transfer Protocol, HTTP (the network protocol created to distribute web content) or its secure variant HTTPS. A user agent, co ...
is accessed per minute (under an assumption of
homogeneity),
*
mutation
In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA or viral replication, ...
s in a given stretch of
DNA after a certain amount of radiation,
*
cells infected at a given
multiplicity of infection,
* bacteria in a certain amount of liquid,
*
photons arriving on a pixel circuit at a given illumination over a given time period,
* landing of
V-1 flying bomb
The V-1 flying bomb ( "Vengeance Weapon 1") was an early cruise missile. Its official Reich Aviation Ministry () name was Fieseler Fi 103 and its suggestive name was (hellhound). It was also known to the Allies as the buzz bomb or doodlebug a ...
s on London during World War II, investigated by R. D. Clarke in 1946.
In
probabilistic number theory,
Gallagher showed in 1976 that, if a certain version of the unproved
prime r-tuple conjecture holds, then the counts of
prime number
A prime number (or a prime) is a natural number greater than 1 that is not a Product (mathematics), product of two smaller natural numbers. A natural number greater than 1 that is not prime is called a composite number. For example, 5 is prime ...
s in short intervals would obey a Poisson distribution.
Law of rare events

The rate of an event is related to the probability of an event occurring in some small subinterval (of time, space or otherwise). In the case of the Poisson distribution, one assumes that there exists a small enough subinterval for which the probability of an event occurring twice is "negligible". With this assumption one can derive the Poisson distribution from the binomial one, given only the information of expected number of total events in the whole interval.
Let the total number of events in the whole interval be denoted by
Divide the whole interval into
subintervals
of equal size, such that
(since we are interested in only very small portions of the interval this assumption is meaningful). This means that the expected number of events in each of the subintervals is equal to
Now we assume that the occurrence of an event in the whole interval can be seen as a sequence of
Bernoulli trials, where the
-th
Bernoulli trial corresponds to looking whether an event happens at the subinterval
with probability
The expected number of total events in
such trials would be
the expected number of total events in the whole interval. Hence for each subdivision of the interval we have approximated the occurrence of the event as a Bernoulli process of the form
As we have noted before we want to consider only very small subintervals. Therefore, we take the limit as
goes to infinity.
In this case the
binomial distribution
In probability theory and statistics, the binomial distribution with parameters and is the discrete probability distribution of the number of successes in a sequence of statistical independence, independent experiment (probability theory) ...
converges to what is known as the Poisson distribution by the
Poisson limit theorem.
In several of the above examples — such as the number of mutations in a given sequence of DNA — the events being counted are actually the outcomes of discrete trials, and would more precisely be modelled using the
binomial distribution
In probability theory and statistics, the binomial distribution with parameters and is the discrete probability distribution of the number of successes in a sequence of statistical independence, independent experiment (probability theory) ...
, that is
In such cases is very large and is very small (and so the expectation is of intermediate magnitude). Then the distribution may be approximated by the less cumbersome Poisson distribution
This approximation is sometimes known as the ''law of rare events'', since each of the individual
Bernoulli events rarely occurs.
The name "law of rare events" may be misleading because the total count of success events in a Poisson process need not be rare if the parameter is not small. For example, the number of telephone calls to a busy switchboard in one hour follows a Poisson distribution with the events appearing frequent to the operator, but they are rare from the point of view of the average member of the population who is very unlikely to make a call to that switchboard in that hour.
The variance of the binomial distribution is 1 − ''p'' times that of the Poisson distribution, so almost equal when ''p'' is very small.
The word ''law'' is sometimes used as a synonym of
probability distribution
In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...
, and ''convergence in law'' means ''convergence in distribution''. Accordingly, the Poisson distribution is sometimes called the "law of small numbers" because it is the probability distribution of the number of occurrences of an event that happens rarely but has very many opportunities to happen. ''The Law of Small Numbers'' is a book by Ladislaus Bortkiewicz about the Poisson distribution, published in 1898.
Poisson point process
The Poisson distribution arises as the number of points of a
Poisson point process located in some finite region. More specifically, if ''D'' is some region space, for example Euclidean space R
''d'', for which , ''D'', , the area, volume or, more generally, the Lebesgue measure of the region is finite, and if denotes the number of points in ''D'', then
:
Poisson regression and negative binomial regression
Poisson regression
In statistics, Poisson regression is a generalized linear model form of regression analysis used to model count data and contingency tables. Poisson regression assumes the response variable ''Y'' has a Poisson distribution, and assumes the lo ...
and
negative binomial regression are useful for analyses where the dependent (response) variable is the count of the number of events or occurrences in an interval.
Biology
The
Luria–Delbrück experiment tested against the hypothesis of Lamarckian evolution, which should result in a Poisson distribution.
Katz and Miledi measured the
membrane potential with and without the presence of
acetylcholine
Acetylcholine (ACh) is an organic compound that functions in the brain and body of many types of animals (including humans) as a neurotransmitter. Its name is derived from its chemical structure: it is an ester of acetic acid and choline. Par ...
(ACh). When ACh is present,
ion channels
Ion channels are pore-forming membrane proteins that allow ions to pass through the channel pore. Their functions include establishing a resting membrane potential, shaping action potentials and other electrical signals by gating the flow of ...
on the membrane would be open randomly at a small fraction of the time. As there are a large number of ion channels each open for a small fraction of the time, the total number of ion channels open at any moment is Poisson distributed. When ACh is not present, effectively no ion channels are open. The membrane potential is
. Subtracting the effect of noise, Katz and Miledi found the mean and variance of membrane potential to be
, giving
. (pp. 94-95
)
During each cellular replication event, the number of mutations is roughly Poisson distributed.
For example, the HIV virus has 10,000 base pairs, and has a mutation rate of about 1 per 30,000 base pairs, meaning the number of mutations per replication event is distributed as
. (p. 64
)
Other applications in science
In a Poisson process, the number of observed occurrences fluctuates about its mean with a
standard deviation
In statistics, the standard deviation is a measure of the amount of variation of the values of a variable about its Expected value, mean. A low standard Deviation (statistics), deviation indicates that the values tend to be close to the mean ( ...
These fluctuations are denoted as ''Poisson noise'' or (particularly in electronics) as ''
shot noise
Shot noise or Poisson noise is a type of noise which can be modeled by a Poisson process.
In electronics shot noise originates from the discrete nature of electric charge. Shot noise also occurs in photon counting in optical devices, where s ...
''.
The correlation of the mean and standard deviation in counting independent discrete occurrences is useful scientifically. By monitoring how the fluctuations vary with the mean signal, one can estimate the contribution of a single occurrence, ''even if that contribution is too small to be detected directly''. For example, the charge ''e'' on an electron can be estimated by correlating the magnitude of an
electric current
An electric current is a flow of charged particles, such as electrons or ions, moving through an electrical conductor or space. It is defined as the net rate of flow of electric charge through a surface. The moving particles are called charge c ...
with its
shot noise
Shot noise or Poisson noise is a type of noise which can be modeled by a Poisson process.
In electronics shot noise originates from the discrete nature of electric charge. Shot noise also occurs in photon counting in optical devices, where s ...
. If ''N'' electrons pass a point in a given time ''t'' on the average, the
mean current is
; since the current fluctuations should be of the order
(i.e., the standard deviation of the
Poisson process), the charge
can be estimated from the ratio
An everyday example is the graininess that appears as photographs are enlarged; the graininess is due to Poisson fluctuations in the number of reduced
silver
Silver is a chemical element; it has Symbol (chemistry), symbol Ag () and atomic number 47. A soft, whitish-gray, lustrous transition metal, it exhibits the highest electrical conductivity, thermal conductivity, and reflectivity of any metal. ...
grains, not to the individual grains themselves. By
correlating the graininess with the degree of enlargement, one can estimate the contribution of an individual grain (which is otherwise too small to be seen unaided).
In
causal set theory the discrete elements of spacetime follow a Poisson distribution in the volume.
The Poisson distribution also appears in
quantum mechanics
Quantum mechanics is the fundamental physical Scientific theory, theory that describes the behavior of matter and of light; its unusual characteristics typically occur at and below the scale of atoms. Reprinted, Addison-Wesley, 1989, It is ...
, especially
quantum optics. Namely, for a
quantum harmonic oscillator system in a
coherent state, the probability of measuring a particular energy level has a Poisson distribution.
Computational methods
The Poisson distribution poses two different tasks for dedicated software libraries: ''evaluating'' the distribution
, and ''drawing random numbers'' according to that distribution.
Evaluating the Poisson distribution
Computing
for given
and
is a trivial task that can be accomplished by using the standard definition of
in terms of exponential, power, and factorial functions. However, the conventional definition of the Poisson distribution contains two terms that can easily overflow on computers:
and . The fraction of
to ! can also produce a rounding error that is very large compared to ''e''
−, and therefore give an erroneous result. For numerical stability the Poisson probability mass function should therefore be evaluated as
:
which is mathematically equivalent but numerically stable. The natural logarithm of the
Gamma function
In mathematics, the gamma function (represented by Γ, capital Greek alphabet, Greek letter gamma) is the most common extension of the factorial function to complex numbers. Derived by Daniel Bernoulli, the gamma function \Gamma(z) is defined ...
can be obtained using the
lgamma
function in the
C standard library (C99 version) or
R, the
gammaln
function in
MATLAB or
SciPy, or the
log_gamma
function in
Fortran 2008 and later.
Some computing languages provide built-in functions to evaluate the Poisson distribution, namely
*
R: function
dpois(x, lambda)
;
*
Excel: function
POISSON( x, mean, cumulative)
, with a flag to specify the cumulative distribution;
*
Mathematica: univariate Poisson distribution as
PoissonDistribution \lambda">math>\lambda/code>, bivariate Poisson distribution as MultivariatePoissonDistribution \theta_,">math>\theta_,/code>,.
Random variate generation
The less trivial task is to draw integer random variate from the Poisson distribution with given
Solutions are provided by:
* R: function rpois(n, lambda)
;
* GNU Scientific Library
The GNU Scientific Library (or GSL) is a software library for numerical computations in applied mathematics and science. The GSL is written in C (programming language), C; wrappers are available for other programming languages. The GSL is part of ...
(GSL): functio
gsl_ran_poisson
A simple algorithm to generate random Poisson-distributed numbers ( pseudo-random number sampling) has been given by Knuth:
algorithm ''poisson random number (Knuth)'':
init:
Let L ← ''e''−λ, k ← 0 and p ← 1.
do:
k ← k + 1.
Generate uniform random number u in ,1and let p ← p × u.
while p > L.
return k − 1.
The complexity is linear in the returned value , which is on average. There are many other algorithms to improve this. Some are given in Ahrens & Dieter, see below.
For large values of , the value of = ''e''− may be so small that it is hard to represent. This can be solved by a change to the algorithm which uses an additional parameter STEP such that ''e''−STEP does not underflow:
algorithm ''poisson random number (Junhao, based on Knuth)'':
init:
Let Left ← , k ← 0 and p ← 1.
do:
k ← k + 1.
Generate uniform random number u in (0,1) and let p ← p × u.
while p < 1 and Left > 0:
if Left > STEP:
p ← p × ''e''STEP
Left ← Left − STEP
else:
p ← p × ''e''Left
Left ← 0
while p > 1.
return k − 1.
The choice of STEP depends on the threshold of overflow. For double precision floating point format the threshold is near ''e''700, so 500 should be a safe ''STEP''.
Other solutions for large values of include rejection sampling
In numerical analysis and computational statistics, rejection sampling is a basic technique used to generate observations from a distribution. It is also commonly called the acceptance-rejection method or "accept-reject algorithm" and is a type o ...
and using Gaussian approximation.
Inverse transform sampling is simple and efficient for small values of , and requires only one uniform random number ''u'' per sample. Cumulative probabilities are examined in turn until one exceeds ''u''.
algorithm ''Poisson generator based upon the inversion by sequential search'':
init:
Let x ← 0, p ← ''e''−λ, s ← p.
Generate uniform random number u in ,1
while u > s do:
x ← x + 1.
p ← p × / x.
s ← s + p.
return x.
See also
* Binomial distribution
In probability theory and statistics, the binomial distribution with parameters and is the discrete probability distribution of the number of successes in a sequence of statistical independence, independent experiment (probability theory) ...
* Compound Poisson distribution
* Conway–Maxwell–Poisson distribution
In probability theory and statistics, the Conway–Maxwell–Poisson (CMP or COM–Poisson) distribution is a discrete probability distribution named after Richard W. Conway, William L. Maxwell, and Siméon Denis Poisson that generalizes the Pois ...
* Erlang distribution
* Exponential distribution
In probability theory and statistics, the exponential distribution or negative exponential distribution is the probability distribution of the distance between events in a Poisson point process, i.e., a process in which events occur continuousl ...
* Gamma distribution
* Hermite distribution
* Index of dispersion
* Negative binomial distribution
* Poisson clumping
* Poisson point process
* Poisson regression
In statistics, Poisson regression is a generalized linear model form of regression analysis used to model count data and contingency tables. Poisson regression assumes the response variable ''Y'' has a Poisson distribution, and assumes the lo ...
* Poisson sampling
* Poisson wavelet
* Queueing theory
* Renewal theory
* Robbins lemma
* Skellam distribution
* Tweedie distribution
In probability and statistics, the Tweedie distributions are a family of probability distributions which include the purely continuous normal, gamma and inverse Gaussian distributions, the purely discrete scaled Poisson distribution, and th ...
* Zero-inflated model
* Zero-truncated Poisson distribution
References
Citations
Sources
*
*
*
{{Authority control
1711 introductions
Articles with example pseudocode
Conjugate prior distributions
Factorial and binomial topics
Infinitely divisible probability distributions
Abraham de Moivre