probability theory Probability theory or probability calculus is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expre ...

and

statistics Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...

, the discrete uniform distribution is a symmetric

probability distribution In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...

wherein each of some finite

whole number An integer is the number zero ( 0), a positive natural number (1, 2, 3, ...), or the negation of a positive natural number ( −1, −2, −3, ...). The negations or additive inverses of the positive natural numbers are referred to as negative ...

''n'' of outcome values are equally likely to be observed. Thus every one of the ''n'' outcome values has equal probability 1/''n''. Intuitively, a discrete uniform distribution is "a known, finite number of outcomes all equally likely to happen." A simple example of the discrete uniform distribution comes from throwing a fair six-sided die. The possible values are 1, 2, 3, 4, 5, 6, and each time the die is thrown the probability of each given value is 1/6. If two dice were thrown and their values added, the possible sums would not have equal probability and so the distribution of sums of two dice rolls is not uniform. Although it is common to consider discrete uniform distributions over a contiguous range of integers, such as in this six-sided die example, one can define discrete uniform distributions over any

finite set In mathematics, particularly set theory, a finite set is a set that has a finite number of elements. Informally, a finite set is a set which one could in principle count and finish counting. For example, is a finite set with five elements. Th ...

. For instance, the six-sided die could have abstract symbols rather than numbers on each of its faces. Less simply, a random permutation is a

permutation In mathematics, a permutation of a set can mean one of two different things: * an arrangement of its members in a sequence or linear order, or * the act or process of changing the linear order of an ordered set. An example of the first mean ...

generated uniformly randomly from the permutations of a given set and a uniform spanning tree of a graph is a

spanning tree In the mathematical field of graph theory, a spanning tree ''T'' of an undirected graph ''G'' is a subgraph that is a tree which includes all of the vertices of ''G''. In general, a graph may have several spanning trees, but a graph that is no ...

selected with uniform probabilities from the full set of spanning trees of the graph. The discrete uniform distribution itself is non-parametric. However, in the common case that its possible outcome values are the integers in an interval

, b /math>, then ''a'' and ''b'' are parameters of the distribution and n = b - a + 1. In these cases the

cumulative distribution function In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x. Ever ...

(CDF) of the discrete uniform distribution can be expressed, for any ''k'', as

F(k;a,b) = \min \left( \max \left( \frac, 0 \right) , 1 \right),

or simply

F(k;a,b) = \frac

on the distribution's support

k \in, b

Estimation of maximum

The problem of estimating the maximum

N

of a discrete uniform distribution on the integer interval

,N /math> from a sample of ''k'' observations is commonly known as the German tank problem, following the practical application of this maximum estimation problem, during

World War II World War II or the Second World War (1 September 1939 – 2 September 1945) was a World war, global conflict between two coalitions: the Allies of World War II, Allies and the Axis powers. World War II by country, Nearly all of the wo ...

, by Allied forces seeking to estimate German tank production. A uniformly minimum variance unbiased (UMVU) estimator for the distribution's maximum in terms of ''m,'' the sample maximum, and ''k,'' the

sample size Sample size determination or estimation is the act of choosing the number of observations or replicates to include in a statistical sample. The sample size is an important feature of any empirical study in which the goal is to make inferences abo ...

, is

\hat=\frac m - 1 = m + \frac - 1.

This can be seen as a very simple case of

maximum spacing estimation In statistics, maximum spacing estimation (MSE or MSP), or maximum product of spacing estimation (MPS), is a method for estimating the parameters of a univariate parametric model, statistical model. The method requires maximization of the geometr ...

. This has a variance of

\frac\frac \approx \frac \text k \ll N

so a standard deviation of approximately

\tfrac N k

, the population-average gap size between samples. The sample maximum

m

itself is the maximum likelihood estimator for the population maximum, but it is biased. If samples from a discrete uniform distribution are not numbered in order but are recognizable or markable, one can instead estimate population size via a mark and recapture method.

Random permutation

See rencontres numbers for an account of the probability distribution of the number of fixed points of a uniformly distributed random permutation.

Properties

The family of uniform discrete distributions over ranges of integers with one or both bounds unknown has a finite-dimensional sufficient statistic, namely the triple of the sample maximum, sample minimum, and sample size. Uniform discrete distributions over bounded integer ranges do not constitute an exponential family of distributions because their support varies with their parameters. For families of distributions in which their supports do not depend on their parameters, the Pitman–Koopman–Darmois theorem states that only exponential families have sufficient statistics of dimensions that are bounded as sample size increases. The uniform distribution is thus a simple example showing the necessity of the conditions for this theorem.

References

{{Probability distributions, discrete-finite Discrete distributions Location-scale family probability distributions su:Sebaran seragam#Kasus diskrit

Estimation of maximum

Random permutation

Properties

See also

References