mathematics Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. There are many ar ...

, concentration of measure (about a

median The median of a set of numbers is the value separating the higher half from the lower half of a Sample (statistics), data sample, a statistical population, population, or a probability distribution. For a data set, it may be thought of as the “ ...

) is a principle that is applied in

measure theory In mathematics, the concept of a measure is a generalization and formalization of geometrical measures (length, area, volume) and other common notions, such as magnitude (mathematics), magnitude, mass, and probability of events. These seemingl ...

probability Probability is a branch of mathematics and statistics concerning events and numerical descriptions of how likely they are to occur. The probability of an event is a number between 0 and 1; the larger the probability, the more likely an e ...

and

combinatorics Combinatorics is an area of mathematics primarily concerned with counting, both as a means and as an end to obtaining results, and certain properties of finite structures. It is closely related to many other areas of mathematics and has many ...

, and has consequences for other fields such as

Banach space In mathematics, more specifically in functional analysis, a Banach space (, ) is a complete normed vector space. Thus, a Banach space is a vector space with a metric that allows the computation of vector length and distance between vectors and ...

theory. Informally, it states that "A random variable that depends in a Lipschitz way on many independent variables (but not too much on any of them) is essentially constant". The concentration of measure phenomenon was put forth in the early 1970s by Vitali Milman in his works on the local theory of

s, extending an idea going back to the work of Paul Lévy. It was further developed in the works of Milman and Gromov, Maurey, Pisier, Schechtman, Talagrand, Ledoux, and others.

The general setting

Let

(X, d)

be a

metric space In mathematics, a metric space is a Set (mathematics), set together with a notion of ''distance'' between its Element (mathematics), elements, usually called point (geometry), points. The distance is measured by a function (mathematics), functi ...

with a measure

\mu

on the

Borel set In mathematics, a Borel set is any subset of a topological space that can be formed from its open sets (or, equivalently, from closed sets) through the operations of countable union, countable intersection, and relative complement. Borel sets ...

s with

\mu(X) = 1

. Let :

\alpha(\varepsilon) = \sup \left\,

where :

A_\varepsilon = \left\

is the

\varepsilon

-''extension'' (also called

\varepsilon

-fattening in the context of the Hausdorff distance) of a set

A

. The function

\alpha(\cdot)

is called the ''concentration rate'' of the space

X

. The following equivalent definition has many applications: :

\alpha(\varepsilon) = \sup \left\,

where the supremum is over all 1-Lipschitz functions

F: X \to \mathbb

, and the median (or Levy mean)

M = \operatorname F

is defined by the inequalities :

\mu \ \geq 1/2, \, \mu \ \geq 1/2.

Informally, the space

X

exhibits a concentration phenomenon if

\alpha(\varepsilon)

decays very fast as

\varepsilon

grows. More formally, a family of metric measure spaces

(X_n, d_n, \mu_n)

is called a ''Lévy family'' if the corresponding concentration rates

\alpha_n

satisfy :

\forall \varepsilon > 0 \,\, \alpha_n(\varepsilon) \to 0 \text n\to \infty,

and a ''normal Lévy family'' if :

\forall \varepsilon > 0 \,\, \alpha_n(\varepsilon) \leq C \exp(-c n \varepsilon^2)

for some constants

c,C>0

. For examples see below.

Concentration on the sphere

The first example goes back to Paul Lévy. According to the spherical isoperimetric inequality, among all subsets

A

of the sphere

S^n

with prescribed spherical measure

\sigma_n(A)

, the spherical cap :

\left\,

for suitable

R

, has the smallest

\varepsilon

-extension

A_\varepsilon

(for any

\varepsilon > 0

). Applying this to sets of measure

\sigma_n(A) = 1/2

(where

\sigma_n(S^n) = 1

), one can deduce the following concentration inequality: :

\sigma_n(A_\varepsilon) \geq 1 - C \exp(- c n \varepsilon^2),

where

C,c

are universal constants. Therefore

(S^n)_n

meet the definition above of a normal Lévy family. Vitali Milman applied this fact to several problems in the local theory of Banach spaces, in particular, to give a new proof of Dvoretzky's theorem.

Concentration of measure in physics

All classical statistical physics is based on the concentration of measure phenomena: The fundamental idea (‘theorem’) about equivalence of ensembles in thermodynamic limit ( Gibbs, 1902 and

Einstein Albert Einstein (14 March 187918 April 1955) was a German-born theoretical physicist who is best known for developing the theory of relativity. Einstein also made important contributions to quantum mechanics. His mass–energy equivalence f ...

, 1902–1904) is exactly the thin shell concentration theorem. For each mechanical system consider the

phase space The phase space of a physical system is the set of all possible physical states of the system when described by a given parameterization. Each possible state corresponds uniquely to a point in the phase space. For mechanical systems, the p ...

equipped by the invariant Liouville measure (the phase volume) and conserving energy ''E''. The

microcanonical ensemble In statistical mechanics, the microcanonical ensemble is a statistical ensemble that represents the possible states of a mechanical system whose total energy is exactly specified. The system is assumed to be isolated in the sense that it canno ...

is just an invariant distribution over the surface of constant energy E obtained by Gibbs as the limit of distributions in

with constant density in thin layers between the surfaces of states with energy ''E'' and with energy ''E'' + Δ''E''. The

canonical ensemble In statistical mechanics, a canonical ensemble is the statistical ensemble that represents the possible states of a mechanical system in thermal equilibrium with a heat bath at a fixed temperature. The system can exchange energy with the hea ...

is given by the probability density in the phase space (with respect to the phase volume)

\rho = e^,

where quantities ''F'' = constant and ''T'' = constant are defined by the conditions of probability normalisation and the given expectation of energy ''E''. When the number of particles is large, then the difference between average values of the macroscopic variables for the canonical and microcanonical ensembles tends to zero, and their fluctuations are explicitly evaluated. These results are proven rigorously under some regularity conditions on the energy function ''E'' by Khinchin (1943). The simplest particular case when ''E'' is a sum of squares was well-known in detail before Khinchin and Lévy and even before Gibbs and Einstein. This is the

Maxwell–Boltzmann distribution In physics (in particular in statistical mechanics), the Maxwell–Boltzmann distribution, or Maxwell(ian) distribution, is a particular probability distribution named after James Clerk Maxwell and Ludwig Boltzmann. It was first defined and use ...

of the particle energy in ideal gas. The microcanonical ensemble is very natural from the naïve physical point of view: this is just a natural equidistribution on the isoenergetic hypersurface. The canonical ensemble is very useful because of an important property: if a system consists of two non-interacting subsystems, i.e. if the energy ''E'' is the sum,

E=E_1(X_1)+E_2(X_2)

, where

X_1, X_2

are the states of the subsystems, then the equilibrium states of subsystems are independent, the equilibrium distribution of the system is the product of equilibrium distributions of the subsystems with the same ''T''. The equivalence of these ensembles is the cornerstone of the mechanical foundations of thermodynamics.

Other examples

* Borell–TIS inequality * Gaussian isoperimetric inequality * McDiarmid's inequality * Talagrand's concentration inequality *

Asymptotic equipartition property In information theory, the asymptotic equipartition property (AEP) is a general property of the output samples of a stochastic source. It is fundamental to the concept of typical set used in theories of data compression. Roughly speaking, the t ...

References

External links