In
mathematics
Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. There are many ar ...
, concentration of measure (about a
median
The median of a set of numbers is the value separating the higher half from the lower half of a Sample (statistics), data sample, a statistical population, population, or a probability distribution. For a data set, it may be thought of as the “ ...
) is a principle that is applied in
measure theory
In mathematics, the concept of a measure is a generalization and formalization of geometrical measures (length, area, volume) and other common notions, such as magnitude (mathematics), magnitude, mass, and probability of events. These seemingl ...
,
probability
Probability is a branch of mathematics and statistics concerning events and numerical descriptions of how likely they are to occur. The probability of an event is a number between 0 and 1; the larger the probability, the more likely an e ...
and
combinatorics
Combinatorics is an area of mathematics primarily concerned with counting, both as a means and as an end to obtaining results, and certain properties of finite structures. It is closely related to many other areas of mathematics and has many ...
, and has consequences for other fields such as
Banach space
In mathematics, more specifically in functional analysis, a Banach space (, ) is a complete normed vector space. Thus, a Banach space is a vector space with a metric that allows the computation of vector length and distance between vectors and ...
theory. Informally, it states that "A random variable that depends in a
Lipschitz way on many independent variables (but not too much on any of them) is essentially constant".
The concentration of measure phenomenon was put forth in the early 1970s by
Vitali Milman in his works on the local theory of
Banach space
In mathematics, more specifically in functional analysis, a Banach space (, ) is a complete normed vector space. Thus, a Banach space is a vector space with a metric that allows the computation of vector length and distance between vectors and ...
s, extending an idea going back to the work of
Paul Lévy. It was further developed in the works of Milman and
Gromov,
Maurey,
Pisier,
Schechtman,
Talagrand,
Ledoux, and others.
The general setting
Let
be a
metric space
In mathematics, a metric space is a Set (mathematics), set together with a notion of ''distance'' between its Element (mathematics), elements, usually called point (geometry), points. The distance is measured by a function (mathematics), functi ...
with a
measure on the
Borel set
In mathematics, a Borel set is any subset of a topological space that can be formed from its open sets (or, equivalently, from closed sets) through the operations of countable union, countable intersection, and relative complement. Borel sets ...
s with
.
Let
:
where
:
is the
-''extension'' (also called
-fattening in the context of
the Hausdorff distance) of a set
.
The function
is called the ''concentration rate'' of the space
. The following equivalent definition has many applications:
:
where the supremum is over all 1-Lipschitz functions
, and
the median (or Levy mean)
is defined by the inequalities
:
Informally, the space
exhibits a concentration phenomenon if
decays very fast as
grows. More formally,
a family of metric measure spaces
is called a ''Lévy family'' if
the corresponding concentration rates
satisfy
:
and a ''normal Lévy family'' if
:
for some constants
. For examples see below.
Concentration on the sphere
The first example goes back to
Paul Lévy. According to the
spherical isoperimetric inequality, among all subsets
of the sphere
with prescribed
spherical measure , the spherical cap
:
for suitable
, has the smallest
-extension
(for any
).
Applying this to sets of measure
(where
), one can deduce the following
concentration inequality:
:
where
are universal constants. Therefore
meet the definition above of a normal Lévy family.
Vitali Milman applied this fact to several problems in the local theory of Banach spaces, in particular, to give a new proof of
Dvoretzky's theorem.
Concentration of measure in physics
All classical statistical physics is based on the concentration of measure phenomena:
The fundamental idea (‘theorem’) about equivalence of ensembles in thermodynamic limit (
Gibbs, 1902 and
Einstein
Albert Einstein (14 March 187918 April 1955) was a German-born theoretical physicist who is best known for developing the theory of relativity. Einstein also made important contributions to quantum mechanics. His mass–energy equivalence f ...
, 1902–1904) is exactly the thin shell concentration theorem. For each mechanical system consider the
phase space
The phase space of a physical system is the set of all possible physical states of the system when described by a given parameterization. Each possible state corresponds uniquely to a point in the phase space. For mechanical systems, the p ...
equipped by the invariant
Liouville measure (the phase volume) and conserving energy ''E''. The
microcanonical ensemble
In statistical mechanics, the microcanonical ensemble is a statistical ensemble that represents the possible states of a mechanical system whose total energy is exactly specified. The system is assumed to be isolated in the sense that it canno ...
is just an invariant distribution over the surface of constant energy E obtained by Gibbs as the limit of distributions in
phase space
The phase space of a physical system is the set of all possible physical states of the system when described by a given parameterization. Each possible state corresponds uniquely to a point in the phase space. For mechanical systems, the p ...
with constant density in thin layers between the surfaces of states with energy ''E'' and with energy ''E'' + Δ''E''. The
canonical ensemble
In statistical mechanics, a canonical ensemble is the statistical ensemble that represents the possible states of a mechanical system in thermal equilibrium with a heat bath at a fixed temperature. The system can exchange energy with the hea ...
is given by the probability density in the phase space (with respect to the phase volume)
where quantities ''F'' = constant and ''T'' = constant are defined by the conditions of probability normalisation and the given expectation of energy ''E''.
When the number of particles is large, then the difference between average values of the macroscopic variables for the canonical and microcanonical ensembles tends to zero, and their
fluctuations are explicitly evaluated. These results are proven rigorously under some regularity conditions on the energy function ''E'' by
Khinchin (1943).
The simplest particular case when ''E'' is a sum of squares was well-known in detail before
Khinchin and Lévy and even before Gibbs and Einstein. This is the
Maxwell–Boltzmann distribution
In physics (in particular in statistical mechanics), the Maxwell–Boltzmann distribution, or Maxwell(ian) distribution, is a particular probability distribution named after James Clerk Maxwell and Ludwig Boltzmann.
It was first defined and use ...
of the particle energy in ideal gas.
The microcanonical ensemble is very natural from the naïve physical point of view: this is just a natural equidistribution on the isoenergetic hypersurface. The canonical ensemble is very useful because of an important property: if a system consists of two non-interacting subsystems, i.e. if the energy ''E'' is the sum,
, where
are the states of the subsystems, then the equilibrium states of subsystems are independent, the equilibrium distribution of the system is the product of equilibrium distributions of the subsystems with the same ''T''. The equivalence of these ensembles is the cornerstone of the mechanical foundations of thermodynamics.
Other examples
*
Borell–TIS inequality
*
Gaussian isoperimetric inequality
*
McDiarmid's inequality
*
Talagrand's concentration inequality
*
Asymptotic equipartition property
In information theory, the asymptotic equipartition property (AEP) is a general property of the output samples of a stochastic source. It is fundamental to the concept of typical set used in theories of data compression.
Roughly speaking, the t ...
References
Further reading
*
*
External links
*{{cite web , first=Ferenc , last=Huszár , title=Gaussian Distributions are Soap Bubbles , date=November 9, 2017 , url=https://www.inference.vc/high-dimensional-gaussian-distributions-are-soap-bubble/ – blog post illustrating one of the implications of concentration of measure
Measure theory
Asymptotic geometric analysis