Giry Monad
   HOME

TheInfoList



OR:

In
mathematics Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. There are many ar ...
, the Giry monad is a construction that assigns to a
measurable space In mathematics, a measurable space or Borel space is a basic object in measure theory. It consists of a set and a σ-algebra, which defines the subsets that will be measured. It captures and generalises intuitive notions such as length, area, an ...
a space of
probability measure In mathematics, a probability measure is a real-valued function defined on a set of events in a σ-algebra that satisfies Measure (mathematics), measure properties such as ''countable additivity''. The difference between a probability measure an ...
s over it, equipped with a canonical sigma-algebra. It is one of the main examples of a probability monad. It is implicitly used in
probability theory Probability theory or probability calculus is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expre ...
whenever one considers
probability measure In mathematics, a probability measure is a real-valued function defined on a set of events in a σ-algebra that satisfies Measure (mathematics), measure properties such as ''countable additivity''. The difference between a probability measure an ...
s which depend measurably on a parameter (giving rise to
Markov kernel In probability theory, a Markov kernel (also known as a stochastic kernel or probability kernel) is a map that in the general theory of Markov processes plays the role that the transition matrix does in the theory of Markov processes with a finit ...
s), or when one has ''probability measures over probability measures'' (such as in
de Finetti's theorem In probability theory, de Finetti's theorem states that exchangeable random variables, exchangeable observations are conditionally independent relative to some latent variable. An epistemic probability probability distribution, distribution could ...
). Like many iterable constructions, it has the category-theoretic structure of a
monad Monad may refer to: Philosophy * Monad (philosophy), a term meaning "unit" **Monism, the concept of "one essence" in the metaphysical and theological theory ** Monad (Gnosticism), the most primal aspect of God in Gnosticism * ''Great Monad'', an ...
, on the category of measurable spaces.


Construction

The Giry monad, like every
monad Monad may refer to: Philosophy * Monad (philosophy), a term meaning "unit" **Monism, the concept of "one essence" in the metaphysical and theological theory ** Monad (Gnosticism), the most primal aspect of God in Gnosticism * ''Great Monad'', an ...
, consists of three structures: * A
functorial In mathematics, specifically category theory, a functor is a mapping between categories. Functors were first considered in algebraic topology, where algebraic objects (such as the fundamental group) are associated to topological spaces, and ma ...
assignment, which in this case assigns to a measurable space X a space of probability measures PX over it; * A natural map \delta:X\to PX called the ''unit'', which in this case assigns to each element of a space the
Dirac measure In mathematics, a Dirac measure assigns a size to a set based solely on whether it contains a fixed element ''x'' or not. It is one way of formalizing the idea of the Dirac delta function, an important tool in physics and other technical fields. ...
over it; * A natural map \mathcal:PPX\to PX called the ''multiplication'', which in this case assigns to each ''probability measure over probability measures'' its
expected value In probability theory, the expected value (also called expectation, expectancy, expectation operator, mathematical expectation, mean, expectation value, or first Moment (mathematics), moment) is a generalization of the weighted average. Informa ...
.


The space of probability measures

Let (X, \mathcal) be a
measurable space In mathematics, a measurable space or Borel space is a basic object in measure theory. It consists of a set and a σ-algebra, which defines the subsets that will be measured. It captures and generalises intuitive notions such as length, area, an ...
. Denote by PX the set of
probability measure In mathematics, a probability measure is a real-valued function defined on a set of events in a σ-algebra that satisfies Measure (mathematics), measure properties such as ''countable additivity''. The difference between a probability measure an ...
s over (X, \mathcal). We equip the set PX with a sigma-algebra as follows. First of all, for every measurable set A\in \mathcal, define the map \varepsilon_A:PX\to\mathbb by p\longmapsto p(A). We then define the sigma algebra \mathcal on PX to be the smallest sigma-algebra which makes the maps \varepsilon_A measurable, for all A\in\mathcal (where \mathbb is assumed equipped with the
Borel sigma-algebra In mathematics, a Borel set is any subset of a topological space that can be formed from its open sets (or, equivalently, from closed sets) through the operations of countable union, countable intersection, and relative complement. Borel sets are ...
). Equivalently, \mathcal can be defined as the smallest sigma-algebra on PX which makes the maps :p\longmapsto\int_X f \,dp measurable for all bounded measurable f:X\to\mathbb. The assignment (X,\mathcal)\mapsto (PX,\mathcal ) is part of an
endofunctor In mathematics, specifically category theory, a functor is a mapping between categories. Functors were first considered in algebraic topology, where algebraic objects (such as the fundamental group) are associated to topological spaces, and ma ...
on the category of measurable spaces, usually denoted again by P. Its action on
morphisms In mathematics, a morphism is a concept of category theory that generalizes structure-preserving maps such as homomorphism between algebraic structures, functions from a set to another set, and continuous functions between topological spaces. Al ...
, i.e. on measurable maps, is via the pushforward of measures. Namely, given a measurable map f:(X,\mathcal)\to(Y,\mathcal), one assigns to f the map f_*:(PX,\mathcal )\to(PY,\mathcal ) defined by :f_*p\,(B)=p(f^(B)) for all p\in PX and all measurable sets B\in\mathcal.


The Dirac delta map

Given a measurable space (X,\mathcal), the map \delta:(X,\mathcal)\to(PX,\mathcal) maps an element x\in X to the
Dirac measure In mathematics, a Dirac measure assigns a size to a set based solely on whether it contains a fixed element ''x'' or not. It is one way of formalizing the idea of the Dirac delta function, an important tool in physics and other technical fields. ...
\delta_x\in PX, defined on measurable subsets A\in\mathcal by : \delta_x(A) = 1_A(x) = \begin 1 & \textx\in A, \\ 0 & \textx\notin A. \end


The expectation map

Let \mu\in PPX, i.e. a probability measure over the probability measures over (X,\mathcal). We define the probability measure \mathcal\mu\in PX by : \mathcal\mu(A) = \int_ p(A)\,\mu(dp) for all measurable A\in\mathcal. This gives a measurable,
natural Nature is an inherent character or constitution, particularly of the ecosphere or the universe as a whole. In this general sense nature refers to the laws, elements and phenomena of the physical world, including life. Although humans are part ...
map \mathcal:(PPX,\mathcal)\to(PX,\mathcal).


Example: mixture distributions

A
mixture distribution In probability and statistics, a mixture distribution is the probability distribution of a random variable that is derived from a collection of other random variables as follows: first, a random variable is selected by chance from the collection a ...
, or more generally a
compound distribution In probability and statistics, a compound probability distribution (also known as a mixture distribution or contagious distribution) is the probability distribution that results from assuming that a random variable is distributed according to some ...
, can be seen as an application of the map \mathcal. Let's see this for the case of a finite mixture. Let p_1,\dots,p_n be probability measures on (X,\mathcal), and consider the probability measure q given by the mixture : q(A) = \sum_^n w_i\,p_i(A) for all measurable A\in\mathcal, for some weights w_i\ge 0 satisfying w_1+\dots+w_n=1. We can view the mixture q as the average q=\mathcal\mu, where the measure on measures \mu\in PPX, which in this case is discrete, is given by : \mu = \sum_^n w_i\,\delta_ . More generally, the map \mathcal:PPX\to PX can be seen as the most general, non-parametric way to form arbitrary
mixture In chemistry, a mixture is a material made up of two or more different chemical substances which can be separated by physical method. It is an impure substance made up of 2 or more elements or compounds mechanically mixed together in any proporti ...
or
compound distribution In probability and statistics, a compound probability distribution (also known as a mixture distribution or contagious distribution) is the probability distribution that results from assuming that a random variable is distributed according to some ...
s. The triple (P,\delta,\mathcal) is called the Giry monad.


Relationship with Markov kernels

One of the properties of the sigma-algebra \mathcal is that given
measurable space In mathematics, a measurable space or Borel space is a basic object in measure theory. It consists of a set and a σ-algebra, which defines the subsets that will be measured. It captures and generalises intuitive notions such as length, area, an ...
s (X,\mathcal) and (Y,\mathcal), we have a bijective correspondence between
measurable function In mathematics, and in particular measure theory, a measurable function is a function between the underlying sets of two measurable spaces that preserves the structure of the spaces: the preimage of any measurable set is measurable. This is in ...
s (X,\mathcal)\to(PY,\mathcal) and
Markov kernel In probability theory, a Markov kernel (also known as a stochastic kernel or probability kernel) is a map that in the general theory of Markov processes plays the role that the transition matrix does in the theory of Markov processes with a finit ...
s (X,\mathcal)\to(Y,\mathcal). This allows to view a Markov kernel, equivalently, as a measurably parametrized probability measure. In more detail, given a measurable function f:(X,\mathcal)\to(PY,\mathcal), one can obtain the Markov kernel f^\flat:(X,\mathcal)\to(Y,\mathcal) as follows, : f^\flat(B, x) = f(x)(B) for every x\in X and every measurable B\in\mathcal (note that f(x)\in PY is a probability measure). Conversely, given a Markov kernel k:(X,\mathcal)\to(Y,\mathcal), one can form the measurable function k^\sharp:(X,\mathcal)\to(PY,\mathcal) mapping x\in X to the probability measure k^\sharp(x)\in PY defined by : k^\sharp(x)(B) = k(B, x) for every measurable B\in\mathcal. The two assignments are mutually inverse. From the point of view of
category theory Category theory is a general theory of mathematical structures and their relations. It was introduced by Samuel Eilenberg and Saunders Mac Lane in the middle of the 20th century in their foundational work on algebraic topology. Category theory ...
, we can interpret this correspondence as an adjunction : \mathrm_\mathrm (X,PY) \cong \mathrm_\mathrm (X,Y) between the category of measurable spaces and the
category of Markov kernels In mathematics, the category of Markov kernels, often denoted Stoch, is the category whose objects are measurable spaces and whose morphisms are Markov kernels. It is analogous to the category of sets and functions, but where the arrows can be ...
. In particular, the category of Markov kernels can be seen as the
Kleisli category In category theory, a Kleisli category is a category naturally associated to any monad ''T''. It is equivalent to the category of free ''T''-algebras. The Kleisli category is one of two extremal solutions to the question: "''Does every monad aris ...
of the Giry monad.


Product distributions

Given
measurable space In mathematics, a measurable space or Borel space is a basic object in measure theory. It consists of a set and a σ-algebra, which defines the subsets that will be measured. It captures and generalises intuitive notions such as length, area, an ...
s (X,\mathcal) and (Y,\mathcal), one can form the measurable space (PX,\mathcal)\times (PY,\mathcal)=(X\times Y, \mathcal\times\mathcal) with the
product sigma-algebra Product may refer to: Business * Product (business), an item that can be offered to a market to satisfy the desire or need of a customer. * Product (project management), a deliverable or set of deliverables that contribute to a business solution ...
, which is the product in the category of measurable spaces. Given
probability measure In mathematics, a probability measure is a real-valued function defined on a set of events in a σ-algebra that satisfies Measure (mathematics), measure properties such as ''countable additivity''. The difference between a probability measure an ...
s p\in PX and q\in PY, one can form the
product measure In mathematics, given two measurable spaces and measures on them, one can obtain a product measurable space and a product measure on that space. Conceptually, this is similar to defining the Cartesian product of sets and the product topology o ...
p\otimes q on (X\times Y, \mathcal\times\mathcal). This gives a
natural Nature is an inherent character or constitution, particularly of the ecosphere or the universe as a whole. In this general sense nature refers to the laws, elements and phenomena of the physical world, including life. Although humans are part ...
, measurable map : (PX,\mathcal)\times (PY,\mathcal)\to \big(P(X\times Y), \mathcal\big) usually denoted by \nabla or by \otimes. The map \nabla:PX\times PY\to P(X\times Y) is in general not an isomorphism, since there are probability measures on X\times Y which are not product distributions, for example in case of
correlation In statistics, correlation or dependence is any statistical relationship, whether causal or not, between two random variables or bivariate data. Although in the broadest sense, "correlation" may indicate any type of association, in statistics ...
. However, the maps \nabla:PX\times PY\to P(X\times Y) and the isomorphism 1\cong P1 make the Giry monad a monoidal monad, and so in particular a commutative strong monad.


Further properties

* If a measurable space (X,\mathcal) is standard Borel, so is (PX,\mathcal). Therefore the Giry monad restricts to the
full subcategory In mathematics, specifically category theory, a subcategory of a category ''C'' is a category ''S'' whose objects are objects in ''C'' and whose morphisms are morphisms in ''C'' with the same identities and composition of morphisms. Intuitivel ...
of standard Borel spaces. * The
algebra Algebra is a branch of mathematics that deals with abstract systems, known as algebraic structures, and the manipulation of expressions within those systems. It is a generalization of arithmetic that introduces variables and algebraic ope ...
s for the Giry monad include
compact Compact as used in politics may refer broadly to a pact or treaty; in more specific cases it may refer to: * Interstate compact, a type of agreement used by U.S. states * Blood compact, an ancient ritual of the Philippines * Compact government, a t ...
convex Convex or convexity may refer to: Science and technology * Convex lens, in optics Mathematics * Convex set, containing the whole line segment that joins points ** Convex polygon, a polygon which encloses a convex set of points ** Convex polytop ...
subsets of
Euclidean space Euclidean space is the fundamental space of geometry, intended to represent physical space. Originally, in Euclid's ''Elements'', it was the three-dimensional space of Euclidean geometry, but in modern mathematics there are ''Euclidean spaces ...
s, as well as the extended positive real line ,\infty/math>, with the algebra structure map given by taking
expected values In probability theory, the expected value (also called expectation, expectancy, expectation operator, mathematical expectation, mean, expectation value, or first moment) is a generalization of the weighted average. Informally, the expected val ...
. For example, for ,\infty/math>, the structure map e:P ,\inftyto ,\infty/math> is given by : p \longmapsto \int_ x\,p(dx) :whenever p is supported on ,\infty) and has finite expected value, and e(p)=\infty otherwise.


See also

* Mixture distribution * Compound distribution">Mixture distribution">,\infty) and has finite expected value, and e(p)=\infty otherwise.


See also

* Mixture distribution * Compound distribution * de Finetti theorem * Measurable space *
Markov kernel In probability theory, a Markov kernel (also known as a stochastic kernel or probability kernel) is a map that in the general theory of Markov processes plays the role that the transition matrix does in the theory of Markov processes with a finit ...
*
Monad (category theory) In category theory, a branch of mathematics, a monad is a triple (T, \eta, \mu) consisting of a functor ''T'' from a category to itself and two natural transformations \eta, \mu that satisfy the conditions like associativity. For example, if F, ...
*
Monad (functional programming) In functional programming, monads are a way to structure computations as a sequence of steps, where each step not only produces a value but also some extra information about the computation, such as a potential failure, non-determinism, or side e ...
* Category of measurable spaces *
Category of Markov kernels In mathematics, the category of Markov kernels, often denoted Stoch, is the category whose objects are measurable spaces and whose morphisms are Markov kernels. It is analogous to the category of sets and functions, but where the arrows can be ...
* Categorical probability


Citations


References

* * * * * * * * {{refend


Further reading


Monads of probability, measures, and valuations
in
nLab The ''n''Lab is a wiki for research-level notes, expositions and collaborative work, including original research, in mathematics, physics, and philosophy, with a focus on methods from type theory, category theory, and homotopy theory. The ''n''Lab ...
. * https://ncatlab.org/nlab/show/Giry+monad


External links


What is a probability monad?
video tutorial. Measure theory Probability theory Category theory