In
probability theory
Probability theory or probability calculus is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expre ...
, an
event is said to happen almost surely (sometimes abbreviated as a.s.) if it happens with
probability
Probability is a branch of mathematics and statistics concerning events and numerical descriptions of how likely they are to occur. The probability of an event is a number between 0 and 1; the larger the probability, the more likely an e ...
1 (with respect to the probability measure). In other words, the set of outcomes on which the event does not occur has probability 0, even though the set might not be empty. The concept is analogous to the concept of "
almost everywhere
In measure theory (a branch of mathematical analysis), a property holds almost everywhere if, in a technical sense, the set for which the property holds takes up nearly all possibilities. The notion of "almost everywhere" is a companion notion to ...
" in
measure theory
In mathematics, the concept of a measure is a generalization and formalization of geometrical measures (length, area, volume) and other common notions, such as magnitude (mathematics), magnitude, mass, and probability of events. These seemingl ...
. In probability experiments on a finite
sample space
In probability theory, the sample space (also called sample description space, possibility space, or outcome space) of an experiment or random trial is the set of all possible outcomes or results of that experiment. A sample space is usually den ...
with a non-zero probability for each outcome, there is no difference between ''almost surely'' and ''surely'' (since having a probability of 1 entails including all the
sample points); however, this distinction becomes important when the sample space is an
infinite set
In set theory, an infinite set is a set that is not a finite set. Infinite sets may be countable or uncountable.
Properties
The set of natural numbers (whose existence is postulated by the axiom of infinity) is infinite. It is the only set ...
, because an infinite set can have non-empty subsets of probability 0.
Some examples of the use of this concept include the strong and uniform versions of the
law of large numbers
In probability theory, the law of large numbers is a mathematical law that states that the average of the results obtained from a large number of independent random samples converges to the true value, if it exists. More formally, the law o ...
, the continuity of the paths of
Brownian motion
Brownian motion is the random motion of particles suspended in a medium (a liquid or a gas). The traditional mathematical formulation of Brownian motion is that of the Wiener process, which is often called Brownian motion, even in mathematical ...
, and the
infinite monkey theorem
The infinite monkey theorem states that a monkey hitting keys independently and at randomness, random on a typewriter keyboard for an infinity, infinite amount of time will almost surely type any given text, including the complete works of Willi ...
. The terms almost certainly (a.c.) and almost always (a.a.) are also used. Almost never describes the opposite of ''almost surely'': an event that happens with probability zero happens ''almost never''.
Formal definition
Let
be a
probability space
In probability theory, a probability space or a probability triple (\Omega, \mathcal, P) is a mathematical construct that provides a formal model of a random process or "experiment". For example, one can define a probability space which models ...
. An
event happens ''almost surely'' if
. Equivalently,
happens almost surely if the probability of
not occurring is
zero
0 (zero) is a number representing an empty quantity. Adding (or subtracting) 0 to any number leaves that number unchanged; in mathematical terminology, 0 is the additive identity of the integers, rational numbers, real numbers, and compl ...
:
. More generally, any set
(not necessarily in
) happens almost surely if
is contained in a
null set
In mathematical analysis, a null set is a Lebesgue measurable set of real numbers that has measure zero. This can be characterized as a set that can be covered by a countable union of intervals of arbitrarily small total length.
The notio ...
: a subset
in
such that
The notion of almost sureness depends on the probability measure
. If it is necessary to emphasize this dependence, it is customary to say that the event
occurs ''P''-almost surely, or almost surely ''
''.
Illustrative examples
In general, an event can happen "almost surely", even if the probability space in question includes outcomes which do not belong to the event—as the following examples illustrate.
Throwing a dart
Imagine throwing a
dart at a
unit square
In mathematics, a unit square is a square whose sides have length . Often, ''the'' unit square refers specifically to the square in the Cartesian plane with corners at the four points ), , , and .
Cartesian coordinates
In a Cartesian coordinat ...
(a square with an
area
Area is the measure of a region's size on a surface. The area of a plane region or ''plane area'' refers to the area of a shape or planar lamina, while '' surface area'' refers to the area of an open surface or the boundary of a three-di ...
of 1) so that the dart always hits an exact
point in the square, in such a way that each point in the square is
equally likely to be hit. Since the square has area 1, the probability that the dart will hit any particular subregion of the square is equal to the area of that subregion. For example, the probability that the dart will hit the right half of the square is 0.5, since the right half has area 0.5.
Next, consider the event that the dart hits exactly a point in the
diagonals of the unit square. Since the area of the diagonals of the square is 0, the probability that the dart will land exactly on a diagonal is 0. That is, the dart will ''almost never'' land on a diagonal (equivalently, it will ''almost surely'' not land on a diagonal), even though the set of points on the diagonals is not empty, and a point on a diagonal is no less possible than any other point.
Tossing a coin repeatedly
Consider the case where a (possibly biased) coin is tossed, corresponding to the probability space
, where the event
occurs if a head is flipped, and
if a tail is flipped. For this particular coin, it is assumed that the probability of flipping a head is
, from which it follows that the complement event, that of flipping a tail, has probability
.
Now, suppose an experiment were conducted where the coin is tossed repeatedly, with outcomes
and the assumption that each flip's outcome is independent of all the others (i.e., they are
independent and identically distributed
Independent or Independents may refer to:
Arts, entertainment, and media Artist groups
* Independents (artist group), a group of modernist painters based in Pennsylvania, United States
* Independentes (English: Independents), a Portuguese artist ...
; ''i.i.d''). Define the sequence of random variables on the coin toss space,
where
. ''i.e.'' each
records the outcome of the
th flip.
In this case, any infinite sequence of heads and tails is a possible outcome of the experiment. However, any particular infinite sequence of heads and tails has probability 0 of being the exact outcome of the (infinite) experiment. This is because the ''i.i.d.'' assumption implies that the probability of flipping all heads over
flips is simply
. Letting
yields 0, since
by assumption. The result is the same no matter how much we bias the coin towards heads, so long as we constrain
to be strictly between 0 and 1. In fact, the same result even holds in non-standard analysis—where infinitesimal probabilities are allowed.
Moreover, the event "the sequence of tosses contains at least one
" will also happen almost surely (i.e., with probability 1).
But if instead of an infinite number of flips, flipping stops after some finite time, say 1,000,000 flips, then the probability of getting an all-heads sequence,
, would no longer be 0, while the probability of getting at least one tails,
, would no longer be 1 (i.e., the event is no longer almost sure).
Asymptotically almost surely
In
asymptotic analysis
In mathematical analysis, asymptotic analysis, also known as asymptotics, is a method of describing Limit (mathematics), limiting behavior.
As an illustration, suppose that we are interested in the properties of a function as becomes very larg ...
, a property is said to hold ''asymptotically almost surely'' (a.a.s.) if over a sequence of sets, the probability converges to 1. This is equivalent to
convergence in probability
In probability theory, there exist several different notions of convergence of sequences of random variables, including ''convergence in probability'', ''convergence in distribution'', and ''almost sure convergence''. The different notions of conve ...
. For instance, in number theory, a large number is asymptotically almost surely
composite, by the
prime number theorem
In mathematics, the prime number theorem (PNT) describes the asymptotic analysis, asymptotic distribution of the prime numbers among the positive integers. It formalizes the intuitive idea that primes become less common as they become larger by p ...
; and in
random graph theory, the statement "
is
connected" (where
denotes the graphs on
vertices with edge probability
) is true a.a.s. when, for some
:
In
number theory
Number theory is a branch of pure mathematics devoted primarily to the study of the integers and arithmetic functions. Number theorists study prime numbers as well as the properties of mathematical objects constructed from integers (for example ...
, this is referred to as "
almost all
In mathematics, the term "almost all" means "all but a negligible quantity". More precisely, if X is a set (mathematics), set, "almost all elements of X" means "all elements of X but those in a negligible set, negligible subset of X". The meaning o ...
", as in "almost all numbers are composite". Similarly, in graph theory, this is sometimes referred to as "almost surely".
See also
*
Almost
In set theory, when dealing with sets of infinite size, the term almost or nearly is used to refer to all but a negligible amount of elements in the set. The notion of "negligible" depends on the context, and may mean "of measure zero" (in a m ...
*
Almost everywhere
In measure theory (a branch of mathematical analysis), a property holds almost everywhere if, in a technical sense, the set for which the property holds takes up nearly all possibilities. The notion of "almost everywhere" is a companion notion to ...
, the corresponding concept in measure theory
*
Convergence of random variables
In probability theory, there exist several different notions of convergence of sequences of random variables, including ''convergence in probability'', ''convergence in distribution'', and ''almost sure convergence''. The different notions of conve ...
, for "almost sure convergence"
*
With high probability
In mathematics, an event that occurs with high probability (often shortened to w.h.p. or WHP) is one whose probability depends on a certain number ''n'' and goes to 1 as ''n'' goes to infinity, i.e. the probability of the event occurring can be m ...
*
Cromwell's rule
Cromwell's rule, named by statistician Dennis Lindley, states that the use of prior probabilities of 1 ("the event will definitely occur") or 0 ("the event will definitely not occur") should be avoided, except when applied to statements that ar ...
, which says that probabilities should almost never be set as zero or one
*
Degenerate distribution
In probability theory, a degenerate distribution on a measure space (E, \mathcal, \mu) is a probability distribution whose support is a null set with respect to \mu. For instance, in the -dimensional space endowed with the Lebesgue measure, an ...
, for "almost surely constant"
*
Infinite monkey theorem
The infinite monkey theorem states that a monkey hitting keys independently and at randomness, random on a typewriter keyboard for an infinity, infinite amount of time will almost surely type any given text, including the complete works of Willi ...
, a theorem using the aforementioned terms
*
List of mathematical jargon
The language of mathematics has a wide vocabulary of specialist and technical terms. It also has a certain amount of jargon: commonly used phrases which are part of the culture of mathematics, rather than of the subject. Jargon often appears in ...
Notes
References
*
*{{cite book , last=Williams , first=David , title=Probability with Martingales , date=1991 , series=Cambridge Mathematical Textbooks , publisher=Cambridge University Press , isbn=978-0521406055
Probability theory
Mathematical terminology