In
probability
Probability is a branch of mathematics and statistics concerning events and numerical descriptions of how likely they are to occur. The probability of an event is a number between 0 and 1; the larger the probability, the more likely an e ...
and
statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, a Bernoulli process (named after
Jacob Bernoulli
Jacob Bernoulli (also known as James in English or Jacques in French; – 16 August 1705) was a Swiss mathematician. He sided with Gottfried Wilhelm Leibniz during the Leibniz–Newton calculus controversy and was an early proponent of Leibniz ...
) is a finite or infinite sequence of binary
random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...
s, so it is a
discrete-time stochastic process that takes only two values, canonically 0 and 1. The component Bernoulli variables ''X''
''i'' are
identically distributed and independent. Prosaically, a Bernoulli process is a repeated
coin flipping, possibly with an unfair coin (but with consistent unfairness). Every variable ''X''
''i'' in the sequence is associated with a
Bernoulli trial
In the theory of probability and statistics, a Bernoulli trial (or binomial trial) is a random experiment with exactly two possible outcomes, "success" and "failure", in which the probability of success is the same every time the experiment is ...
or experiment. They all have the same
Bernoulli distribution. Much of what can be said about the Bernoulli process can also be generalized to more than two outcomes (such as the process for a six-sided die); this generalization is known as the
Bernoulli scheme.
The problem of determining the process, given only a limited sample of Bernoulli trials, may be called the problem of
checking whether a coin is fair.
Definition
A ''Bernoulli process'' is a finite or infinite sequence of
independent random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...
s ''X''
1, ''X''
2, ''X''
3, ..., such that
* for each ''i'', the value of ''X''
''i'' is either 0 or 1;
* for all values of
, the probability ''p'' that ''X''
''i'' = 1 is the same.
In other words, a Bernoulli process is a sequence of
independent identically distributed Bernoulli trial
In the theory of probability and statistics, a Bernoulli trial (or binomial trial) is a random experiment with exactly two possible outcomes, "success" and "failure", in which the probability of success is the same every time the experiment is ...
s.
Independence of the trials implies that the process is
memoryless
In probability and statistics, memorylessness is a property of probability distributions. It describes situations where previous failures or elapsed time does not affect future trials or further wait time. Only the geometric and exponential d ...
, in which past event frequencies have no influence on about future event probability frequencies. In most instances the true value of ''p'' is unknown, therefore we use past frequencies to assess/forecast/estimate future events & their probabilities indirectly via applying probabilistic inference upon ''p''.
If the process is infinite, then from any point the future trials constitute a Bernoulli process identical to the whole process, the fresh-start property.
Interpretation
The two possible values of each ''X''
''i'' are often called "success" and "failure". Thus, when expressed as a number 0 or 1, the outcome may be called the number of successes on the ''i''th "trial".
Two other common interpretations of the values are true or false and yes or no. Under any interpretation of the two values, the individual variables ''X''
''i'' may be called
Bernoulli trial
In the theory of probability and statistics, a Bernoulli trial (or binomial trial) is a random experiment with exactly two possible outcomes, "success" and "failure", in which the probability of success is the same every time the experiment is ...
s with parameter p.
In many applications time passes between trials, as the index i increases. In effect, the trials ''X''
1, ''X''
2, ... ''X''
i, ... happen at "points in time" 1, 2, ..., ''i'', .... That passage of time and the associated notions of "past" and "future" are not necessary, however. Most generally, any ''X''
i and ''X''
''j'' in the process are simply two from a set of random variables indexed by , the finite cases, or by , the infinite cases.
One experiment with only two possible outcomes, often referred to as "success" and "failure", usually encoded as 1 and 0, can be modeled as a
Bernoulli distribution.
Several random variables and
probability distribution
In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...
s beside the Bernoullis may be derived from the Bernoulli process:
*The number of successes in the first ''n'' trials, which has a
binomial distribution
In probability theory and statistics, the binomial distribution with parameters and is the discrete probability distribution of the number of successes in a sequence of statistical independence, independent experiment (probability theory) ...
B(''n'', ''p'')
*The number of failures needed to get ''r'' successes, which has a
negative binomial distribution NB(''r'', ''p'')
*The number of failures needed to get one success, which has a
geometric distribution
In probability theory and statistics, the geometric distribution is either one of two discrete probability distributions:
* The probability distribution of the number X of Bernoulli trials needed to get one success, supported on \mathbb = \;
* T ...
NB(1, ''p''), a special case of the negative binomial distribution
The negative binomial variables may be interpreted as random
waiting times.
Formal definition
The Bernoulli process can be formalized in the language of
probability space
In probability theory, a probability space or a probability triple (\Omega, \mathcal, P) is a mathematical construct that provides a formal model of a random process or "experiment". For example, one can define a probability space which models ...
s as a random sequence of independent realisations of a random variable that can take values of heads or tails. The state space for an individual value is denoted by
Borel algebra
Consider the
countably infinite
In mathematics, a set is countable if either it is finite or it can be made in one to one correspondence with the set of natural numbers. Equivalently, a set is ''countable'' if there exists an injective function from it into the natural numbe ...
direct product of copies of
. It is common to examine either the one-sided set
or the two-sided set
. There is a natural
topology
Topology (from the Greek language, Greek words , and ) is the branch of mathematics concerned with the properties of a Mathematical object, geometric object that are preserved under Continuous function, continuous Deformation theory, deformat ...
on this space, called the
product topology
In topology and related areas of mathematics, a product space is the Cartesian product of a family of topological spaces equipped with a natural topology called the product topology. This topology differs from another, perhaps more natural-seemin ...
. The sets in this topology are finite sequences of coin flips, that is, finite-length
strings of ''H'' and ''T'' (''H'' stands for heads and ''T'' stands for tails), with the rest of (infinitely long) sequence taken as "don't care". These sets of finite sequences are referred to as
cylinder sets in the product topology. The set of all such strings forms a
sigma algebra, specifically, a
Borel algebra. This algebra is then commonly written as
where the elements of
are the finite-length sequences of coin flips (the cylinder sets).
Bernoulli measure
If the chances of flipping heads or tails are given by the probabilities
, then one can define a natural
measure on the product space, given by
(or by
for the two-sided process). In another word, if a
discrete random variable ''X'' has a ''Bernoulli distribution'' with parameter ''p'', where 0 ≤ ''p'' ≤ 1, and its
probability mass function
In probability and statistics, a probability mass function (sometimes called ''probability function'' or ''frequency function'') is a function that gives the probability that a discrete random variable is exactly equal to some value. Sometimes i ...
is given by
:
and
.
We denote this distribution by Ber(''p'').
Given a cylinder set, that is, a specific sequence of coin flip results