Independence is a fundamental notion in probability theory, as in statistics and the theory of stochastic processes. Two events are independent, statistically independent, or stochastically independent if, informally speaking, the occurrence of one does not affect the probability of occurrence of the other or, equivalently, does not affect the odds. Similarly, two random variables are independent if the realization of one does not affect the probability distribution of the other.
When dealing with collections of more than two events, two notions of independence need to be distinguished. The events are called pairwise independent if any two events in the collection are independent of each other, while mutual independence (or collective independence) of events means, informally speaking, that each event is independent of any combination of other events in the collection. A similar notion exists for collections of random variables. Mutual independence implies pairwise independence, but not the other way around. In the standard literature of probability theory, statistics, and stochastic processes, independence without further qualification usually refers to mutual independence.
Definition
For events
Two events
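Two events $A$ and $B$ are independent (often written as $A \perp B$ or $A \perp\!\!\!\perp B$, where the latter symbol is often also used for conditional independence) if and only if their joint probability equals the product of their probabilities:
: $\mathrm{P}(A \cap B) = \mathrm{P}(A)\,\mathrm{P}(B)$
If both events have nonzero probability, this forces $\mathrm{P}(A \cap B) > 0$ and hence $A \cap B \neq \emptyset$: two such independent events have common elements in their sample space, so they are not mutually exclusive (mutually exclusive iff $A \cap B = \emptyset$). Why this defines independence is made clear by rewriting with conditional probabilities, where $\mathrm{P}(A \mid B) = \frac{\mathrm{P}(A \cap B)}{\mathrm{P}(B)}$ is the probability that the event $A$ occurs provided that the event $B$ has or is assumed to have occurred. Assuming $\mathrm{P}(B) \neq 0$,
: $\mathrm{P}(A \cap B) = \mathrm{P}(A)\,\mathrm{P}(B) \iff \mathrm{P}(A \mid B) = \frac{\mathrm{P}(A \cap B)}{\mathrm{P}(B)} = \mathrm{P}(A)$
and similarly, assuming $\mathrm{P}(A) \neq 0$,
: $\mathrm{P}(A \cap B) = \mathrm{P}(A)\,\mathrm{P}(B) \iff \mathrm{P}(B \mid A) = \frac{\mathrm{P}(A \cap B)}{\mathrm{P}(A)} = \mathrm{P}(B)$
Thus, the occurrence of $B$ does not affect the probability of $A$, and vice versa. In other words, $A$ and $B$ are independent of each other. Although the derived expressions may seem more intuitive, they are not the preferred definition, as the conditional probabilities may be undefined if $\mathrm{P}(A)$ or $\mathrm{P}(B)$ is 0. Furthermore, the preferred definition makes clear by symmetry that when $A$ is independent of $B$, $B$ is also independent of $A$.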
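As a rough illustration of the product rule for two events, the following sketch checks it exactly on a toy model. The model (one roll of a fair die) and the names `OMEGA`, `prob`, `independent`, `A`, `B` and `C` are assumptions made for this example only.

```python
from fractions import Fraction

# Hypothetical example: one roll of a fair six-sided die.
OMEGA = frozenset(range(1, 7))

def prob(event):
    """Probability of an event (a subset of OMEGA) under the uniform measure."""
    return Fraction(len(event & OMEGA), len(OMEGA))

def independent(a, b):
    """Product-rule test: P(A and B) == P(A) * P(B)."""
    return prob(a & b) == prob(a) * prob(b)

A = frozenset({2, 4, 6})      # "the roll is even"
B = frozenset({1, 2, 3, 4})   # "the roll is at most 4"
C = frozenset({1, 2, 3})      # "the roll is at most 3"

print(independent(A, B))  # True:  1/3 == 1/2 * 2/3
print(independent(A, C))  # False: 1/6 != 1/2 * 1/2
```

Exact rational arithmetic is used so that the equality test is not disturbed by floating-point rounding. In the independent case the conditional form also holds: `prob(A & B) / prob(B)` equals `prob(A)`.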
Odds
Stated in terms of odds, two events are independent if and only if the odds ratio of $A$ and $B$ is unity (1). Analogously with probability, this is equivalent to the conditional odds being equal to the unconditional odds:
: $O(A \mid B) = O(A) \text{ and } O(B \mid A) = O(B),$
or to the odds of one event, given the other event, being the same as the odds of the event, given the other event not occurring:
: $O(A \mid B) = O(A \mid \neg B) \text{ and } O(B \mid A) = O(B \mid \neg A).$
The odds ratio can be defined as
: $O(A \mid B) : O(A \mid \neg B),$
or symmetrically for odds of $B$ given $A$, and thus is 1 if and only if the events are independent.
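The odds formulation can be sketched in the same way. The example below assumes a fair die as the probability space and takes the odds of an event to be P / (1 − P); the helper names `prob`, `odds` and `odds_ratio` are hypothetical.

```python
from fractions import Fraction

OMEGA = frozenset(range(1, 7))  # fair die, an assumed toy example

def prob(event, given=OMEGA):
    """Conditional probability P(event | given) under the uniform measure."""
    return Fraction(len(event & given), len(given))

def odds(event, given=OMEGA):
    """Odds in favour of the event: P / (1 - P). Assumes P < 1."""
    p = prob(event, given)
    return p / (1 - p)

def odds_ratio(a, b):
    """O(A | B) : O(A | not B). Assumes the denominator odds are nonzero."""
    return odds(a, b) / odds(a, OMEGA - b)

A = frozenset({2, 4, 6})      # "the roll is even"
B = frozenset({1, 2, 3, 4})   # "the roll is at most 4"
C = frozenset({1, 2, 3})      # "the roll is at most 3"

print(odds_ratio(A, B))  # 1    (independent)
print(odds_ratio(A, C))  # 1/4  (dependent)
```

The sketch does not handle degenerate cases (conditional probability 0 or 1), where the odds or their ratio are undefined.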
More than two events
A finite set of events $\{A_i\}_{i=1}^{n}$ is pairwise independent if every pair of events is independent; that is, if and only if for all distinct pairs of indices $m, k$,
: $\mathrm{P}(A_m \cap A_k) = \mathrm{P}(A_m)\,\mathrm{P}(A_k)$
A finite set of events is mutually independent if every event is independent of any intersection of the other events; that is, if and only if for every $k \leq n$ and for every $k$ indices $1 \leq i_1 < \dots < i_k \leq n$,
: $\mathrm{P}\left(\bigcap_{j=1}^{k} A_{i_j}\right) = \prod_{j=1}^{k} \mathrm{P}(A_{i_j})$
This is called the ''multiplication rule'' for independent events. It is not a single condition involving only the product of all the probabilities of all single events; it must hold true for all subsets of events.
For more than two events, a mutually independent set of events is (by definition) pairwise independent; but the converse is not necessarily true.
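A standard counterexample shows that pairwise independence does not imply mutual independence. The sketch below assumes two tosses of a fair coin; the event names and helper functions are chosen for illustration only.

```python
from fractions import Fraction
from itertools import combinations
from math import prod

# Two tosses of a fair coin; all four outcomes are equally likely.
OMEGA = frozenset({"HH", "HT", "TH", "TT"})

def prob(event):
    return Fraction(len(event), len(OMEGA))

def product_rule_holds(events):
    """Check P(intersection) == product of the P's for one sub-collection."""
    joint = OMEGA.intersection(*events)
    return prob(joint) == prod(prob(e) for e in events)

def pairwise_independent(events):
    return all(product_rule_holds(pair) for pair in combinations(events, 2))

def mutually_independent(events):
    # The rule must hold for every sub-collection of size k = 2, ..., n.
    return all(
        product_rule_holds(sub)
        for k in range(2, len(events) + 1)
        for sub in combinations(events, k)
    )

A = frozenset({"HH", "HT"})  # first toss is heads
B = frozenset({"HH", "TH"})  # second toss is heads
C = frozenset({"HH", "TT"})  # both tosses agree

print(pairwise_independent([A, B, C]))  # True
print(mutually_independent([A, B, C]))  # False: P(A, B and C) is 1/4, not 1/8
```

Each pair of events has joint probability 1/4 = 1/2 · 1/2, but knowing any two of the events determines the third, so the triple fails the multiplication rule.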
Log probability and information content
Stated in terms of log probability, two events are independent if and only if the log probability of the joint event is the sum of the log probabilities of the individual events:
: $\log \mathrm{P}(A \cap B) = \log \mathrm{P}(A) + \log \mathrm{P}(B)$
In information theory, negative log probability is interpreted as information content, and thus two events are independent if and only if the information content of the combined event equals the sum of the information content of the individual events:
: $\mathrm{I}(A \cap B) = \mathrm{I}(A) + \mathrm{I}(B)$
See Information content § Additivity of independent events for details.
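The additivity of information content can be checked numerically. This is a minimal sketch that assumes base-2 logarithms (information measured in bits) and reuses toy probabilities from one roll of a fair die; the function name `information_content` is hypothetical.

```python
import math

def information_content(p):
    """Self-information in bits of an event with probability p > 0."""
    return -math.log2(p)

# Assumed toy values: A = "even", B = "at most 4", C = "at most 3".
p_a, p_b, p_c = 1 / 2, 2 / 3, 1 / 2
p_ab = 1 / 3   # equals p_a * p_b, so A and B are independent
p_ac = 1 / 6   # differs from p_a * p_c, so A and C are dependent

additive_ab = math.isclose(
    information_content(p_ab),
    information_content(p_a) + information_content(p_b),
)
additive_ac = math.isclose(
    information_content(p_ac),
    information_content(p_a) + information_content(p_c),
)
print(additive_ab)  # True
print(additive_ac)  # False
```

Because floating-point logarithms are involved, the comparison uses `math.isclose` rather than exact equality.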
For real valued random variables
Two random variables
Two random variables $X$ and $Y$ are independent if and only if (iff) the elements of the π-system generated by them are independent; that is to say, for every $x$ and $y$, the events $\{X \leq x\}$ and $\{Y \leq y\}$ are independent events (as defined above for two events). That is, $X$ and $Y$ with cumulative distribution functions $F_X(x)$ and $F_Y(y)$ are independent iff the combined random variable $(X, Y)$ has a joint cumulative distribution function
: $F_{X,Y}(x,y) = F_X(x)\,F_Y(y) \quad \text{for all } x, y$
or equivalently, if the probability densities $f_X(x)$ and $f_Y(y)$ and the joint probability density $f_{X,Y}(x,y)$ exist,
: $f_{X,Y}(x,y) = f_X(x)\,f_Y(y) \quad \text{for all } x, y.$
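The text above states the criterion in terms of distribution functions and densities. For discrete random variables the analogous factorization of the joint probability mass function is an equivalent test, and that is what the following sketch checks. The dictionary representation `{(x, y): probability}` and the function names are assumptions made for this example.

```python
from fractions import Fraction
from itertools import product

def marginals(joint):
    """Marginal pmfs of X and Y from a joint pmf {(x, y): probability}."""
    px, py = {}, {}
    for (x, y), p in joint.items():
        px[x] = px.get(x, 0) + p
        py[y] = py.get(y, 0) + p
    return px, py

def is_independent(joint):
    """True if p(x, y) == p_X(x) * p_Y(y) for every pair of values."""
    px, py = marginals(joint)
    return all(joint.get((x, y), 0) == px[x] * py[y] for x, y in product(px, py))

# X, Y: two separate fair coin flips (0 or 1).
two_flips = {(x, y): Fraction(1, 4) for x, y in product((0, 1), repeat=2)}
# X: a fair coin flip, Y = X (a perfectly dependent copy).
copy = {(0, 0): Fraction(1, 2), (1, 1): Fraction(1, 2)}

print(is_independent(two_flips))  # True
print(is_independent(copy))       # False
```

Pairs that are absent from the dictionary are treated as having probability zero, which is why the dependent example fails at the pair `(0, 1)`.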
More than two random variables
A finite set of $n$ random variables $\{X_1, \ldots, X_n\}$ is pairwise independent if and only if every pair of random variables is independent. Even if the set of random variables is pairwise independent, it is not necessarily ''mutually independent'' as defined next.
A finite set of $n$ random variables $\{X_1, \ldots, X_n\}$ is mutually independent if and only if for any sequence of numbers $\{x_1, \ldots, x_n\}$, the events $\{X_1 \leq x_1\}, \ldots, \{X_n \leq x_n\}$ are mutually independent events (as defined above for more than two events). This is equivalent to the following condition on the joint cumulative distribution function $F_{X_1,\ldots,X_n}(x_1,\ldots,x_n)$: a finite set of $n$ random variables $\{X_1, \ldots, X_n\}$ is mutually independent if and only if
: $F_{X_1,\ldots,X_n}(x_1,\ldots,x_n) = F_{X_1}(x_1) \cdots F_{X_n}(x_n) \quad \text{for all } x_1, \ldots, x_n$
It is not necessary here to require that the probability distribution factorizes for all possible $k$-element subsets as in the case for $n$ events. This is not required because e.g. $F_{X_1,X_2,X_3}(x_1,x_2,x_3) = F_{X_1}(x_1)\,F_{X_2}(x_2)\,F_{X_3}(x_3)$ implies $F_{X_1,X_3}(x_1,x_3) = F_{X_1}(x_1)\,F_{X_3}(x_3)$.
The measure-theoretically inclined reader may prefer to substitute events $\{X \in A\}$ for events $\{X \leq x\}$ in the above definition, where $A$ is any Borel set. That definition is exactly equivalent to the one above when the values of the random variables are real numbers. It has the advantage of working also for complex-valued random variables or for random variables taking values in any measurable space (which includes topological spaces endowed with appropriate σ-algebras).
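The reason the factorization for all $n$ variables already covers every subset can be spelled out in one short derivation. The following is a sketch for three variables; it uses only the standard facts that a marginal distribution function is the limit of the joint one as the unused argument tends to infinity, and that $F_{X_2}(x_2) \to 1$ in that limit.

```latex
\begin{aligned}
F_{X_1,X_3}(x_1,x_3)
  &= \lim_{x_2\to\infty} F_{X_1,X_2,X_3}(x_1,x_2,x_3) \\
  &= F_{X_1}(x_1)\,\Bigl(\lim_{x_2\to\infty} F_{X_2}(x_2)\Bigr)\,F_{X_3}(x_3) \\
  &= F_{X_1}(x_1)\,F_{X_3}(x_3).
\end{aligned}
```

The same argument, applied to each variable that is left out, gives the factorization for any sub-collection.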
For real valued random vectors
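Two random vectors $\mathbf{X} = (X_1, \ldots, X_m)^{\mathrm{T}}$ and $\mathbf{Y} = (Y_1, \ldots, Y_n)^{\mathrm{T}}$ are called independent if
: $F_{\mathbf{X},\mathbf{Y}}(\mathbf{x},\mathbf{y}) = F_{\mathbf{X}}(\mathbf{x}) \cdot F_{\mathbf{Y}}(\mathbf{y}) \quad \text{for all } \mathbf{x}, \mathbf{y}$
where $F_{\mathbf{X}}(\mathbf{x})$ and $F_{\mathbf{Y}}(\mathbf{y})$ denote the cumulative distribution functions of $\mathbf{X}$ and $\mathbf{Y}$ and $F_{\mathbf{X},\mathbf{Y}}(\mathbf{x},\mathbf{y})$ denotes their joint cumulative distribution function. Independence of $\mathbf{X}$ and $\mathbf{Y}$ is often denoted by $\mathbf{X} \perp\!\!\!\perp \mathbf{Y}$.
Written component-wise, $\mathbf{X}$ and $\mathbf{Y}$ are called independent if
: $F_{X_1,\ldots,X_m,Y_1,\ldots,Y_n}(x_1,\ldots,x_m,y_1,\ldots,y_n) = F_{X_1,\ldots,X_m}(x_1,\ldots,x_m) \cdot F_{Y_1,\ldots,Y_n}(y_1,\ldots,y_n) \quad \text{for all } x_1,\ldots,x_m,y_1,\ldots,y_n.$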
For stochastic processes
For one stochastic process
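The definition of independence may be extended from random vectors to a stochastic process. For an independent stochastic process, it is required that the random variables obtained by sampling the process at any $n$ distinct times $t_1, \ldots, t_n$ are independent random variables for any $n$.
Formally, a stochastic process $\{X_t\}_{t \in \mathcal{T}}$ is called independent if and only if for all $n \in \mathbb{N}$ and for all distinct $t_1, \ldots, t_n \in \mathcal{T}$
: $F_{X_{t_1},\ldots,X_{t_n}}(x_1,\ldots,x_n) = F_{X_{t_1}}(x_1) \cdots F_{X_{t_n}}(x_n) \quad \text{for all } x_1, \ldots, x_n$
where $F_{X_{t_1},\ldots,X_{t_n}}(x_1,\ldots,x_n) = \mathrm{P}(X(t_1) \leq x_1, \ldots, X(t_n) \leq x_n)$. Independence of a stochastic process is a property ''within'' a stochastic process, not between two stochastic processes.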
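To make the distinction concrete, the sketch below compares two discrete-time processes built from the same three fair ±1 steps: the steps themselves, which form an independent process, and their running sums (a short random walk), which do not. The three-step model and all names are assumptions for illustration. Since the variables are discrete, the check uses factorization of the joint probability mass function instead of the distribution function.

```python
from fractions import Fraction
from itertools import product

# All equally likely paths of three fair +/-1 steps (an assumed toy model).
PATHS = list(product((-1, 1), repeat=3))

def prob(predicate):
    """Exact probability that a path satisfies the predicate."""
    return Fraction(sum(1 for w in PATHS if predicate(w)), len(PATHS))

def step(t):
    """X_t: the step taken at time t (0-based)."""
    return lambda w: w[t]

def walk(t):
    """S_t: the running sum of the steps up to and including time t."""
    return lambda w: sum(w[: t + 1])

def is_independent_family(variables, values):
    """Check that the joint pmf factorizes for every combination of values."""
    for combo in product(values, repeat=len(variables)):
        joint = prob(lambda w: all(v(w) == c for v, c in zip(variables, combo)))
        marginal_product = 1
        for v, c in zip(variables, combo):
            marginal_product *= prob(lambda w, v=v, c=c: v(w) == c)
        if joint != marginal_product:
            return False
    return True

steps = [step(t) for t in range(3)]
walks = [walk(t) for t in range(3)]

print(is_independent_family(steps, (-1, 1)))       # True
print(is_independent_family(walks, range(-3, 4)))  # False
```

The walk fails because, for instance, the path with sums 1, 2, 3 has probability 1/8, whereas the product of the three marginal probabilities is 1/2 · 1/4 · 1/8.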
For two stochastic processes
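Independence of two stochastic processes is a property between two stochastic processes $\{X_t\}_{t \in \mathcal{T}}$ and $\{Y_t\}_{t \in \mathcal{T}}$ that are defined on the same probability space $(\Omega, \mathcal{F}, \mathrm{P})$. Formally, two stochastic processes $\{X_t\}_{t \in \mathcal{T}}$ and $\{Y_t\}_{t \in \mathcal{T}}$ are said to be independent if for all $n \in \mathbb{N}$ and for all $t_1, \ldots, t_n \in \mathcal{T}$, the random vectors $(X(t_1), \ldots, X(t_n))$ and $(Y(t_1), \ldots, Y(t_n))$ are independent, i.e. if
: $F_{X_{t_1},\ldots,X_{t_n},Y_{t_1},\ldots,Y_{t_n}}(x_1,\ldots,x_n,y_1,\ldots,y_n) = F_{X_{t_1},\ldots,X_{t_n}}(x_1,\ldots,x_n) \cdot F_{Y_{t_1},\ldots,Y_{t_n}}(y_1,\ldots,y_n) \quad \text{for all } x_1,\ldots,x_n,y_1,\ldots,y_n.$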
Independent σ-algebras
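The definitions above for events and for random variables are both generalized by the following definition of independence for σ-algebras. Let $(\Omega, \Sigma, \mathrm{P})$ be a probability space and let $\mathcal{A}$ and $\mathcal{B}$ be two sub-σ-algebras of $\Sigma$. $\mathcal{A}$ and $\mathcal{B}$ are said to be independent if, whenever $A \in \mathcal{A}$ and $B \in \mathcal{B}$,
: $\mathrm{P}(A \cap B) = \mathrm{P}(A)\,\mathrm{P}(B).$
Likewise, a finite family of σ-algebras $(\tau_i)_{i \in I}$, where $I$ is an index set, is said to be independent if and only if
: $\forall \left(A_i\right)_{i \in I} \in \prod_{i \in I} \tau_i : \ \mathrm{P}\left(\bigcap_{i \in I} A_i\right) = \prod_{i \in I} \mathrm{P}(A_i)$
and an infinite family of σ-algebras is said to be independent if all its finite subfamilies are independent.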
The new definition relates to the previous ones very directly:
* Two events are independent (in the old sense) if and only if the σ-algebras that they generate are independent (in the new sense). The σ-algebra generated by an event $E \in \Sigma$ is, by definition,
:: $\sigma(\{E\}) = \{\emptyset, E, \Omega \setminus E, \Omega\}.$
* Two random variables $X$ and $Y$ defined over $\Omega$ are independent (in the old sense) if and only if the σ-algebras that they generate are independent (in the new sense). The σ-algebra generated by a random variable $X$ taking values in some measurable space $S$ consists, by definition, of all subsets of $\Omega$ of the form $X^{-1}(U)$, where $U$ is any measurable subset of $S$.
Using this definition, it is easy to show that if $X$ and $Y$ are random variables and $Y$ is constant, then $X$ and $Y$ are independent, since the σ-algebra generated by a constant random variable is the trivial σ-algebra $\{\emptyset, \Omega\}$. Probability zero events cannot affect independence, so independence also holds if $Y$ is only $\mathrm{P}$-almost surely constant.
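On a finite sample space the generated σ-algebras can be enumerated explicitly, which allows the statements above to be checked by brute force. The following is a sketch under the assumption of a fair die as the probability space; `sigma_of_event`, `sigma_of_variable` and `independent` are hypothetical helper names.

```python
from fractions import Fraction
from itertools import product

OMEGA = frozenset(range(1, 7))  # fair die, an assumed toy probability space

def prob(event):
    return Fraction(len(event), len(OMEGA))

def sigma_of_event(e):
    """sigma({E}) = {empty set, E, complement of E, Omega}."""
    return {frozenset(), e, OMEGA - e, OMEGA}

def sigma_of_variable(f):
    """Sigma-algebra generated by f on the finite OMEGA:
    all unions of its level sets {f = value}."""
    levels = {}
    for w in OMEGA:
        levels.setdefault(f(w), set()).add(w)
    atoms = [frozenset(s) for s in levels.values()]
    sigma = set()
    for mask in product((False, True), repeat=len(atoms)):
        sigma.add(frozenset().union(*(a for a, keep in zip(atoms, mask) if keep)))
    return sigma

def independent(sig1, sig2):
    """Product rule for every pair of sets drawn from the two sigma-algebras."""
    return all(prob(a & b) == prob(a) * prob(b) for a in sig1 for b in sig2)

A = frozenset({2, 4, 6})      # "the roll is even"
B = frozenset({1, 2, 3, 4})   # "the roll is at most 4"
C = frozenset({1, 2, 3})      # "the roll is at most 3"

print(independent(sigma_of_event(A), sigma_of_event(B)))  # True
print(independent(sigma_of_event(A), sigma_of_event(C)))  # False

identity = sigma_of_variable(lambda w: w)   # every subset of OMEGA
constant = sigma_of_variable(lambda w: 7)   # the trivial sigma-algebra
print(independent(identity, constant))      # True
```

The last check illustrates the remark about constant random variables: the trivial σ-algebra is independent of every other σ-algebra, here even of the full power set.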
Properties
Self-independence
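Note that an event is independent of itself if and only if
: $\mathrm{P}(A) = \mathrm{P}(A \cap A) = \mathrm{P}(A) \cdot \mathrm{P}(A) \iff \mathrm{P}(A) = 0 \text{ or } \mathrm{P}(A) = 1.$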
Thus an event is independent of itself if and only if it almost surely occurs or its complement almost surely occurs; this fact is useful when proving zero–one laws.
Expectation and covariance
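If $X$ and $Y$ are statistically independent random variables, then the expectation operator $\operatorname{E}$ has the property
: $\operatorname{E}[X^n Y^m] = \operatorname{E}[X^n]\,\operatorname{E}[Y^m],$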
and the covariance