HOME

TheInfoList



OR:

The order in probability notation is used in probability theory and statistical theory in direct parallel to the big-O notation that is standard in
mathematics Mathematics is an area of knowledge that includes the topics of numbers, formulas and related structures, shapes and the spaces in which they are contained, and quantities and their changes. These topics are represented in modern mathematics ...
. Where the big-O notation deals with the convergence of sequences or sets of ordinary numbers, the order in probability notation deals with convergence of sets of random variables, where convergence is in the sense of convergence in probability.


Definitions


Small O: convergence in probability

For a set of random variables ''Xn'' and a corresponding set of constants ''an'' (both indexed by ''n'', which need not be discrete), the notation :X_n=o_p(a_n) means that the set of values ''Xn''/''an'' converges to zero in probability as ''n'' approaches an appropriate limit. Equivalently, ''X''''n'' = o''p''(''a''''n'') can be written as ''X''''n''/''a''''n'' = o''p''(1), where ''X''''n'' = o''p''(1) is defined as, :\lim_ P(, \frac, \geq \varepsilon) = 0, for every positive ε. Yvonne M. Bishop, Stephen E.Fienberg, Paul W. Holland. (1975,2007) ''Discrete multivariate analysis'', Springer. ,


Big O: stochastic boundedness

The notation :X_n=O_p(a_n) \text n\to\infty means that the set of values ''Xn''/''an'' is stochastically bounded. That is, for any ''ε'' > 0, there exists a finite ''M'' > 0 and a finite ''N'' > 0 such that :P(, \frac, > M) < \varepsilon,\; \forall \; n > N.


Comparison of the two definitions

The difference between the definition is subtle. If one uses the definition of the limit, one gets: * Big O''p''(1): \forall \varepsilon \quad \exists N_, \delta_ \quad \text P(, X_n, \geq \delta_) \leq \varepsilon \quad \forall n> N_ * Small o''p''(1): \forall \varepsilon, \delta \quad \exists N_ \quad \text P(, X_n, \geq \delta) \leq \varepsilon \quad \forall n> N_ The difference lies in the δ: for stochastic boundedness, it suffices that there exists one (arbitrary large) δ to satisfy the inequality, and δ is allowed to be dependent on ε (hence the δε). On the other side, for convergence, the statement has to hold not only for one, but for any (arbitrary small) δ. In a sense, this means that the sequence must be bounded, with a bound that gets smaller as the sample size increases. This suggests that if a sequence is o''p''(1), then it is O''p''(1), i.e. convergence in probability implies stochastic boundedness. But the reverse does not hold.


Example

If (X_n) is a stochastic sequence such that each element has finite variance, then :X_n - E(X_n) = O_p\left(\sqrt\right) (see Theorem 14.4-1 in Bishop et al.) If, moreover, a_n^\operatorname(X_n) = \operatorname(a_n^X_n) is a null sequence for a sequence (a_n) of real numbers, then a_n^(X_n - E(X_n)) converges to zero in probability by Chebyshev's inequality, so :X_n - E(X_n) = o_p(a_n).


References

{{DEFAULTSORT:Big O In Probability Notation Mathematical notation Probability theory Statistical theory Convergence (mathematics)