Intuitively, an algorithmically random sequence (or

random sequence The concept of a random sequence is essential in probability theory and statistics. The concept generally relies on the notion of a sequence of random variables and many statistical discussions begin with the words "let ''X''1,...,''Xn'' be independ ...

) is a

sequence In mathematics, a sequence is an enumerated collection of objects in which repetitions are allowed and order matters. Like a set, it contains members (also called ''elements'', or ''terms''). The number of elements (possibly infinite) is calle ...

of binary digits that appears random to any algorithm running on a (prefix-free or not)

universal Turing machine In computer science, a universal Turing machine (UTM) is a Turing machine that can simulate an arbitrary Turing machine on arbitrary input. The universal machine essentially achieves this by reading both the description of the machine to be simu ...

. The notion can be applied analogously to sequences on any finite alphabet (e.g. decimal digits). Random sequences are key objects of study in

algorithmic information theory Algorithmic information theory (AIT) is a branch of theoretical computer science that concerns itself with the relationship between computation and information of computably generated objects (as opposed to stochastically generated), such as str ...

. As different types of algorithms are sometimes considered, ranging from algorithms with specific bounds on their running time to algorithms which may ask questions of an

oracle machine In complexity theory and computability theory, an oracle machine is an abstract machine used to study decision problems. It can be visualized as a Turing machine with a black box, called an oracle, which is able to solve certain problems in a ...

, there are different notions of randomness. The most common of these is known as Martin-Löf randomness (K-randomness or 1-randomness), but stronger and weaker forms of randomness also exist. When the term "algorithmically random" is used to refer to a particular single (finite or infinite) sequence without clarification, it is usually taken to mean "incompressible" or, in the case the sequence is infinite and prefix algorithmically random (i.e., K-incompressible), "Martin-Löf–Chaitin random". It is important to disambiguate between algorithmic randomness and stochastic randomness. Unlike algorithmic randomness, which is defined for computable (and thus deterministic) processes, stochastic randomness is usually said to be a property of a sequence that is a priori known to be generated by (or is the outcome of) an

independent Independent or Independents may refer to: Arts, entertainment, and media Artist groups * Independents (artist group), a group of modernist painters based in the New Hope, Pennsylvania, area of the United States during the early 1930s * Independ ...

identically

distributed Distribution may refer to: Mathematics *Distribution (mathematics), generalized functions used to formulate solutions of partial differential equations *Probability distribution, the probability of a particular value or value range of a varia ...

equiprobable stochastic process. Because infinite sequences of binary digits can be identified with real numbers in the unit interval, random binary sequences are often called (algorithmically) random real numbers. Additionally, infinite binary sequences correspond to characteristic functions of sets of natural numbers; therefore those sequences might be seen as sets of natural numbers. The class of all Martin-Löf random (binary) sequences is denoted by RAND or MLR.

History

The first suitable definition of a random sequence was given by

Per Martin-Löf Per Erik Rutger Martin-Löf (; ; born 8 May 1942) is a Swedish logician, philosopher, and mathematical statistician. He is internationally renowned for his work on the foundations of probability, statistics, mathematical logic, and computer scie ...

in 1966. Earlier researchers such as

Richard von Mises Richard Edler von Mises (; 19 April 1883 – 14 July 1953) was an Austrian scientist and mathematician who worked on solid mechanics, fluid mechanics, aerodynamics, aeronautics, statistics and probability theory. He held the position of Gordo ...

had attempted to formalize the notion of a test for randomness in order to define a random sequence as one that passed all tests for randomness; however, the precise notion of a randomness test was left vague. Martin-Löf's key insight was to use the

theory of computation In theoretical computer science and mathematics, the theory of computation is the branch that deals with what problems can be solved on a model of computation, using an algorithm, how algorithmic efficiency, efficiently they can be solved or t ...

to formally define the notion of a test for randomness. This contrasts with the idea of randomness in

probability Probability is the branch of mathematics concerning numerical descriptions of how likely an event is to occur, or how likely it is that a proposition is true. The probability of an event is a number between 0 and 1, where, roughly speakin ...

; in that theory, no particular element of a sample space can be said to be random. Since its inception, Martin-Löf randomness has been shown to admit many equivalent characterizations—in terms of

compression Compression may refer to: Physical science *Compression (physics), size reduction due to forces *Compression member, a structural element such as a column *Compressibility, susceptibility to compression * Gas compression *Compression ratio, of a ...

, randomness tests, and

gambling Gambling (also known as betting or gaming) is the wagering of something of value ("the stakes") on a random event with the intent of winning something else of value, where instances of strategy are discounted. Gambling thus requires three el ...

—that bear little outward resemblance to the original definition, but each of which satisfy our intuitive notion of properties that random sequences ought to have: random sequences should be incompressible, they should pass statistical tests for randomness, and it should be difficult to make money

betting Gambling (also known as betting or gaming) is the wagering of something of value ("the stakes") on a random event with the intent of winning something else of value, where instances of strategy are discounted. Gambling thus requires three elem ...

on them. The existence of these multiple definitions of Martin-Löf randomness, and the stability of these definitions under different models of computation, give evidence that Martin-Löf randomness is a fundamental property of mathematics and not an accident of Martin-Löf's particular model. The thesis that the definition of Martin-Löf randomness "correctly" captures the intuitive notion of randomness has been called the Martin-Löf–Chaitin Thesis; it is somewhat similar to the Church–Turing thesis.

Three equivalent definitions

Martin-Löf's original definition of a random sequence was in terms of constructive null covers; he defined a sequence to be random if it is not contained in any such cover.

Gregory Chaitin Gregory John Chaitin ( ; born 25 June 1947) is an Argentine-American mathematician and computer scientist. Beginning in the late 1960s, Chaitin made contributions to algorithmic information theory and metamathematics, in particular a computer-t ...

Leonid Levin Leonid Anatolievich Levin ( ; russian: Леони́д Анато́льевич Ле́вин; uk, Леоні́д Анато́лійович Ле́він; born November 2, 1948) is a Soviet-American mathematician and computer scientist. He is kn ...

and Claus-Peter Schnorr proved a characterization in terms of

algorithmic complexity Algorithmic may refer to: *Algorithm, step-by-step instructions for a calculation **Algorithmic art, art made by an algorithm **Algorithmic composition, music made by an algorithm ** Algorithmic trading, trading decisions made by an algorithm **Alg ...

: a sequence is random if there is a uniform bound on the compressibility of its initial segments. Schnorr gave a third equivalent definition in terms of martingales. Li and Vitanyi's boo
''An Introduction to Kolmogorov Complexity and Its Applications''
is the standard introduction to these ideas. * Algorithmic complexity (Chaitin 1969, Schnorr 1973, Levin 1973): Algorithmic complexity (also known as (prefix-free)

Kolmogorov complexity In algorithmic information theory (a subfield of computer science and mathematics), the Kolmogorov complexity of an object, such as a piece of text, is the length of a shortest computer program (in a predetermined programming language) that produ ...

or program-size complexity) can be thought of as a lower bound on the algorithmic compressibility of a finite sequence (of characters or binary digits). It assigns to each such sequence ''w'' a natural number ''K(w)'' that, intuitively, measures the minimum length of a computer program (written in some fixed programming language) that takes no input and will output ''w'' when run. The complexity is required to be prefix-free: The program (a sequence of 0 and 1) is followed by an infinite string of 0s, and the length of the program (assuming it halts) includes the number of zeroes to the right of the program that the universal Turing machine reads. The additional requirement is needed because we can choose a length such that the length codes information about the substring. Given a natural number ''c'' and a sequence ''w'', we say that ''w'' is ''c''-incompressible if

K(w) \geq , w,  - c

. :''An infinite sequence ''S'' is Martin-Löf random if and only if there is a constant ''c'' such that all of ''Ss finite

prefixes A prefix is an affix which is placed before the stem of a word. Adding it to the beginning of one word changes it into another word. For example, when the prefix ''un-'' is added to the word ''happy'', it creates the word ''unhappy''. Particu ...

are ''c''-incompressible.'' * Constructive null covers (Martin-Löf 1966): This is Martin-Löf's original definition. For a finite binary string ''w'' we let ''C_w'' denote the cylinder generated by ''w''. This is the set of all infinite sequences beginning with ''w'', which is a basic open set in

Cantor space In mathematics, a Cantor space, named for Georg Cantor, is a topological abstraction of the classical Cantor set: a topological space is a Cantor space if it is homeomorphic to the Cantor set. In set theory, the topological space 2ω is called "the ...

. The product measure μ(''C_w'') of the cylinder generated by ''w'' is defined to be 2^{−, ''w'',}. Every open subset of Cantor space is the union of a countable sequence of disjoint basic open sets, and the measure of an open set is the sum of the measures of any such sequence. An ''effective open set'' is an open set that is the union of the sequence of basic open sets determined by a

recursively enumerable In computability theory, a set ''S'' of natural numbers is called computably enumerable (c.e.), recursively enumerable (r.e.), semidecidable, partially decidable, listable, provable or Turing-recognizable if: *There is an algorithm such that the ...

sequence of binary strings. A ''constructive null cover'' or ''effective measure ''0'' set'' is a recursively enumerable sequence

U_i

of effective open sets such that

U_ \subseteq U_i

and

\mu (U_i) \leq 2^

for each natural number ''i''. Every effective null cover determines a

G_\delta

set of measure 0, namely the intersection of the sets

U_i

. :''A sequence is defined to be Martin-Löf random if it is not contained in any

G_\delta

set determined by a constructive null cover.'' * Constructive martingales (Schnorr 1971): A martingale is a function

d:\^*\to,\infty)

such that, for all finite strings ''w'',

d(w) = (d(w^\smallfrown 0) + d(w^\smallfrown 1))/2

, where

a^\smallfrown b

is the concatenation of the strings ''a'' and ''b''. This is called the "fairness condition": if a martingale is viewed as a betting strategy, then the above condition requires that the bettor plays against fair odds. A martingale ''d'' is said to succeed on a sequence ''S'' if

\limsup_ d(S \upharpoonright n) = \infty,

where

S \upharpoonright n

is the first ''n'' bits of ''S''. A martingale ''d'' is constructive (also known as weakly computable, lower semi-computable) if there exists a computable function

\widehat:\^*\times\N\to

such that, for all finite binary strings ''w'' #

\widehat(w,t) \leq \widehat(w,t+1) < d(w),

for all positive integers ''t'', #

\lim_ \widehat(w,t) = d(w).

:''A sequence is Martin-Löf random if and only if no constructive martingale succeeds on it.''

Interpretations of the definitions

The Kolmogorov complexity characterization conveys the intuition that a random sequence is incompressible: no prefix can be produced by a program much shorter than the prefix. The null cover characterization conveys the intuition that a random real number should not have any property that is "uncommon". Each measure 0 set can be thought of as an uncommon property. It is not possible for a sequence to lie in no measure 0 sets, because each one-point set has measure 0. Martin-Löf's idea was to limit the definition to measure 0 sets that are effectively describable; the definition of an effective null cover determines a countable collection of effectively describable measure 0 sets and defines a sequence to be random if it does not lie in any of these particular measure 0 sets. Since the union of a countable collection of measure 0 sets has measure 0, this definition immediately leads to the theorem that there is a measure 1 set of random sequences. Note that if we identify the Cantor space of binary sequences with the interval [0,1] of real numbers, the measure on Cantor space agrees with Lebesgue measure. The martingale characterization conveys the intuition that no effective procedure should be able to make money betting against a random sequence. A martingale ''d'' is a betting strategy. ''d'' reads a finite string ''w'' and bets money on the next bit. It bets some fraction of its money that the next bit will be 0, and then remainder of its money that the next bit will be 1. ''d'' doubles the money it placed on the bit that actually occurred, and it loses the rest. ''d''(''w'') is the amount of money it has after seeing the string ''w''. Since the bet placed after seeing the string ''w'' can be calculated from the values ''d''(''w''), ''d''(''w''0), and ''d''(''w''1), calculating the amount of money it has is equivalent to calculating the bet. The martingale characterization says that no betting strategy implementable by any computer (even in the weak sense of constructive strategies, which are not necessarily

computable Computability is the ability to solve a problem in an effective manner. It is a key topic of the field of computability theory within mathematical logic and the theory of computation within computer science. The computability of a problem is close ...

) can make money betting on a random sequence.

Properties and examples of Martin-Löf random sequences

* Chaitin's halting probability Ω is an example of a random sequence. * RAND^c (the

complement A complement is something that completes something else. Complement may refer specifically to: The arts * Complement (music), an interval that, when added to another, spans an octave ** Aggregate complementation, the separation of pitch-clas ...

of RAND) is a measure 0 subset of the set of all infinite sequences. This is implied by the fact that each constructive null cover covers a measure 0 set, there are only

countably many In mathematics, a set is countable if either it is finite or it can be made in one to one correspondence with the set of natural numbers. Equivalently, a set is ''countable'' if there exists an injective function from it into the natural number ...

constructive null covers, and a countable union of measure 0 sets has measure 0. This implies that RAND is a measure 1 subset of the set of all infinite sequences. * Every random sequence is

normal Normal(s) or The Normal(s) may refer to: Film and television * ''Normal'' (2003 film), starring Jessica Lange and Tom Wilkinson * ''Normal'' (2007 film), starring Carrie-Anne Moss, Kevin Zegers, Callum Keith Rennie, and Andrew Airlie * ''Norma ...

. * There is a constructive null cover of RAND^c. This means that all effective tests for randomness (that is, constructive null covers) are, in a sense, subsumed by this ''universal'' test for randomness, since any sequence that passes this single test for randomness will pass all tests for randomness. (Martin-Löf 1966) * There is a ''universal'' constructive martingale d. This martingale is universal in the sense that, given any constructive martingale ''d'', if ''d'' succeeds on a sequence, then d succeeds on that sequence as well. Thus, d succeeds on every sequence in RAND^c (but, since d is constructive, it succeeds on no sequence in RAND). (Schnorr 1971) * The class RAND is a

\Sigma^0_2

subset of Cantor space, where

\Sigma^0_2

refers to the second level of the

arithmetical hierarchy In mathematical logic, the arithmetical hierarchy, arithmetic hierarchy or Kleene–Mostowski hierarchy (after mathematicians Stephen Cole Kleene and Andrzej Mostowski) classifies certain sets based on the complexity of formulas that define th ...

. This is because a sequence ''S'' is in RAND if and only if there is some open set in the universal effective null cover that does not contain ''S''; this property can be seen to be definable by a

\Sigma^0_2

formula. * There is a random sequence which is

\Delta^0_2

, that is, computable relative to an oracle for the Halting problem. (Schnorr 1971) Chaitin's Ω is an example of such a sequence. * No random sequence is decidable,

computably enumerable In computability theory, a set ''S'' of natural numbers is called computably enumerable (c.e.), recursively enumerable (r.e.), semidecidable, partially decidable, listable, provable or Turing-recognizable if: *There is an algorithm such that the ...

, or co-computably-enumerable. Since these correspond to the

\Delta_1^0

\Sigma_1^0

, and

\Pi_1^0

levels of the

, this means that

\Delta_2^0

is the lowest level in the arithmetical hierarchy where random sequences can be found. * Every sequence is Turing reducible to some random sequence. (Kučera 1985/1989, Gács 1986). Thus there are random sequences of arbitrarily high

Turing degree In computer science and mathematical logic the Turing degree (named after Alan Turing) or degree of unsolvability of a set of natural numbers measures the level of algorithmic unsolvability of the set. Overview The concept of Turing degree is fund ...

Relative randomness

As each of the equivalent definitions of a Martin-Löf random sequence is based on what is computable by some Turing machine, one can naturally ask what is computable by a Turing

. For a fixed oracle ''A'', a sequence ''B'' which is not only random but in fact, satisfies the equivalent definitions for computability relative to ''A'' (e.g., no martingale which is constructive relative to the oracle ''A'' succeeds on ''B'') is said to be random relative to ''A''. Two sequences, while themselves random, may contain very similar information, and therefore neither will be random relative to the other. Any time there is a

Turing reduction In computability theory, a Turing reduction from a decision problem A to a decision problem B is an oracle machine which decides problem A given an oracle for B (Rogers 1967, Soare 1987). It can be understood as an algorithm that could be used to s ...

from one sequence to another, the second sequence cannot be random relative to the first, just as computable sequences are themselves nonrandom; in particular, this means that Chaitin's Ω is not random relative to the

halting problem In computability theory, the halting problem is the problem of determining, from a description of an arbitrary computer program and an input, whether the program will finish running, or continue to run forever. Alan Turing proved in 1936 that a ...

. An important result relating to relative randomness is van Lambalgen's theorem, which states that if ''C'' is the sequence composed from ''A'' and ''B'' by interleaving the first bit of ''A'', the first bit of ''B'', the second bit of ''A'', the second bit of ''B'', and so on, then ''C'' is algorithmically random if and only if ''A'' is algorithmically random, and ''B'' is algorithmically random relative to ''A''. A closely related consequence is that if ''A'' and ''B'' are both random themselves, then ''A'' is random relative to ''B'' if and only if ''B'' is random relative to ''A''.

Stronger than Martin-Löf randomness

Relative randomness gives us the first notion which is stronger than Martin-Löf randomness, which is randomness relative to some fixed oracle ''A''. For any oracle, this is at least as strong, and for most oracles, it is strictly stronger, since there will be Martin-Löf random sequences which are not random relative to the oracle ''A''. Important oracles often considered are the halting problem,

\emptyset '

, and the ''n''th jump oracle,

\emptyset^

, as these oracles are able to answer specific questions which naturally arise. A sequence which is random relative to the oracle

\emptyset^

is called ''n''-random; a sequence is 1-random, therefore, if and only if it is Martin-Löf random. A sequence which is ''n''-random for every ''n'' is called arithmetically random. The ''n''-random sequences sometimes arise when considering more complicated properties. For example, there are only countably many

\Delta^0_2

sets, so one might think that these should be non-random. However, the halting probability Ω is

\Delta^0_2

and 1-random; it is only after 2-randomness is reached that it is impossible for a random set to be

\Delta^0_2

Weaker than Martin-Löf randomness

Additionally, there are several notions of randomness which are weaker than Martin-Löf randomness. Some of these are weak 1-randomness, Schnorr randomness, computable randomness, partial computable randomness. Yongge Wang showed Yongge Wang: Randomness and Complexity. PhD Thesis, 1996, http://webpages.uncc.edu/yonwang/papers/thesis.pdf that Schnorr randomness is different from computable randomness. Additionally, Kolmogorov–Loveland randomness is known to be no stronger than Martin-Löf randomness, but it is not known whether it is actually weaker. At the opposite end of the randomness spectrum there is the notion of a K-trivial set. These sets are anti-random in that all initial segment is logarithmically compressible (i.e.,

K(w) \leq K(, w, ) + b

for each initial segment w), but they are not computable.