Linear_congruential_generator_visualisation

A linear congruential generator (LCG) is an

algorithm In mathematics and computer science, an algorithm () is a finite sequence of Rigour#Mathematics, mathematically rigorous instructions, typically used to solve a class of specific Computational problem, problems or to perform a computation. Algo ...

that yields a sequence of pseudo-randomized numbers calculated with a discontinuous piecewise linear equation. The method represents one of the oldest and best-known

pseudorandom number generator A pseudorandom number generator (PRNG), also known as a deterministic random bit generator (DRBG), is an algorithm for generating a sequence of numbers whose properties approximate the properties of sequences of random number generation, random n ...

algorithms. The theory behind them is relatively easy to understand, and they are easily implemented and fast, especially on computer hardware which can provide

modular arithmetic In mathematics, modular arithmetic is a system of arithmetic operations for integers, other than the usual ones from elementary arithmetic, where numbers "wrap around" when reaching a certain value, called the modulus. The modern approach to mo ...

by storage-bit truncation. The generator is defined by the

recurrence relation In mathematics, a recurrence relation is an equation according to which the nth term of a sequence of numbers is equal to some combination of the previous terms. Often, only k previous terms of the sequence appear in the equation, for a parameter ...

: :

X_ = \left( a X_n + c \right)\bmod m

where

X

is the

sequence In mathematics, a sequence is an enumerated collection of objects in which repetitions are allowed and order matters. Like a set, it contains members (also called ''elements'', or ''terms''). The number of elements (possibly infinite) is cal ...

of pseudo-random values, and :

m,\, 0 — the " modulus "
: a,\,0 < a < m — the "multiplier"
: c,\,0 \le c < m — the "increment"
: X_0,\,0 \le X_0 < m — the "seed" or "start value"

are

integer An integer is the number zero (0), a positive natural number (1, 2, 3, ...), or the negation of a positive natural number (−1, −2, −3, ...). The negations or additive inverses of the positive natural numbers are referred to as negative in ...

constants that specify the generator. If ''c'' = 0, the generator is often called a multiplicative congruential generator (MCG), or Lehmer RNG. If ''c'' ≠ 0, the method is called a mixed congruential generator. When ''c'' ≠ 0, a mathematician would call the recurrence an

affine transformation In Euclidean geometry, an affine transformation or affinity (from the Latin, '' affinis'', "connected with") is a geometric transformation that preserves lines and parallelism, but not necessarily Euclidean distances and angles. More general ...

, not a

linear In mathematics, the term ''linear'' is used in two distinct senses for two different properties: * linearity of a '' function'' (or '' mapping''); * linearity of a '' polynomial''. An example of a linear function is the function defined by f(x) ...

one, but the misnomer is well-established in computer science. Associated software and data at https://github.com/vigna/CPRNG.

History

The Lehmer generator was published in 1951 and the Linear congruential generator was published in 1958 by W. E. Thomson and A. Rotenberg.

Period length

A benefit of LCGs is that an appropriate choice of parameters results in a period which is both known and long. Although not the only criterion, too short a period is a fatal flaw in a pseudorandom number generator. While LCGs are capable of producing pseudorandom numbers which can pass formal tests for randomness, the quality of the output is extremely sensitive to the choice of the parameters ''m'' and ''a''. For example, ''a'' = 1 and ''c'' = 1 produces a simple modulo-''m'' counter, which has a long period, but is obviously non-random. Other values of ''c''

coprime In number theory, two integers and are coprime, relatively prime or mutually prime if the only positive integer that is a divisor of both of them is 1. Consequently, any prime number that divides does not divide , and vice versa. This is equiv ...

to ''m'' produce a Weyl sequence, which is better distributed but still obviously non-random. Historically, poor choices for ''a'' have led to ineffective implementations of LCGs. A particularly illustrative example of this is RANDU, which was widely used in the early 1970s and led to many results which are currently being questioned because of the use of this poor LCG. There are three common families of parameter choice:

''m'' prime, ''c'' = 0

This is the original Lehmer RNG construction. The period is ''m''−1 if the multiplier ''a'' is chosen to be a primitive element of the integers modulo ''m''. The initial state must be chosen between 1 and ''m''−1. One disadvantage of a prime modulus is that the modular reduction requires a double-width product and an explicit reduction step. Often a prime just less than a power of 2 is used (the

Mersenne prime In mathematics, a Mersenne prime is a prime number that is one less than a power of two. That is, it is a prime number of the form for some integer . They are named after Marin Mersenne, a French Minim friar, who studied them in the early 1 ...

s 2³¹−1 and 2⁶¹−1 are popular), so that the reduction modulo ''m'' = 2^''e'' − ''d'' can be computed as (''ax'' mod 2^''e'') + ''d'' . This must be followed by a conditional subtraction of ''m'' if the result is too large, but the number of subtractions is limited to ''ad''/''m'', which can be easily limited to one if ''d'' is small. If a double-width product is unavailable, and the multiplier is chosen carefully, Schrage's method may be used. To do this, factor ''m'' = ''qa''+''r'', i.e. and . Then compute ''ax'' mod ''m'' = . Since ''x'' mod ''q'' < ''q'' ≤ ''m''/''a'', the first term is strictly less than ''am''/''a'' = ''m''. If ''a'' is chosen so that ''r'' ≤ ''q'' (and thus ''r''/''q'' ≤ 1), then the second term is also less than ''m'': ''r'' ≤ ''rx''/''q'' = ''x''(''r''/''q'') ≤ ''x'' < ''m''. Thus, both products can be computed with a single-width product, and the difference between them lies in the range −''m'', ''m''−1 so can be reduced to , ''m''−1with a single conditional add. The most expensive operation in Schrage's method is the division (with remainder) of ''x'' by ''q''; fast algorithms for division by a constant are not available since they also rely on double-width products. A second disadvantage of a prime modulus is that it is awkward to convert the value 1 ≤ ''x'' < ''m'' to uniform random bits. If a prime just less than a power of 2 is used, sometimes the missing values are simply ignored.

''m'' a power of 2, ''c'' = 0

Choosing ''m'' to be a

power of two A power of two is a number of the form where is an integer, that is, the result of exponentiation with number 2, two as the Base (exponentiation), base and integer as the exponent. In the fast-growing hierarchy, is exactly equal to f_1^ ...

, most often ''m'' = 2³² or ''m'' = 2⁶⁴, produces a particularly efficient LCG, because this allows the modulus operation to be computed by simply truncating the binary representation. In fact, the most significant bits are usually not computed at all. There are, however, disadvantages. This form has maximal period ''m''/4, achieved if ''a'' ≡ ±3 (mod 8) and the initial state ''X''₀ is odd. Even in this best case, the low three bits of ''X'' alternate between two values and thus only contribute one bit to the state. ''X'' is always odd (the lowest-order bit never changes), and only one of the next two bits ever changes. If ''a'' ≡ +3, ''X'' alternates ±1↔±3, while if ''a'' ≡ −3, ''X'' alternates ±1↔∓3 (all modulo 8). It can be shown that this form is equivalent to a generator with modulus ''m''/4 and ''c'' ≠ 0. A more serious issue with the use of a power-of-two modulus is that the low bits have a shorter period than the high bits. Its simplicity of implementation comes from the fact that bits are never affected by higher-order bits, so the low ''b'' bits of such a generator form a modulo-2^''b'' LCG by themselves, repeating with a period of 2^''b''−2. Only the most significant bit of ''X'' achieves the full period.

''m'' a power of 2, ''c'' ≠ 0

When ''c'' ≠ 0, correctly chosen parameters allow a period equal to ''m'', for all seed values. This will occur

if and only if In logic and related fields such as mathematics and philosophy, "if and only if" (often shortened as "iff") is paraphrased by the biconditional, a logical connective between statements. The biconditional is true in two cases, where either bo ...

: #

m

and

c

are coprime, #

a - 1

is divisible by all

prime factor A prime number (or a prime) is a natural number greater than 1 that is not a product of two smaller natural numbers. A natural number greater than 1 that is not prime is called a composite number. For example, 5 is prime because the only ways ...

s of

m

, #

a - 1

is divisible by 4 if

m

is divisible by 4. These three requirements are referred to as the Hull–Dobell Theorem. This form may be used with any ''m'', but only works well for ''m'' with many repeated prime factors, such as a power of 2; using a computer's word size is the most common choice. If ''m'' were a square-free integer, this would only allow ''a'' ≡ 1 (mod ''m''), which makes a very poor PRNG; a selection of possible full-period multipliers is only available when ''m'' has repeated prime factors. Although the Hull–Dobell theorem provides maximum period, it is not sufficient to guarantee a ''good'' generator. For example, it is desirable for ''a'' − 1 to not be any more divisible by prime factors of ''m'' than necessary. If ''m'' is a power of 2, then ''a'' − 1 should be divisible by 4 but not divisible by 8, i.e. ''a'' ≡ 5 (mod 8). Indeed, most multipliers produce a sequence which fails one test for non-randomness or another, and finding a multiplier which is satisfactory to all applicable criteria is quite challenging. The spectral test is one of the most important tests. Note that a power-of-2 modulus shares the problem as described above for ''c'' = 0: the low ''k'' bits form a generator with modulus 2^''k'' and thus repeat with a period of 2^''k''; only the most significant bit achieves the full period. If a pseudorandom number less than ''r'' is desired, is a much higher-quality result than ''X'' mod ''r''. Unfortunately, most programming languages make the latter much easier to write (X % r), so it is very commonly used. The generator is ''not'' sensitive to the choice of ''c'', as long as it is relatively prime to the modulus (e.g. if ''m'' is a power of 2, then ''c'' must be odd), so the value ''c''=1 is commonly chosen. The sequence produced by other choices of ''c'' can be written as a simple function of the sequence when ''c''=1. Specifically, if ''Y'' is the prototypical sequence defined by ''Y''₀ = 0 and ''Y''_''n''+1 = ''aY_n'' + 1 mod m, then a general sequence ''X''_''n''+1 = ''aX_n'' + ''c'' mod ''m'' can be written as an affine function of ''Y'': :

X_n = (X_0(a-1)+c)Y_n + X_0 = (X_1 - X_0)Y_n + X_0 \pmod m.

More generally, any two sequences ''X'' and ''Z'' with the same multiplier and modulus are related by :

= Y_n =  =  \pmod m.

In the common case where ''m'' is a power of 2 and ''a'' ≡ 5 (mod 8) (a desirable property for other reasons), it is always possible to find an initial value ''X''₀ so that the denominator ''X''₁ − ''X''₀ ≡ ±1 (mod ''m''), producing an even simpler relationship. With this choice of ''X''₀, ''X_n'' = ''X''₀ ± ''Y_n'' will remain true for all ''n''. The sign is determined by ''c'' ≡ ±1 (mod 4), and the constant ''X''₀ is determined by 1 ∓ ''c'' ≡ (1 − ''a'')''X''₀ (mod ''m''). As a simple example, consider the generators ''X''_''n''+1 = 157''X_n'' + 3 mod 256 and ''Y''_''n''+1 = 157''Y_n'' + 1 mod 256; i.e. ''m'' = 256, ''a'' = 157, and ''c'' = 3. Because 3 ≡ −1 (mod 4), we are searching for a solution to 1 + 3 ≡ (1 − 157)''X''₀ (mod 256). This is satisfied by ''X''₀ ≡ 41 (mod 64), so if we start with that, then ''X_n'' ≡ ''X''₀ − ''Y_n'' (mod 256) for all ''n''. For example, using ''X''₀ = 233 = 3×64 + 41: * ''X'' = 233, 232, 75, 2, 61, 108, ... * ''Y'' = 0, 1, 158, 231, 172, 125, ... * ''X'' + ''Y'' mod 256 = 233, 233, 233, 233, 233, 233, ...

Parameters in common use

The following table lists the parameters of LCGs in common use, including built-in ''rand()'' functions in runtime libraries of various

compiler In computing, a compiler is a computer program that Translator (computing), translates computer code written in one programming language (the ''source'' language) into another language (the ''target'' language). The name "compiler" is primaril ...

s. This table is to show popularity, not examples to emulate; ''many of these parameters are poor.'' Tables of good parameters are available. As shown above, LCGs do not always use all of the bits in the values they produce. In general, they return the most significant bits. For example, the

Java Java is one of the Greater Sunda Islands in Indonesia. It is bordered by the Indian Ocean to the south and the Java Sea (a part of Pacific Ocean) to the north. With a population of 156.9 million people (including Madura) in mid 2024, proje ...

implementation operates with 48-bit values at each iteration but returns only their 32 most significant bits. This is because the higher-order bits have longer periods than the lower-order bits (see below). LCGs that use this truncation technique produce statistically better values than those that do not. This is especially noticeable in scripts that use the mod operation to reduce range; modifying the random number mod 2 will lead to alternating 0 and 1 without truncation. Contrarily, some libraries use an implicit power-of-two modulus but never output or otherwise use the most significant bit, in order to limit the output to positive

two's complement Two's complement is the most common method of representing signed (positive, negative, and zero) integers on computers, and more generally, fixed point binary values. Two's complement uses the binary digit with the ''greatest'' value as the ''s ...

integers. The output is as if the modulus were one bit less than the internal word size, and such generators are described as such in the table above.

Advantages and disadvantages

LCGs are fast and require minimal memory (one modulo-''m'' number, often 32 or 64 bits) to retain state. This makes them valuable for simulating multiple independent streams. LCGs are not intended, and must not be used, for cryptographic applications; use a cryptographically secure pseudorandom number generator for such applications. Although LCGs have a few specific weaknesses, many of their flaws come from having too small a state. The fact that people have been lulled for so many years into using them with such small moduli can be seen as a testament to the strength of the technique. A LCG with large enough state can pass even stringent statistical tests; a modulo-2⁶⁴ LCG which returns the high 32 bits passes TestU01's SmallCrush suite, and a 96-bit LCG passes the most stringent BigCrush suite. For a specific example, an ideal random number generator with 32 bits of output is expected (by the Birthday theorem) to begin duplicating earlier outputs after results. ''Any'' PRNG whose output is its full, untruncated state will not produce duplicates until its full period elapses, an easily detectable statistical flaw. For related reasons, any PRNG should have a period longer than the square of the number of outputs required. Given modern computer speeds, this means a period of 2⁶⁴ for all but the least demanding applications, and longer for demanding simulations. One flaw specific to LCGs is that, if used to choose points in an n-dimensional space, the points will lie on, at most, hyperplanes ( Marsaglia's theorem, developed by George Marsaglia). This is due to serial correlation between successive values of the sequence ''X_n''. Carelessly chosen multipliers will usually have far fewer, widely spaced planes, which can lead to problems. The spectral test, which is a simple test of an LCG's quality, measures this spacing and allows a good multiplier to be chosen. The plane spacing depends both on the modulus and the multiplier. A large enough modulus can reduce this distance below the resolution of double precision numbers. The choice of the multiplier becomes less important when the modulus is large. It is still necessary to calculate the spectral index and make sure that the multiplier is not a bad one, but purely probabilistically it becomes extremely unlikely to encounter a bad multiplier when the modulus is larger than about 2⁶⁴. Another flaw specific to LCGs is the short period of the low-order bits when ''m'' is chosen to be a power of 2. This can be mitigated by using a modulus larger than the required output, and using the most significant bits of the state. Nevertheless, for some applications LCGs may be a good option. For instance, in an embedded system, the amount of memory available is often severely limited. Similarly, in an environment such as a

video game console A video game console is an electronic device that Input/output, outputs a video signal or image to display a video game that can typically be played with a game controller. These may be home video game console, home consoles, which are generally ...

taking a small number of high-order bits of an LCG may well suffice. (The low-order bits of LCGs when m is a power of 2 should never be relied on for any degree of randomness whatsoever.) The low order bits go through very short cycles. In particular, any full-cycle LCG, when m is a power of 2, will produce alternately odd and even results. LCGs should be evaluated very carefully for suitability in non-cryptographic applications where high-quality

randomness In common usage, randomness is the apparent or actual lack of definite pattern or predictability in information. A random sequence of events, symbols or steps often has no order and does not follow an intelligible pattern or combination. ...

is critical. For Monte Carlo simulations, an LCG must use a modulus greater and preferably much greater than the cube of the number of random samples which are required. This means, for example, that a (good) 32-bit LCG can be used to obtain about a thousand random numbers; a 64-bit LCG is good for about 2²¹ random samples (a little over two million), etc. For this reason, in practice LCGs are not suitable for large-scale Monte Carlo simulations.

Sample code

Python code

The following is an implementation of an LCG in Python, in the form of a generator: from collections.abc import Generator def lcg(modulus: int, a: int, c: int, seed: int) -> Generator nt, None, None """Linear congruential generator.""" while True: seed = (a * seed + c) % modulus yield seed

Haskell code

The following is an implementation of an LCG in

Haskell Haskell () is a general-purpose, statically typed, purely functional programming language with type inference and lazy evaluation. Designed for teaching, research, and industrial applications, Haskell pioneered several programming language ...

utilizing a

lazy evaluation In programming language theory, lazy evaluation, or call-by-need, is an evaluation strategy which delays the evaluation of an Expression (computer science), expression until its value is needed (non-strict evaluation) and which avoids repeated eva ...

strategy to generate an infinite stream of output values in a list: -- Allowing a generic choice for a, c, m and x_0 linearCongruentialGenerator :: Integer -> Integer -> Integer -> Integer -> ntegerlinearCongruentialGenerator a c modulus seed = lcgacmx0 where lcgacmx0 = seed : map (\x -> (a*x + c) `mod` modulus) lcgacmx0 -- Specific parameters can be easily specified (eg. Knuth's MMIX parameters): mmixLCG :: Integer -> ntegermmixLCG = linearCongruentialGenerator 6364136223846793005 1442695040888963407 (2^(64 ::Integer))

Free Pascal

Free Pascal uses a Mersenne Twister as its default pseudo random number generator whereas Delphi uses a LCG. Here is a Delphi compatible example in Free Pascal based on the information in the table above. Given the same RandSeed value it generates the same sequence of random numbers as Delphi. unit lcg_random; interface function LCGRandom: extended; overload; inline; function LCGRandom(const range:longint): longint; overload; inline; implementation function IM: cardinal; inline; begin RandSeed := RandSeed * 134775813 + 1; Result := RandSeed; end; function LCGRandom: extended; overload; inline; begin Result := IM * 2.32830643653870e-10; end; function LCGRandom(const range: longint): longint; overload; inline; begin Result := IM * range shr 32; end; Like all pseudorandom number generators, a LCG needs to store state and alter it each time it generates a new number. Multiple threads may access this state simultaneously causing a race condition. Implementations should use different state each with unique initialization for different threads to avoid equal sequences of random numbers on simultaneously executing threads.

LCG derivatives

There are several generators which are linear congruential generators in a different form, and thus the techniques used to analyze LCGs can be applied to them. One method of producing a longer period is to sum the outputs of several LCGs of different periods having a large least common multiple; the

Wichmann–Hill Wichmann–Hill is a pseudorandom number generator proposed in 1982 by Brian Wichmann and David Hill. It consists of three linear congruential generators with different prime moduli, each of which is used to produce a uniformly distributed numb ...

generator is an example of this form. (We would prefer them to be completely

, but a prime modulus implies an even period, so there must be a common factor of 2, at least.) This can be shown to be equivalent to a single LCG with a modulus equal to the product of the component LCG moduli. Marsaglia's add-with-carry and subtract-with-borrow PRNGs with a word size of ''b''=2^''w'' and lags ''r'' and ''s'' (''r'' > ''s'') are equivalent to LCGs with a modulus of ''b^r'' ± ''b^s'' ± 1. Multiply-with-carry PRNGs with a multiplier of ''a'' are equivalent to LCGs with a large prime modulus of ''ab^r''−1 and a power-of-2 multiplier ''b''. A permuted congruential generator begins with a power-of-2-modulus LCG and applies an output transformation to eliminate the short period problem in the low-order bits.

Comparison with other PRNGs

The other widely used primitive for obtaining long-period pseudorandom sequences is the linear-feedback shift register construction, which is based on arithmetic in GF(2) 'x'' the

polynomial ring In mathematics, especially in the field of algebra, a polynomial ring or polynomial algebra is a ring formed from the set of polynomials in one or more indeterminates (traditionally also called variables) with coefficients in another ring, ...

over

GF(2) (also denoted \mathbb F_2, or \mathbb Z/2\mathbb Z) is the finite field with two elements. is the Field (mathematics), field with the smallest possible number of elements, and is unique if the additive identity and the multiplicative identity ...

. Rather than integer addition and multiplication, the basic operations are exclusive-or and carry-less multiplication, which is usually implemented as a sequence of logical shifts. These have the advantage that all of their bits are full-period; they do not suffer from the weakness in the low-order bits that plagues arithmetic modulo 2^''k''. Examples of this family include xorshift generators and the Mersenne twister. The latter provides a very long period (2¹⁹⁹³⁷−1) and variate uniformity, but it fails some statistical tests.

Lagged Fibonacci generator A Lagged Fibonacci generator (LFG or sometimes LFib) is an example of a pseudorandom number generator. This class of random number generator is aimed at being an improvement on the 'standard' linear congruential generator. These are based on a gener ...

s also fall into this category; although they use arithmetic addition, their period is ensured by an LFSR among the least-significant bits. It is easy to detect the structure of a linear-feedback shift register with appropriate tests such as the linear complexity test implemented in the TestU01 suite; a Boolean circulant matrix initialized from consecutive bits of an LFSR will never have rank greater than the degree of the polynomial. Adding a non-linear output mixing function (as in the xoshiro256** and permuted congruential generator constructions) can greatly improve the performance on statistical tests. Another structure for a PRNG is a very simple recurrence function combined with a powerful output mixing function. This includes counter mode block ciphers and non-cryptographic generators such a
SplitMix64
A structure similar to LCGs, but ''not'' equivalent, is the multiple-recursive generator: ''X_n'' = (''a''₁''X''_''n''−1 + ''a''₂''X''_''n''−2 + ··· + ''a_kX''_{''n''−''k''}) mod ''m'' for ''k'' ≥ 2. With a prime modulus, this can generate periods up to ''m^k''−1, so is a useful extension of the LCG structure to larger periods. A powerful technique for generating high-quality pseudorandom numbers is to combine two or more PRNGs of different structure; the sum of an LFSR and an LCG (as in the

KISS A kiss is the touching or pressing of one's lips against another person, animal or object. Cultural connotations of kissing vary widely; depending on the culture and context, a kiss can express sentiments of love, passion, romance, sex ...

or xorwow constructions) can do very well at some cost in speed.

Notes

References

* *Gentle, James E., (2003). ''Random Number Generation and Monte Carlo Methods'', 2nd edition, Springer, . * (in this paper, efficient algorithms are given for inferring sequences produced by certain pseudo-random number generators).

External links

* The simulatio
Linear Congruential Generator
visualizes the correlations between the pseudo-random numbers when manipulating the parameters.

* ttps://web.archive.org/web/20090108194540/http://www.math.niu.edu/~rusin/known-math/99/LCG Linear Congruential Generators post to sci.math
The "Death of Art" computer art project at Goldstein Technologies LLC, uses an LCG to generate 33,554,432 images
* P. L'Ecuyer and R. Simard
"TestU01: A C Library for Empirical Testing of Random Number Generators"
May 2006, revised November 2006, ''ACM Transactions on Mathematical Software'', 33, 4, Article 22, August 2007.
Article about another way of cracking LCG
{{DEFAULTSORT:Linear Congruential Generator Pseudorandom number generators Modular arithmetic Articles with example Python (programming language) code