Zeta Distribution
   HOME

TheInfoList



OR:

In
probability theory Probability theory or probability calculus is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expre ...
and
statistics Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, the zeta distribution is a discrete
probability distribution In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...
. If ''X'' is a zeta-distributed
random variable A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...
with parameter ''s'', then the probability that ''X'' takes the positive integer value ''k'' is given by the
probability mass function In probability and statistics, a probability mass function (sometimes called ''probability function'' or ''frequency function'') is a function that gives the probability that a discrete random variable is exactly equal to some value. Sometimes i ...
:f_s(k) = \frac where ''ζ''(''s'') is the
Riemann zeta function The Riemann zeta function or Euler–Riemann zeta function, denoted by the Greek letter (zeta), is a mathematical function of a complex variable defined as \zeta(s) = \sum_^\infty \frac = \frac + \frac + \frac + \cdots for and its analytic c ...
(which is undefined for ''s'' = 1). The multiplicities of distinct
prime factor A prime number (or a prime) is a natural number greater than 1 that is not a product of two smaller natural numbers. A natural number greater than 1 that is not prime is called a composite number. For example, 5 is prime because the only ways ...
s of ''X'' are
independent Independent or Independents may refer to: Arts, entertainment, and media Artist groups * Independents (artist group), a group of modernist painters based in Pennsylvania, United States * Independentes (English: Independents), a Portuguese artist ...
random variable A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...
s. The
Riemann zeta function The Riemann zeta function or Euler–Riemann zeta function, denoted by the Greek letter (zeta), is a mathematical function of a complex variable defined as \zeta(s) = \sum_^\infty \frac = \frac + \frac + \frac + \cdots for and its analytic c ...
being the sum of all terms k^ for positive integer ''k'', it appears thus as the normalization of the
Zipf distribution Zipf's law (; ) is an empirical law stating that when a list of measured values is sorted in decreasing order, the value of the -th entry is often approximately inversely proportional to . The best known instance of Zipf's law applies to the ...
. The terms "Zipf distribution" and "zeta distribution" are often used interchangeably. But while the Zeta distribution is a
probability distribution In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...
by itself, it is not associated with
Zipf's law Zipf's law (; ) is an empirical law stating that when a list of measured values is sorted in decreasing order, the value of the -th entry is often approximately inversely proportional to . The best known instance of Zipf's law applies to the ...
with the same exponent.


Definition

The Zeta distribution is defined for positive integers k \geq 1, and its probability mass function is given by : P(x=k) = \frac 1 k^, where s>1 is the parameter, and \zeta(s) is the
Riemann zeta function The Riemann zeta function or Euler–Riemann zeta function, denoted by the Greek letter (zeta), is a mathematical function of a complex variable defined as \zeta(s) = \sum_^\infty \frac = \frac + \frac + \frac + \cdots for and its analytic c ...
. The cumulative distribution function is given by : P(x \leq k) = \frac, where H_ is the generalized
harmonic number In mathematics, the -th harmonic number is the sum of the reciprocals of the first natural numbers: H_n= 1+\frac+\frac+\cdots+\frac =\sum_^n \frac. Starting from , the sequence of harmonic numbers begins: 1, \frac, \frac, \frac, \frac, \dot ...
: H_ = \sum_^k \frac 1 .


Moments

The ''n''th raw moment is defined as the expected value of ''X''''n'': :m_n = E(X^n) = \frac\sum_^\infty \frac The series on the right is just a series representation of the Riemann zeta function, but it only converges for values of s-n that are greater than unity. Thus: :m_n = \begin \zeta(s-n)/\zeta(s) & \text n < s-1 \\ \infty & \text n \ge s-1 \end The ratio of the zeta functions is well-defined, even for ''n'' > ''s'' − 1 because the series representation of the zeta function can be analytically continued. This does not change the fact that the moments are specified by the series itself, and are therefore undefined for large ''n''.


Moment generating function

The
moment generating function In probability theory and statistics, the moment-generating function of a real-valued random variable is an alternative specification of its probability distribution. Thus, it provides the basis of an alternative route to analytical results compare ...
is defined as :M(t;s) = E(e^) = \frac \sum_^\infty \frac. The series is just the definition of the
polylogarithm In mathematics, the polylogarithm (also known as Jonquière's function, for Alfred Jonquière) is a special function of order and argument . Only for special values of does the polylogarithm reduce to an elementary function such as the natur ...
, valid for e^t<1 so that :M(t;s) = \frac\textt<0. Since this does not converge on an open interval containing t=0, the moment generating function does not exist.


The case ''s'' = 1

''ζ''(1) is infinite as the harmonic series, and so the case when ''s'' = 1 is not meaningful. However, if ''A'' is any set of positive integers that has a density, i.e. if :\lim_\frac exists where ''N''(''A'', ''n'') is the number of members of ''A'' less than or equal to ''n'', then :\lim_P(X\in A)\, is equal to that density. The latter limit can also exist in some cases in which ''A'' does not have a density. For example, if ''A'' is the set of all positive integers whose first digit is ''d'', then ''A'' has no density, but nonetheless, the second limit given above exists and is proportional to :\log(d+1) - \log(d) = \log\left(1+\frac\right),\, which is Benford's law.


Infinite divisibility

The Zeta distribution can be constructed with a sequence of independent random variables with a
geometric distribution In probability theory and statistics, the geometric distribution is either one of two discrete probability distributions: * The probability distribution of the number X of Bernoulli trials needed to get one success, supported on \mathbb = \; * T ...
. Let p be a
prime number A prime number (or a prime) is a natural number greater than 1 that is not a Product (mathematics), product of two smaller natural numbers. A natural number greater than 1 that is not prime is called a composite number. For example, 5 is prime ...
and X(p^) be a random variable with a geometric distribution of parameter p^, namely \quad\quad\quad \mathbb\left( X(p^) = k \right) = p^ (1 - p^ ) If the random variables ( X(p^) )_ are independent, then, the random variable Z_s defined by \quad\quad\quad Z_s = \prod_ p^ has the zeta distribution: \mathbb\left( Z_s = n \right) = \frac. Stated differently, the random variable \log(Z_s) = \sum_ X(p^) \, \log(p) is
infinitely divisible Infinite divisibility arises in different ways in philosophy, physics, economics, order theory (a branch of mathematics), and probability theory (also a branch of mathematics). One may speak of infinite divisibility, or the lack thereof, of matter ...
with
Lévy measure Levy, Lévy or Levies may refer to: People * Levy (surname), people with the surname Levy or Lévy * Levy Adcock (born 1988), American football player * Levy Barent Cohen (1747–1808), Dutch-born British financier and community worker * Lev ...
given by the following sum of Dirac masses: \quad\quad\quad \Pi_s(dx) = \sum_ \sum_ \frac \delta_(dx)


See also

Other "power-law" distributions *
Cauchy distribution The Cauchy distribution, named after Augustin-Louis Cauchy, is a continuous probability distribution. It is also known, especially among physicists, as the Lorentz distribution (after Hendrik Lorentz), Cauchy–Lorentz distribution, Lorentz(ian) ...
*
Lévy distribution In probability theory and statistics, the Lévy distribution, named after Paul Lévy, is a continuous probability distribution for a non-negative random variable. In spectroscopy, this distribution, with frequency as the dependent variable, is k ...
* Lévy skew alpha-stable distribution *
Pareto distribution The Pareto distribution, named after the Italian civil engineer, economist, and sociologist Vilfredo Pareto, is a power-law probability distribution that is used in description of social, quality control, scientific, geophysical, actuarial scien ...
*
Zipf's law Zipf's law (; ) is an empirical law stating that when a list of measured values is sorted in decreasing order, the value of the -th entry is often approximately inversely proportional to . The best known instance of Zipf's law applies to the ...
* Zipf–Mandelbrot law *
Infinitely divisible distribution Infinity is something which is boundless, endless, or larger than any natural number. It is denoted by \infty, called the infinity symbol. From the time of the ancient Greeks, the philosophical nature of infinity has been the subject of m ...
*
Yule–Simon distribution In probability and statistics, the Yule–Simon distribution is a discrete probability distribution named after Udny Yule and Herbert A. Simon. Simon originally called it the ''Yule distribution''. The probability mass function (pmf) of the Yule ...


External links

* What Gut calls the "Riemann zeta distribution" is actually the probability distribution of −log ''X'', where ''X'' is a random variable with what this article calls the zeta distribution. * {{DEFAULTSORT:Zeta Distribution Discrete distributions Computational linguistics Probability distributions with non-finite variance