In
probability theory, a log-normal (or lognormal) distribution is a continuous
probability distribution
In probability theory and statistics, a probability distribution is the mathematical function that gives the probabilities of occurrence of different possible outcomes for an experiment. It is a mathematical description of a random phenomenon i ...
of a
random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a mathematical formalization of a quantity or object which depends on random events. It is a mapping or a function from possible outcomes (e.g., the po ...
whose
logarithm is
normally distributed. Thus, if the random variable is log-normally distributed, then has a normal distribution.
Equivalently, if has a normal distribution, then the
exponential function of , , has a log-normal distribution. A random variable which is log-normally distributed takes only positive real values. It is a convenient and useful model for measurements in exact and
engineering sciences, as well as
medicine,
economics and other topics (e.g., energies, concentrations, lengths, prices of financial instruments, and other metrics).
The distribution is occasionally referred to as the Galton distribution or Galton's distribution, after
Francis Galton
Sir Francis Galton, FRS FRAI (; 16 February 1822 – 17 January 1911), was an English Victorian era polymath: a statistician, sociologist, psychologist, anthropologist, tropical explorer, geographer, inventor, meteorologist, proto- ...
.
[ The log-normal distribution has also been associated with other names, such as McAlister, Gibrat and Cobb–Douglas.][
A log-normal process is the statistical realization of the multiplicative product of many independent ]random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a mathematical formalization of a quantity or object which depends on random events. It is a mapping or a function from possible outcomes (e.g., the po ...
s, each of which is positive. This is justified by considering the central limit theorem in the log domain (sometimes called Gibrat's law). The log-normal distribution is the maximum entropy probability distribution for a random variate —for which the mean and variance of are specified.
Definitions
Generation and parameters
Let be a standard normal variable, and let and be two real numbers. Then, the distribution of the random variable
:
is called the log-normal distribution with parameters and . These are the expected value
In probability theory, the expected value (also called expectation, expectancy, mathematical expectation, mean, average, or first moment) is a generalization of the weighted average. Informally, the expected value is the arithmetic mean of a l ...
(or mean) and standard deviation
In statistics, the standard deviation is a measure of the amount of variation or dispersion of a set of values. A low standard deviation indicates that the values tend to be close to the mean (also called the expected value) of the set, while ...
of the variable's natural logarithm, not the expectation and standard deviation of itself.
This relationship is true regardless of the base of the logarithmic or exponential function: if is normally distributed, then so is for any two positive numbers . Likewise, if is log-normally distributed, then so is , where .
In order to produce a distribution with desired mean and variance , one uses
and
Alternatively, the "multiplicative" or "geometric" parameters and can be used. They have a more direct interpretation: is the median of the distribution, and is useful for determining "scatter" intervals, see below.
Probability density function
A positive random variable ''X'' is log-normally distributed (i.e., ), if the natural logarithm of ''X'' is normally distributed with mean and variance :
:
Let and be respectively the cumulative probability distribution function and the probability density function of the ''N''(0,1) distribution, then we have that
:
Cumulative distribution function
The cumulative distribution function
In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x.
Ev ...
is
:
where is the cumulative distribution function of the standard normal distribution (i.e., ''N''(0,1)).
This may also be expressed as follows:
:
where erfc is the complementary error function.
Multivariate log-normal
If is a multivariate normal distribution, then has a multivariate log-normal distribution. The exponential is applied elementwise to the random vector . The mean of is
:
and its covariance matrix
In probability theory and statistics, a covariance matrix (also known as auto-covariance matrix, dispersion matrix, variance matrix, or variance–covariance matrix) is a square matrix giving the covariance between each pair of elements of ...
is
:
Since the multivariate log-normal distribution is not widely used, the rest of this entry only deals with the univariate distribution.
Characteristic function and moment generating function
All moments of the log-normal distribution exist and
:
This can be derived by letting within the integral. However, the log-normal distribution is not determined by its moments. This implies that it cannot have a defined moment generating function in a neighborhood of zero. Indeed, the expected value