Log-logistic Distribution
   HOME

TheInfoList



OR:

In
probability Probability is a branch of mathematics and statistics concerning events and numerical descriptions of how likely they are to occur. The probability of an event is a number between 0 and 1; the larger the probability, the more likely an e ...
and
statistics Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, the log-logistic distribution (known as the Fisk distribution in
economics Economics () is a behavioral science that studies the Production (economics), production, distribution (economics), distribution, and Consumption (economics), consumption of goods and services. Economics focuses on the behaviour and interac ...
) is a
continuous probability distribution In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...
for a non-negative
random variable A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...
. It is used in
survival analysis Survival analysis is a branch of statistics for analyzing the expected duration of time until one event occurs, such as death in biological organisms and failure in mechanical systems. This topic is called reliability theory, reliability analysis ...
as a
parametric model In statistics, a parametric model or parametric family or finite-dimensional model is a particular class of statistical models. Specifically, a parametric model is a family of probability distributions that has a finite number of parameters. Defi ...
for events whose rate increases initially and decreases later, as, for example,
mortality rate Mortality rate, or death rate, is a measure of the number of deaths (in general, or due to a specific cause) in a particular Statistical population, population, scaled to the size of that population, per unit of time. Mortality rate is typically ...
from cancer following diagnosis or treatment. It has also been used in
hydrology Hydrology () is the scientific study of the movement, distribution, and management of water on Earth and other planets, including the water cycle, water resources, and drainage basin sustainability. A practitioner of hydrology is called a hydro ...
to model stream flow and
precipitation In meteorology, precipitation is any product of the condensation of atmospheric water vapor that falls from clouds due to gravitational pull. The main forms of precipitation include drizzle, rain, rain and snow mixed ("sleet" in Commonwe ...
, in
economics Economics () is a behavioral science that studies the Production (economics), production, distribution (economics), distribution, and Consumption (economics), consumption of goods and services. Economics focuses on the behaviour and interac ...
as a simple model of the
distribution of wealth The distribution of wealth is a comparison of the wealth of various members or groups in a society. It shows one aspect of economic inequality or heterogeneity in economics, economic heterogeneity. The distribution of wealth differs from the i ...
or
income Income is the consumption and saving opportunity gained by an entity within a specified timeframe, which is generally expressed in monetary terms. Income is difficult to define conceptually and the definition may be different across fields. F ...
, and in networking to model the transmission times of data considering both the network and the software. The log-logistic distribution is the probability distribution of a
random variable A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...
whose
logarithm In mathematics, the logarithm of a number is the exponent by which another fixed value, the base, must be raised to produce that number. For example, the logarithm of to base is , because is to the rd power: . More generally, if , the ...
has a
logistic distribution In probability theory and statistics, the logistic distribution is a continuous probability distribution. Its cumulative distribution function is the logistic function, which appears in logistic regression and feedforward neural networks. It rese ...
. It is similar in shape to the
log-normal distribution In probability theory, a log-normal (or lognormal) distribution is a continuous probability distribution of a random variable whose logarithm is normal distribution, normally distributed. Thus, if the random variable is log-normally distributed ...
but has heavier tails. Unlike the log-normal, its
cumulative distribution function In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x. Ever ...
can be written in closed form.


Characterization

There are several different parameterizations of the distribution in use. The one shown here gives reasonably interpretable parameters and a simple form for the
cumulative distribution function In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x. Ever ...
. The parameter \alpha>0 is a
scale parameter In probability theory and statistics, a scale parameter is a special kind of numerical parameter of a parametric family of probability distributions. The larger the scale parameter, the more spread out the distribution. Definition If a family ...
and is also the
median The median of a set of numbers is the value separating the higher half from the lower half of a Sample (statistics), data sample, a statistical population, population, or a probability distribution. For a data set, it may be thought of as the “ ...
of the distribution. The parameter \beta>0 is a
shape parameter In probability theory and statistics, a shape parameter (also known as form parameter) is a kind of numerical parameter of a parametric family of probability distributionsEveritt B.S. (2002) Cambridge Dictionary of Statistics. 2nd Edition. CUP. th ...
. The distribution is
unimodal In mathematics, unimodality means possessing a unique mode. More generally, unimodality means there is only a single highest value, somehow defined, of some mathematical object. Unimodal probability distribution In statistics, a unimodal p ...
when \beta>1 and its dispersion decreases as \beta increases. The
cumulative distribution function In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x. Ever ...
is :\begin F(x; \alpha, \beta) & = \\ pt & = \\ pt & = \end where x>0, \alpha>0, \beta>0. The
probability density function In probability theory, a probability density function (PDF), density function, or density of an absolutely continuous random variable, is a Function (mathematics), function whose value at any given sample (or point) in the sample space (the s ...
is :f(x; \alpha, \beta) = \frac


Alternative parameterization

An alternative parametrization is given by the pair \mu, s in analogy with the logistic distribution: : \mu = \ln (\alpha) : s = 1 / \beta


Properties


Moments

The kth raw moment exists only when k<\beta, when it is given by :\begin \operatorname(X^k) & = \alpha^k\operatorname(1-k/\beta, 1+k/\beta) \\ pt& = \alpha^k\, \end where B is the
beta function In mathematics, the beta function, also called the Euler integral of the first kind, is a special function that is closely related to the gamma function and to binomial coefficients. It is defined by the integral : \Beta(z_1,z_2) = \int_0^1 t^ ...
. Expressions for the
mean A mean is a quantity representing the "center" of a collection of numbers and is intermediate to the extreme values of the set of numbers. There are several kinds of means (or "measures of central tendency") in mathematics, especially in statist ...
,
variance In probability theory and statistics, variance is the expected value of the squared deviation from the mean of a random variable. The standard deviation (SD) is obtained as the square root of the variance. Variance is a measure of dispersion ...
,
skewness In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable about its mean. The skewness value can be positive, zero, negative, or undefined. For a unimodal ...
and
kurtosis In probability theory and statistics, kurtosis (from , ''kyrtos'' or ''kurtos'', meaning "curved, arching") refers to the degree of “tailedness” in the probability distribution of a real-valued random variable. Similar to skewness, kurtos ...
can be derived from this. Writing b=\pi/\beta for convenience, the mean is : \operatorname(X) = \alpha b / \sin b , \quad \beta>1, and the variance is : \operatorname(X) = \alpha^2 \left( 2b / \sin 2b -b^2 / \sin^2 b \right), \quad \beta>2. Explicit expressions for the skewness and kurtosis are lengthy. As \beta tends to infinity the mean tends to \alpha, the variance and skewness tend to zero and the excess kurtosis tends to 6/5 (see also related distributions below).


Quantiles

The
quantile function In probability and statistics, the quantile function is a function Q: ,1\mapsto \mathbb which maps some probability x \in ,1/math> of a random variable v to the value of the variable y such that P(v\leq y) = x according to its probability distr ...
(inverse cumulative distribution function) is : :F^(p;\alpha, \beta) = \alpha\left( \frac \right)^. It follows that the
median The median of a set of numbers is the value separating the higher half from the lower half of a Sample (statistics), data sample, a statistical population, population, or a probability distribution. For a data set, it may be thought of as the “ ...
is \alpha, the lower
quartile In statistics, quartiles are a type of quantiles which divide the number of data points into four parts, or ''quarters'', of more-or-less equal size. The data must be ordered from smallest to largest to compute quartiles; as such, quartiles are ...
is 3^ \alpha and the upper quartile is 3^ \alpha.


Applications


Survival analysis

The log-logistic distribution provides one
parametric model In statistics, a parametric model or parametric family or finite-dimensional model is a particular class of statistical models. Specifically, a parametric model is a family of probability distributions that has a finite number of parameters. Defi ...
for
survival analysis Survival analysis is a branch of statistics for analyzing the expected duration of time until one event occurs, such as death in biological organisms and failure in mechanical systems. This topic is called reliability theory, reliability analysis ...
. Unlike the more commonly used Weibull distribution, it can have a non-
monotonic In mathematics, a monotonic function (or monotone function) is a function between ordered sets that preserves or reverses the given order. This concept first arose in calculus, and was later generalized to the more abstract setting of ord ...
hazard function A hazard is a potential source of harm. Substances, events, or circumstances can constitute hazards when their nature would potentially allow them to cause damage to health, life, property, or any other interest of value. The probability of that ...
: when \beta>1, the hazard function is
unimodal In mathematics, unimodality means possessing a unique mode. More generally, unimodality means there is only a single highest value, somehow defined, of some mathematical object. Unimodal probability distribution In statistics, a unimodal p ...
(when \beta ≤ 1, the hazard decreases monotonically). The fact that the cumulative distribution function can be written in closed form is particularly useful for analysis of survival data with censoring. The log-logistic distribution can be used as the basis of an accelerated failure time model by allowing \alpha to differ between groups, or more generally by introducing covariates that affect \alpha but not \beta by modelling \log(\alpha) as a linear function of the covariates. The
survival function The survival function is a function that gives the probability that a patient, device, or other object of interest will survive past a certain time. The survival function is also known as the survivor function or reliability function. The term ...
is :S(t) = 1 - F(t) = +(t/\alpha)^,\, and so the
hazard function A hazard is a potential source of harm. Substances, events, or circumstances can constitute hazards when their nature would potentially allow them to cause damage to health, life, property, or any other interest of value. The probability of that ...
is : h(t) = \frac = \frac . The log-logistic distribution with shape parameter \beta = 1 is the marginal distribution of the inter-times in a geometric-distributed
counting process A counting process is a stochastic process In probability theory and related fields, a stochastic () or random process is a mathematical object usually defined as a family of random variables in a probability space, where the index of the famil ...
.


Hydrology

The log-logistic distribution has been used in hydrology for modelling stream flow rates and precipitation. Extreme values like maximum one-day rainfall and river discharge per month or per year often follow a
log-normal distribution In probability theory, a log-normal (or lognormal) distribution is a continuous probability distribution of a random variable whose logarithm is normal distribution, normally distributed. Thus, if the random variable is log-normally distributed ...
. The log-normal distribution, however, needs a numeric approximation. As the log-logistic distribution, which can be solved analytically, is similar to the log-normal distribution, it can be used instead. The blue picture illustrates an example of fitting the log-logistic distribution to ranked maximum one-day October rainfalls and it shows the 90% confidence belt based on the
binomial distribution In probability theory and statistics, the binomial distribution with parameters and is the discrete probability distribution of the number of successes in a sequence of statistical independence, independent experiment (probability theory) ...
. The rainfall data are represented by the plotting position ''r''/(''n''+1) as part of the cumulative frequency analysis.


Economics

The log-logistic has been used as a simple model of the
distribution of wealth The distribution of wealth is a comparison of the wealth of various members or groups in a society. It shows one aspect of economic inequality or heterogeneity in economics, economic heterogeneity. The distribution of wealth differs from the i ...
or
income Income is the consumption and saving opportunity gained by an entity within a specified timeframe, which is generally expressed in monetary terms. Income is difficult to define conceptually and the definition may be different across fields. F ...
in
economics Economics () is a behavioral science that studies the Production (economics), production, distribution (economics), distribution, and Consumption (economics), consumption of goods and services. Economics focuses on the behaviour and interac ...
, where it is known as the Fisk distribution. Its
Gini coefficient In economics, the Gini coefficient ( ), also known as the Gini index or Gini ratio, is a measure of statistical dispersion intended to represent the income distribution, income inequality, the wealth distribution, wealth inequality, or the ...
is 1/\beta. The Gini coefficient for a continuous probability distribution takes the form: :G = \int_^F(1-F)dx where F is the CDF of the distribution and \mu is the expected value. For the log-logistic distribution, the formula for the Gini coefficient becomes: :G = \int_^ Defining the substitution z = x/\alpha leads to the simpler equation: :G = \int_^ And making the substitution u = 1/(1 + z^) further simplifies the Gini coefficient formula to: :G = \int_^u^(1-u)^du The integral component is equivalent to the standard
beta function In mathematics, the beta function, also called the Euler integral of the first kind, is a special function that is closely related to the gamma function and to binomial coefficients. It is defined by the integral : \Beta(z_1,z_2) = \int_0^1 t^ ...
\text(1-1/\beta,1+1/\beta). The beta function may also be written as: :\text(x,y) = where \Gamma(\cdot) is the
gamma function In mathematics, the gamma function (represented by Γ, capital Greek alphabet, Greek letter gamma) is the most common extension of the factorial function to complex numbers. Derived by Daniel Bernoulli, the gamma function \Gamma(z) is defined ...
. Using the properties of the gamma function, it can be shown that: :\text(1-1/\beta,1+1/\beta) = \Gamma(1-1/\beta)\Gamma(1/\beta) From
Euler's reflection formula In mathematics, a reflection formula or reflection relation for a function is a relationship between and . It is a special case of a functional equation. It is common in mathematical literature to use the term "functional equation" for what a ...
, the expression can be simplified further: :\text(1-1/\beta,1+1/\beta) = Finally, we may conclude that the Gini coefficient for the log-logistic distribution G = 1/\beta.


Networking

The log-logistic has been used as a model for the period of time beginning when some data leaves a software user application in a computer and the response is received by the same application after travelling through and being processed by other computers, applications, and network segments, most or all of them without hard
real-time Real-time, realtime, or real time may refer to: Computing * Real-time computing, hardware and software systems subject to a specified time constraint * Real-time clock, a computer clock that keeps track of the current time * Real-time Control Syst ...
guarantees (for example, when an application is displaying data coming from a remote
sensor A sensor is often defined as a device that receives and responds to a signal or stimulus. The stimulus is the quantity, property, or condition that is sensed and converted into electrical signal. In the broadest definition, a sensor is a devi ...
connected to the Internet). It has been shown to be a more accurate probabilistic model for that than the
log-normal distribution In probability theory, a log-normal (or lognormal) distribution is a continuous probability distribution of a random variable whose logarithm is normal distribution, normally distributed. Thus, if the random variable is log-normally distributed ...
or others, as long as abrupt changes of regime in the sequences of those times are properly detected.


Related distributions

* If X \sim \operatorname(\alpha,\beta) then kX \sim \operatorname(k \alpha, \beta). * If X \sim \operatorname(\alpha, \beta) then X^k \sim \operatorname(\alpha^k, \beta/, k, ). * \operatorname(\alpha,\beta) \sim \textrm(1,\alpha,\beta) ( Dagum distribution). * \operatorname(\alpha,\beta) \sim \textrm(1,\alpha,\beta) ( Singh–Maddala distribution). * \textrm(\gamma,\sigma) \sim \beta'(1,1,\gamma,\sigma) (
Beta prime distribution In probability theory and statistics, the beta prime distribution (also known as inverted beta distribution or beta distribution of the second kindJohnson et al (1995), p 248) is an absolutely continuous probability distribution. If p\in ,1/math ...
). *If ''X'' has a log-logistic distribution with scale parameter \alpha and shape parameter \beta then ''Y'' = log(''X'') has a
logistic distribution In probability theory and statistics, the logistic distribution is a continuous probability distribution. Its cumulative distribution function is the logistic function, which appears in logistic regression and feedforward neural networks. It rese ...
with location parameter \log(\alpha) and scale parameter 1/\beta. *As the shape parameter \beta of the log-logistic distribution increases, its shape increasingly resembles that of a (very narrow)
logistic distribution In probability theory and statistics, the logistic distribution is a continuous probability distribution. Its cumulative distribution function is the logistic function, which appears in logistic regression and feedforward neural networks. It rese ...
. Informally: ::\operatorname(\alpha, \beta) \to L(\alpha,\alpha/\beta) \quad \text \quad \beta \to \infty. *The log-logistic distribution with shape parameter \beta=1 and scale parameter \alpha is the same as the generalized Pareto distribution with location parameter \mu=0, shape parameter \xi=1 and scale parameter \alpha: ::\operatorname(\alpha,1) = \operatorname(1,\alpha,1). *The addition of another parameter (a shift parameter) formally results in a shifted log-logistic distribution, but this is usually considered in a different parameterization so that the distribution can be bounded above or bounded below.


Generalizations

Several different distributions are sometimes referred to as the generalized log-logistic distribution, as they contain the log-logistic as a special case. These include the Burr Type XII distribution (also known as the ''Singh–Maddala distribution'') and the Dagum distribution, both of which include a second shape parameter. Both are in turn special cases of the even more general ''generalized beta distribution of the second kind''. Another more straightforward generalization of the log-logistic is the shifted log-logistic distribution. Another generalized log-logistic distribution is the log-transform of the metalog distribution, in which power series expansions in terms of p are substituted for
logistic distribution In probability theory and statistics, the logistic distribution is a continuous probability distribution. Its cumulative distribution function is the logistic function, which appears in logistic regression and feedforward neural networks. It rese ...
parameters \mu and \sigma. The resulting log-metalog distribution is highly shape flexible, has simple closed form
PDF Portable document format (PDF), standardized as ISO 32000, is a file format developed by Adobe Inc., Adobe in 1992 to present documents, including text formatting and images, in a manner independent of application software, computer hardware, ...
and
quantile function In probability and statistics, the quantile function is a function Q: ,1\mapsto \mathbb which maps some probability x \in ,1/math> of a random variable v to the value of the variable y such that P(v\leq y) = x according to its probability distr ...
, can be fit to data with linear least squares, and subsumes the log-logistic distribution is special case.


See also

* Probability distributions: List of important distributions supported on semi-infinite intervals


References

{{DEFAULTSORT:Log-Logistic Distribution Continuous distributions Survival analysis Probability distributions with non-finite variance Economic inequality