In
probability
Probability is the branch of mathematics concerning numerical descriptions of how likely an Event (probability theory), event is to occur, or how likely it is that a proposition is true. The probability of an event is a number between 0 and ...
and
statistics
Statistics (from German language, German: ''wikt:Statistik#German, Statistik'', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of ...
, the log-logistic distribution (known as the Fisk distribution in
economics
Economics () is the social science that studies the production, distribution, and consumption of goods and services.
Economics focuses on the behaviour and interactions of economic agents and how economies work. Microeconomics analy ...
) is a
continuous probability distribution for a non-negative
random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a mathematical formalization of a quantity or object which depends on random events. It is a mapping or a function from possible outcomes (e.g., the po ...
. It is used in
survival analysis as a
parametric model
In statistics, a parametric model or parametric family or finite-dimensional model is a particular class of statistical models. Specifically, a parametric model is a family of probability distributions that has a finite number of parameters.
Def ...
for events whose rate increases initially and decreases later, as, for example,
mortality rate from cancer following diagnosis or treatment. It has also been used in
hydrology to model stream flow and
precipitation, in
economics
Economics () is the social science that studies the production, distribution, and consumption of goods and services.
Economics focuses on the behaviour and interactions of economic agents and how economies work. Microeconomics analy ...
as a simple model of the
distribution of wealth or
income, and in
networking
Network, networking and networked may refer to:
Science and technology
* Network theory, the study of graphs as a representation of relations between discrete objects
* Network science, an academic field that studies complex networks
Mathematics
...
to model the transmission times of data considering both the network and the software.
The log-logistic distribution is the probability distribution of a
random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a mathematical formalization of a quantity or object which depends on random events. It is a mapping or a function from possible outcomes (e.g., the po ...
whose
logarithm has a
logistic distribution.
It is similar in shape to the
log-normal distribution but has
heavier tails. Unlike the log-normal, its
cumulative distribution function
In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x.
Ev ...
can be written in
closed form.
Characterization
There are several different parameterizations of the distribution in use. The one shown here gives reasonably interpretable parameters and a simple form for the
cumulative distribution function
In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x.
Ev ...
.
[
][
]
The parameter
is a
scale parameter and is also the
median
In statistics and probability theory, the median is the value separating the higher half from the lower half of a data sample, a population, or a probability distribution. For a data set, it may be thought of as "the middle" value. The basic fe ...
of the distribution. The parameter
is a
shape parameter. The distribution is
unimodal when
and its
dispersion decreases as
increases.
The
cumulative distribution function
In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x.
Ev ...
is
:
where
,
,
The
probability density function is
:
Alternative parameterization
An alternative parametrization is given by the pair
in analogy with the logistic distribution:
:
:
Properties
Moments
The
th raw
moment
Moment or Moments may refer to:
* Present time
Music
* The Moments, American R&B vocal group Albums
* ''Moment'' (Dark Tranquillity album), 2020
* ''Moment'' (Speed album), 1998
* ''Moments'' (Darude album)
* ''Moments'' (Christine Guldbrand ...
exists only when
when it is given by
[
]
:
where B is the
beta function.
Expressions for the
mean,
variance,
skewness and
kurtosis can be derived from this. Writing
for convenience, the mean is
:
and the variance is
:
Explicit expressions for the skewness and kurtosis are lengthy.
As
tends to infinity the mean tends to
, the variance and skewness tend to zero and the excess kurtosis tends to 6/5 (see also
related distributions below).
Quantiles
The
quantile function
In probability and statistics, the quantile function, associated with a probability distribution of a random variable, specifies the value of the random variable such that the probability of the variable being less than or equal to that value equ ...
(inverse cumulative distribution function) is :
:
It follows that the
median
In statistics and probability theory, the median is the value separating the higher half from the lower half of a data sample, a population, or a probability distribution. For a data set, it may be thought of as "the middle" value. The basic fe ...
is
,
the lower
quartile
In statistics, a quartile is a type of quantile which divides the number of data points into four parts, or ''quarters'', of more-or-less equal size. The data must be ordered from smallest to largest to compute quartiles; as such, quartiles are a ...
is
and the upper quartile is
.
Applications
Survival analysis
The log-logistic distribution provides one
parametric model
In statistics, a parametric model or parametric family or finite-dimensional model is a particular class of statistical models. Specifically, a parametric model is a family of probability distributions that has a finite number of parameters.
Def ...
for
survival analysis. Unlike the more commonly used
Weibull distribution, it can have a non-
monotonic hazard function: when
the hazard function is
unimodal (when
≤ 1, the hazard decreases monotonically). The fact that the cumulative distribution function can be written in closed form is particularly useful for analysis of survival data with
censoring.
The log-logistic distribution can be used as the basis of an
accelerated failure time model
In the statistical area of survival analysis, an accelerated failure time model (AFT model) is a parametric model that provides an alternative to the commonly used proportional hazards models. Whereas a proportional hazards model assumes that th ...
by allowing
to differ between groups, or more generally by introducing covariates that affect
but not
by modelling
as a linear function of the covariates.
The
survival function is
:
and so the
hazard function is
:
The log-logistic distribution with shape parameter
is the marginal distribution of the inter-times in a geometric-distributed
counting process.
Hydrology
The log-logistic distribution has been used in hydrology for modelling stream flow rates and precipitation.
[
Extreme values like maximum one-day rainfall and river discharge per month or per year often follow a log-normal distribution. The log-normal distribution, however, needs a numeric approximation. As the log-logistic distribution, which can be solved analytically, is similar to the log-normal distribution, it can be used instead.
The blue picture illustrates an example of fitting the log-logistic distribution to ranked maximum one-day October rainfalls and it shows the 90% ]confidence belt
In frequentist statistics, a confidence interval (CI) is a range of estimates for an unknown parameter. A confidence interval is computed at a designated ''confidence level''; the 95% confidence level is most common, but other levels, such as ...
based on the binomial distribution
In probability theory and statistics, the binomial distribution with parameters ''n'' and ''p'' is the discrete probability distribution of the number of successes in a sequence of ''n'' independent experiments, each asking a yes–no quest ...
. The rainfall data are represented by the plotting position ''r''/(''n''+1) as part of the cumulative frequency analysis.
Economics
The log-logistic has been used as a simple model of the distribution of wealth or income in economics
Economics () is the social science that studies the production, distribution, and consumption of goods and services.
Economics focuses on the behaviour and interactions of economic agents and how economies work. Microeconomics analy ...
, where it is known as the Fisk distribution.
Its Gini coefficient is .[
]
The Gini coefficient for a continuous probability distribution takes the form:
:
where is the CDF of the distribution and is the expected value. For the log-logistic distribution, the formula for the Gini coefficient becomes:
:
Defining the substitution leads to the simpler equation:
:
And making the substitution further simplifies the Gini coefficient formula to:
:
The integral component is equivalent to the standard beta function . The beta function may also be written as:
:
where is the gamma function
In mathematics, the gamma function (represented by , the capital letter gamma from the Greek alphabet) is one commonly used extension of the factorial function to complex numbers. The gamma function is defined for all complex numbers except ...
. Using the properties of the gamma function, it can be shown that:
:
From Euler's reflection formula, the expression can be simplified further:
:
Finally, we may conclude that the Gini coefficient for the log-logistic distribution .
Networking
The log-logistic has been used as a model for the period of time beginning when some data leaves a software user application in a computer and the response is received by the same application after travelling through and being processed by other computers, applications, and network segments, most or all of them without hard real-time guarantees (for example, when an application is displaying data coming from a remote sensor
A sensor is a device that produces an output signal for the purpose of sensing a physical phenomenon.
In the broadest definition, a sensor is a device, module, machine, or subsystem that detects events or changes in its environment and sends ...
connected to the Internet). It has been shown to be a more accurate probabilistic model for that than the log-normal distribution or others, as long as abrupt changes of regime in the sequences of those times are properly detected.
Related distributions
* If then
* If then
* ( Dagum distribution).
* ( Singh–Maddala distribution).
* ( Beta prime distribution).
*If ''X'' has a log-logistic distribution with scale parameter and shape parameter then ''Y'' = log(''X'') has a logistic distribution with location parameter and scale parameter
*As the shape parameter of the log-logistic distribution increases, its shape increasingly resembles that of a (very narrow) logistic distribution. Informally:
::
*The log-logistic distribution with shape parameter and scale parameter is the same as the generalized Pareto distribution with location parameter , shape parameter and scale parameter
::
*The addition of another parameter (a shift parameter) formally results in a shifted log-logistic distribution, but this is usually considered in a different parameterization so that the distribution can be bounded above or bounded below.
Generalizations
Several different distributions are sometimes referred to as the generalized log-logistic distribution, as they contain the log-logistic as a special case.[ These include the ]Burr Type XII distribution
In probability theory, statistics and econometrics, the Burr Type XII distribution or simply the Burr distribution is a continuous probability distribution for a non-negative random variable. It is also known as the Singh–Maddala distribution a ...
(also known as the ''Singh–Maddala distribution'') and the Dagum distribution, both of which include a second shape parameter. Both are in turn special cases of the even more general ''generalized beta distribution of the second kind''. Another more straightforward generalization of the log-logistic is the shifted log-logistic distribution.
Another generalized log-logistic distribution is the log-transform of the metalog distribution, in which power series expansions in terms of are substituted for logistic distribution parameters and . The resulting log-metalog distribution is highly shape flexible, has simple closed form PDF
Portable Document Format (PDF), standardized as ISO 32000, is a file format developed by Adobe in 1992 to present documents, including text formatting and images, in a manner independent of application software, hardware, and operating systems. ...
and quantile function
In probability and statistics, the quantile function, associated with a probability distribution of a random variable, specifies the value of the random variable such that the probability of the variable being less than or equal to that value equ ...
, can be fit to data with linear least squares, and subsumes the log-logistic distribution is special case.
See also
* Probability distributions: List of important distributions supported on semi-infinite intervals
References
{{DEFAULTSORT:Log-Logistic Distribution
Continuous distributions
Survival analysis
Probability distributions with non-finite variance
Economic inequality