probability theory Probability theory or probability calculus is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expre ...

and

statistics Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...

, the generalized extreme value (GEV) distribution is a family of continuous

probability distribution In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...

s developed within

extreme value theory Extreme value theory or extreme value analysis (EVA) is the study of extremes in statistical distributions. It is widely used in many disciplines, such as structural engineering, finance, economics, earth sciences, traffic prediction, and Engin ...

to combine the Gumbel, Fréchet and Weibull families also known as type I, II and III extreme value distributions. By the

extreme value theorem In calculus, the extreme value theorem states that if a real-valued function f is continuous on the closed and bounded interval ,b/math>, then f must attain a maximum and a minimum, each at least once. That is, there exist numbers c and ...

the GEV distribution is the only possible limit distribution of properly normalized maxima of a sequence of independent and identically distributed random variables. Note that a limit distribution needs to exist, which requires regularity conditions on the tail of the distribution. Despite this, the GEV distribution is often used as an approximation to model the maxima of long (finite) sequences of random variables. In some fields of application the generalized extreme value distribution is known as the Fisher–Tippett distribution, named after R.A. Fisher and L.H.C. Tippett who recognised three different forms outlined below. However usage of this name is sometimes restricted to mean the special case of the

Gumbel distribution In probability theory and statistics, the Gumbel distribution (also known as the type-I generalized extreme value distribution) is used to model the distribution of the maximum (or the minimum) of a number of samples of various distributions. Thi ...

. The origin of the common functional form for all three distributions dates back to at least , though allegedly it could also have been given by .

Specification

Using the standardized variable

s = \tfrac

, where

\mu

, the location parameter, can be any real number, and

\sigma > 0

is the scale parameter; the cumulative distribution function of the GEV distribution is then :

F(s; \xi) = \begin \exp (-\mathrm^) & \text \xi = 0 , \\
\exp \bigl( \! - ( 1 + \xi s)^ \bigr) & \text \xi \neq 0 \text \xi s > -1 , \\
0 & \text \xi > 0 \text s \le -\tfrac , \\
1 & \text \xi < 0 \text s \ge \tfrac , \end

where

\xi

, the shape parameter, can be any real number. Thus, for

\xi > 0

, the expression is valid for

s > -\tfrac

, while for

\xi < 0

it is valid for

s < - \tfrac

. In the first case,

-\tfrac

is the negative, lower end-point, where

F

is ; in the second case,

-\tfrac

is the positive, upper end-point, where

F

is 1. For

\xi = 0

, the second expression is formally undefined and is replaced with the first expression, which is the result of taking the limit of the second, as

\xi \to 0

in which case

s

can be any real number. In the special case of

x = \mu

, we have

s = 0

, so

F(0; \xi) = \mathrm^ \approx 0.368

regardless of the values of

\xi

and

\sigma

. The probability density function of the standardized distribution is :

f(s; \xi) = \begin \mathrm^ \exp (-\mathrm^ ) & \text \xi = 0 , \\
(1 + \xi s)^ \exp\bigl(\! -( 1 + \xi s )^ \bigr) & \text \xi \neq 0 \text \xi s > -1 , \\
0 & \text \end

again valid for

s > -\tfrac

in the case

\xi > 0

, and for

s < -\tfrac

in the case

\xi < 0

. The density is zero outside of the relevant range. In the case

\xi = 0

, the density is positive on the whole real line. Since the cumulative distribution function is invertible, the quantile function for the GEV distribution has an explicit expression, namely :

; \end

and therefore the quantile density function

q = \tfrac

is :

q(p; \sigma, \xi) = \frac \qquad \text p \in (0, 1) ,

valid for

\sigma > 0

and for any real

\xi

Summary statistics

Using

\ g_k \equiv \Gamma(1 - k\ \xi) ~

for

~ k \in\\ ,

where

\ \Gamma(\cdot)\

is the

gamma function In mathematics, the gamma function (represented by Γ, capital Greek alphabet, Greek letter gamma) is the most common extension of the factorial function to complex numbers. Derived by Daniel Bernoulli, the gamma function \Gamma(z) is defined ...

, some simple statistics of the distribution are given by: :

\ \operatorname\left( X \right) = \mu + \bigl(\ g_1 - 1\ \bigr)\frac \quad

for

\quad \xi < 1\ ,

\ \operatorname\left( X \right) = \bigl(\ g_2 - g_1^2\ \bigr) \frac\ ,

\ \operatorname\left( X \right) = \mu + \Bigl(\ \left( 1 + \xi \right)^ - 1\ \Bigr) \frac ~.

The

skewness In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable about its mean. The skewness value can be positive, zero, negative, or undefined. For a unimodal ...

is :

\ \operatorname\left( X \right) = \begin
 \frac \cdot \sgn(\xi) \quad ~~ \mathsf\quad \xi \ne 0\ , \\
 \\
\quad \frac \quad \approx \quad 1.14 \quad\qquad \mathsf\quad \xi = 0  ~.
\end

The excess

kurtosis In probability theory and statistics, kurtosis (from , ''kyrtos'' or ''kurtos'', meaning "curved, arching") refers to the degree of “tailedness” in the probability distribution of a real-valued random variable. Similar to skewness, kurtos ...

is: :

\ \operatorname\left( X \right) = \frac - 3 ~.

Link to Fréchet, Weibull, and Gumbel families

The shape parameter

\ \xi\

governs the tail behavior of the distribution. The sub-

families Family (from ) is a group of people related either by consanguinity (by recognized birth) or affinity (by marriage or other relationship). It forms the basis for social order. Ideally, families offer predictability, structure, and safety as ...

defined by three cases:

\ \xi = 0\ ,

\ \xi > 0\ ,

and

\ \xi < 0\ ;

these correspond, respectively, to the ''Gumbel'', ''Fréchet'', and ''Weibull''

, whose cumulative distribution functions are displayed below. * Type I or '' Gumbel'' extreme value distribution, case

~ \xi = 0\ , \quad

for all

\quad x \in  \Bigl(\ -\infty\ ,\ +\infty\ \Bigr)\ :

F(\ x;\ \mu,\ \sigma,\ 0\ ) = \exp \left( - \exp \left( -\frac \right)  \right) ~.

* Type II or '' Fréchet'' extreme value distribution, case

~ \xi  > 0\ , \quad

for all

\quad  x \in \left(\ \mu - \tfrac\ ,\ +\infty\ \right)\ :

:Let

\quad \alpha \equiv \tfrac > 0 \quad

and

\quad y \equiv 1 + \tfrac (x-\mu)\ ;

F(\ x;\ \mu,\ \sigma,\ \xi\ ) = \begin 0 & y \leq 0 \quad \mathsf \quad x \leq \mu - \tfrac \\ \exp\left( -\frac \right) & y > 0 \quad \mathsf \quad x > \mu - \tfrac ~. \end

* Type III or ''reversed Weibull'' extreme value distribution, case

~ \xi < 0\ , \quad

for all

\quad x \in \left( -\infty\ ,\ \mu + \tfrac\ \right)\ :

:Let

\quad \alpha \equiv - \tfrac > 0 \quad

and

\quad y \equiv 1 - \tfrac (x - \mu)\ ;

F(\ x;\ \mu,\ \sigma,\ \xi\ ) = \begin \exp\left( -y^ \right) & y > 0 \quad \mathsf \quad x < \mu + \tfrac \\ 1 & y \leq 0  \quad \mathsf \quad x \geq \mu + \tfrac ~. \end

The subsections below remark on properties of these distributions.

Modification for minima rather than maxima

The theory here relates to data maxima and the distribution being discussed is an extreme value distribution for maxima. A generalised extreme value distribution for data minima can be obtained, for example by substituting

\ -x\;

for

\;x\;

in the distribution function, and subtracting the cumulative distribution from one: That is, replace

\ F(x)\

with Doing so yields yet another family of distributions.

Alternative convention for the Weibull distribution

The ordinary Weibull distribution arises in reliability applications and is obtained from the distribution here by using the variable

\ t = \mu - x\ ,

which gives a strictly positive support, in contrast to the use in the formulation of extreme value theory here. This arises because the ordinary Weibull distribution is used for cases that deal with data ''minima'' rather than data maxima. The distribution here has an addition parameter compared to the usual form of the Weibull distribution and, in addition, is reversed so that the distribution has an upper bound rather than a lower bound. Importantly, in applications of the GEV, the upper bound is unknown and so must be estimated, whereas when applying the ordinary Weibull distribution in reliability applications the lower bound is usually known to be zero.

Ranges of the distributions

Note the differences in the ranges of interest for the three extreme value distributions: Gumbel is unlimited, Fréchet has a lower limit, while the reversed Weibull has an upper limit. More precisely, univariate extreme value theory describes which of the three is the limiting law according to the initial law and in particular depending on the original distribution's tail.

Distribution of log variables

One can link the type I to types II and III in the following way: If the cumulative distribution function of some random variable

\ X\

is of type II, and with the positive numbers as support, i.e.

\ F(\ x;\ 0,\ \sigma,\ \alpha\ )\ ,

then the cumulative distribution function of

\ln X

is of type I, namely

\ F(\ x;\ \ln \sigma,\ \tfrac,\ 0\ ) ~.

Similarly, if the cumulative distribution function of

\ X\

is of type III, and with the negative numbers as support, i.e.

\ F(\ x;\ 0,\ \sigma,\ -\alpha\ )\ ,

then the cumulative distribution function of

\ \ln (-X)\

is of type I, namely

\ F(\ x;\ -\ln \sigma,\ \tfrac,\ 0\ ) ~.

Link to logit models (logistic regression)

Multinomial logit In statistics, multinomial logistic regression is a classification method that generalizes logistic regression to multiclass problems, i.e. with more than two possible discrete outcomes. That is, it is a model that is used to predict the prob ...

models, and certain other types of

logistic regression In statistics, a logistic model (or logit model) is a statistical model that models the logit, log-odds of an event as a linear function (calculus), linear combination of one or more independent variables. In regression analysis, logistic regres ...

, can be phrased as

latent variable In statistics, latent variables (from Latin: present participle of ) are variables that can only be inferred indirectly through a mathematical model from other observable variables that can be directly observed or measured. Such '' latent va ...

models with

error variable In statistics, linear regression is a model that estimates the relationship between a scalar response (dependent variable) and one or more explanatory variables (regressor or independent variable). A model with exactly one explanatory variable ...

s distributed as

s (type I generalized extreme value distributions). This phrasing is common in the theory of

discrete choice In economics, discrete choice models, or qualitative choice models, describe, explain, and predict choices between two or more discrete alternatives, such as entering or not entering the labor market, or choosing between modes of transport. Such c ...

models, which include

logit model In statistics, the logit ( ) function is the quantile function associated with the standard logistic distribution. It has many uses in data analysis and machine learning, especially in data transformations. Mathematically, the logit is the ...

probit model In statistics, a probit model is a type of regression where the dependent variable can take only two values, for example married or not married. The word is a portmanteau, coming from ''probability'' + ''unit''. The purpose of the model is to es ...

s, and various extensions of them, and derives from the fact that the difference of two type-I GEV-distributed variables follows a

logistic distribution In probability theory and statistics, the logistic distribution is a continuous probability distribution. Its cumulative distribution function is the logistic function, which appears in logistic regression and feedforward neural networks. It rese ...

, of which the

logit function In statistics, the logit ( ) function is the quantile function associated with the standard logistic distribution. It has many uses in data analysis and machine learning, especially in Data transformation (statistics), data transformations. Ma ...

is the

quantile function In probability and statistics, the quantile function is a function Q: ,1\mapsto \mathbb which maps some probability x \in ,1/math> of a random variable v to the value of the variable y such that P(v\leq y) = x according to its probability distr ...

. The type-I GEV distribution thus plays the same role in these logit models as the

normal distribution In probability theory and statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is f(x) = \frac ...

does in the corresponding probit models.

Properties

The

cumulative distribution function In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x. Ever ...

of the generalized extreme value distribution solves the

stability postulate In probability theory, to obtain a nondegenerate limiting distribution for extremes of samples, it is necessary to "reduce" the actual greatest value by applying a linear transformation with coefficients that depend on the sample size. If \ X_1, ...

equation. The generalized extreme value distribution is a special case of a max-stable distribution, and is a transformation of a min-stable distribution.

Applications

* The GEV distribution is widely used in the treatment of "tail risks" in fields ranging from insurance to finance. In the latter case, it has been considered as a means of assessing various financial risks via metrics such as

value at risk Value at risk (VaR) is a measure of the risk of loss of investment/capital. It estimates how much a set of investments might lose (with a given probability), given normal market conditions, in a set time period such as a day. VaR is typically us ...

* However, the resulting shape parameters have been found to lie in the range leading to undefined means and variances, which underlines the fact that reliable data analysis is often impossible. * In

hydrology Hydrology () is the scientific study of the movement, distribution, and management of water on Earth and other planets, including the water cycle, water resources, and drainage basin sustainability. A practitioner of hydrology is called a hydro ...

the GEV distribution is applied to extreme events such as annual maximum one-day rainfalls and river discharges. The blue picture, made with CumFreq, illustrates an example of fitting the GEV distribution to ranked annually maximum one-day rainfalls showing also the 90% confidence belt based on the

binomial distribution In probability theory and statistics, the binomial distribution with parameters and is the discrete probability distribution of the number of successes in a sequence of statistical independence, independent experiment (probability theory) ...

. The rainfall data are represented by plotting positions as part of the cumulative frequency analysis.

Example for Normally distributed variables

Let

\ \left\\

be i.i.d.

normally distributed In probability theory and statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real number, real-valued random variable. The general form of its probability density function is f(x ...

random variables with mean and variance . The

Fisher–Tippett–Gnedenko theorem In statistics, the Fisher–Tippett–Gnedenko theorem (also the Fisher–Tippett theorem or the extreme value theorem) is a general result in extreme value theory regarding asymptotic distribution of extreme order statistics. The maximum of a sa ...

tells us that

\ \max \ \sim GEV(\mu_n, \sigma_n, 0)\ ,

where

\begin
   \mu_n &= \Phi^\left( 1 - \frac \right) \\
   \sigma_n &= \Phi^\left( 1 - \frac \right)- \Phi^\left(1-\frac \right) ~.
\end

This allow us to estimate e.g. the mean of

\ \max \\

from the mean of the GEV distribution:

\begin
\operatorname\left\
& \approx \mu_n + \gamma_\ \sigma_n \\
&= (1 - \gamma_)\ \Phi^\left( 1 - \frac \right) + \gamma_\ \Phi^\left( 1 - \frac \right) \\
&= \sqrt\ \cdot\ \left(1 + \frac + \mathcal \left(\frac \right) \right)\ ,
\end

where

\ \gamma_\

is the

Euler–Mascheroni constant Euler's constant (sometimes called the Euler–Mascheroni constant) is a mathematical constant, usually denoted by the lowercase Greek letter gamma (), defined as the limiting difference between the harmonic series and the natural logarith ...

Related distributions

# If

\ X \sim \textrm(\mu,\,\sigma,\,\xi)\

then

\ m X + b \sim \textrm(m \mu+b,\ , m, \sigma,\ \xi)\

# If

\ X \sim \textrm(\mu,\ \sigma)\

(

) then

\ X \sim \textrm(\mu,\,\sigma,\,0)\

# If

\ X \sim \textrm(\sigma,\,\mu)\

(

Weibull distribution In probability theory and statistics, the Weibull distribution is a continuous probability distribution. It models a broad range of random variables, largely in the nature of a time to failure or time between events. Examples are maximum on ...

) then

\ \mu\left(1-\sigma\log \tfrac \right) \sim \textrm(\mu,\,\sigma,\,0)\

# If

\ X \sim \textrm(\mu,\,\sigma,\,0)\

then

\ \sigma \exp (-\tfrac ) \sim \textrm(\sigma,\,\mu)\

(

) # If

\ X \sim \textrm(1)\

(

Exponential distribution In probability theory and statistics, the exponential distribution or negative exponential distribution is the probability distribution of the distance between events in a Poisson point process, i.e., a process in which events occur continuousl ...

) then

\ \mu - \sigma \log X \sim \textrm(\mu,\,\sigma,\,0)\

# If

\ X \sim \mathrm(\alpha_X, \beta)\

and

\ Y \sim \mathrm(\alpha_Y, \beta)\

then

\ X-Y \sim \mathrm(\alpha_X-\alpha_Y,\beta)\

(see

Logistic distribution In probability theory and statistics, the logistic distribution is a continuous probability distribution. Its cumulative distribution function is the logistic function, which appears in logistic regression and feedforward neural networks. It rese ...

). # If

\ X\

and

\ Y \sim \mathrm(\alpha, \beta)\

then

\ X+Y \nsim \mathrm(2 \alpha,\beta)\

(The sum is ''not'' a logistic distribution). :: Note that

\ \operatorname\ = 2\alpha+2\beta\gamma \neq 2\alpha = \operatorname\left\ ~.

Proofs

4. Let

\ X \sim \textrm(\sigma,\,\mu)\ ,

then the cumulative distribution of

\ g(x) = \mu\left(1-\sigma\log\frac \right)\

is: :

\right), \quad s = \frac\ , \end

:which is the cdf for

\sim \textrm(\mu,\,\sigma,\,0) ~.

5. Let

\ X \sim \textrm(1)\ ,

then the cumulative distribution of

\ g(X) = \mu - \sigma \log X\

is: :

, \quad ~\mathsf~ \quad s \equiv \frac\ ; \end

:which is the cumulative distribution of

\ \operatorname(\mu, \sigma, 0) ~.

Specification

Summary statistics

Link to Fréchet, Weibull, and Gumbel families

Modification for minima rather than maxima

Alternative convention for the Weibull distribution

Ranges of the distributions

Distribution of log variables

Link to logit models (logistic regression)

Properties

Applications

Example for Normally distributed variables

Related distributions

Proofs

See also

References

Further reading