In
mathematics and
mathematical optimization
Mathematical optimization (alternatively spelled ''optimisation'') or mathematical programming is the selection of a best element, with regard to some criterion, from some set of available alternatives. It is generally divided into two subfi ...
, the convex conjugate of a function is a generalization of the
Legendre transformation
In mathematics, the Legendre transformation (or Legendre transform), named after Adrien-Marie Legendre, is an involutive transformation on real-valued convex functions of one real variable. In physical problems, it is used to convert function ...
which applies to non-convex functions. It is also known as Legendre–Fenchel transformation, Fenchel transformation, or Fenchel conjugate (after
Adrien-Marie Legendre
Adrien-Marie Legendre (; ; 18 September 1752 – 9 January 1833) was a French mathematician who made numerous contributions to mathematics. Well-known and important concepts such as the Legendre polynomials and Legendre transformation are nam ...
and
Werner Fenchel). It allows in particular for a far reaching generalization of Lagrangian duality.
Definition
Let
be a
real topological vector space and let
be the
dual space
In mathematics, any vector space ''V'' has a corresponding dual vector space (or just dual space for short) consisting of all linear forms on ''V'', together with the vector space structure of pointwise addition and scalar multiplication by con ...
to
. Denote by
:
the canonical
dual pairing, which is defined by
For a function
taking values on the
extended real number line
In mathematics, the affinely extended real number system is obtained from the real number system \R by adding two infinity elements: +\infty and -\infty, where the infinities are treated as actual numbers. It is useful in describing the algebra o ...
, its is the function
:
whose value at
is defined to be the
supremum
In mathematics, the infimum (abbreviated inf; plural infima) of a subset S of a partially ordered set P is a greatest element in P that is less than or equal to each element of S, if such an element exists. Consequently, the term ''greatest l ...
:
:
or, equivalently, in terms of the
infimum
In mathematics, the infimum (abbreviated inf; plural infima) of a subset S of a partially ordered set P is a greatest element in P that is less than or equal to each element of S, if such an element exists. Consequently, the term ''greatest ...
:
:
This definition can be interpreted as an encoding of the
convex hull of the function's
epigraph in terms of its
supporting hyperplanes.
Examples
For more examples, see .
* The convex conjugate of an
affine function
In Euclidean geometry, an affine transformation or affinity (from the Latin, ''affinis'', "connected with") is a geometric transformation that preserves lines and parallelism, but not necessarily Euclidean distances and angles.
More genera ...
is
* The convex conjugate of a
power function is
exponential function
The exponential function is a mathematical function denoted by f(x)=\exp(x) or e^x (where the argument is written as an exponent). Unless otherwise specified, the term generally refers to the positive-valued function of a real variable, ...
f(x)= e^x is
f^\left(x^ \right)
= \begin x^ \ln x^ - x^ , & x^ > 0
\\ 0 , & x^ = 0
\\ \infty , & x^ < 0.
\end
The convex conjugate and Legendre transform of the exponential function agree except that the
domain
Domain may refer to:
Mathematics
*Domain of a function, the set of input values for which the (total) function is defined
** Domain of definition of a partial function
**Natural domain of a partial function
**Domain of holomorphy of a function
*Do ...
of the convex conjugate is strictly larger as the Legendre transform is only defined for positive real numbers.
Connection with expected shortfall (average value at risk)
Se
this article for example.
Let ''F'' denote a
cumulative distribution function
In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x.
Ev ...
of a
random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a mathematical formalization of a quantity or object which depends on random events. It is a mapping or a function from possible outcomes (e.g., the p ...
''X''. Then (integrating by parts),
f(x):= \int_^x F(u) \, du = \operatorname\left max(0,x-X)\right= x-\operatorname \left min(x,X)\right/math>
has the convex conjugate
f^(p)= \int_0^p F^(q) \, dq = (p-1)F^(p)+\operatorname\left min(F^(p),X)\right
= p F^(p)-\operatorname\left max(0,F^(p)-X)\right
Ordering
A particular interpretation has the transform
f^\text(x):= \arg \sup_t t\cdot x-\int_0^1 \max\ \, du,
as this is a nondecreasing rearrangement of the initial function ''f''; in particular,
f^\text= f for ''f'' nondecreasing.
Properties
The convex conjugate of a
closed convex function is again a closed convex function. The convex conjugate of a
polyhedral convex function (a convex function with
polyhedral epigraph) is again a polyhedral convex function.
Order reversing
Declare that
f \le g if and only if
f(x) \le g(x) for all
x. Then convex-conjugation is
order-reversing
In mathematics, a monotonic function (or monotone function) is a function (mathematics), function between List of order structures in mathematics, ordered sets that preserves or reverses the given order relation, order. This concept first aro ...
, which by definition means that if
f \le g then
f^* \ge g^*.
For a family of functions
\left(f_\alpha\right)_\alpha it follows from the fact that supremums may be interchanged that
:
\left(\inf_\alpha f_\alpha\right)^*(x^*) = \sup_\alpha f_\alpha^*(x^*),
and from the
max–min inequality that
:
\left(\sup_\alpha f_\alpha\right)^*(x^*) \le \inf_\alpha f_\alpha^*(x^*).
Biconjugate
The convex conjugate of a function is always
lower semi-continuous
In mathematical analysis, semicontinuity (or semi-continuity) is a property of extended real-valued functions that is weaker than continuity. An extended real-valued function f is upper (respectively, lower) semicontinuous at a point x_0 if, r ...
. The biconjugate
f^ (the convex conjugate of the convex conjugate) is also the
closed convex hull, i.e. the largest
lower semi-continuous
In mathematical analysis, semicontinuity (or semi-continuity) is a property of extended real-valued functions that is weaker than continuity. An extended real-valued function f is upper (respectively, lower) semicontinuous at a point x_0 if, r ...
convex function with
f^ \le f.
For
proper functions f,
:
f = f^ if and only if
In logic and related fields such as mathematics and philosophy, "if and only if" (shortened as "iff") is a biconditional logical connective between statements, where either both statements are true or both are false.
The connective is bi ...
f is convex and lower semi-continuous, by the
Fenchel–Moreau theorem
In convex analysis, the Fenchel–Moreau theorem (named after Werner Fenchel and Jean Jacques Moreau) or Fenchel biconjugation theorem (or just biconjugation theorem) is a theorem which gives necessary and sufficient conditions for a function ...
.
Fenchel's inequality
For any function and its convex conjugate , Fenchel's inequality (also known as the Fenchel–Young inequality) holds for every
x \in X and
:
\left\langle p,x \right\rangle \le f(x) + f^*(p).
The proof follows from the definition of convex conjugate:
f^*(p) = \sup_ \left\ \ge \langle p,x \rangle - f(x).
Convexity
For two functions
f_0 and
f_1 and a number
0 \le \lambda \le 1 the convexity relation
:
\left((1-\lambda) f_0 + \lambda f_1\right)^ \le (1-\lambda) f_0^ + \lambda f_1^
holds. The
operation is a convex mapping itself.
Infimal convolution
The infimal convolution (or epi-sum) of two functions
f and
g is defined as
:
\left( f \operatorname g \right)(x) = \inf \left\.
Let
f_1, \ldots, f_ be
proper, convex and
lower semicontinuous functions on
\mathbb^. Then the infimal convolution is convex and lower semicontinuous (but not necessarily proper), and satisfies
:
\left( f_1 \operatorname \cdots \operatorname f_m \right)^ = f_1^ + \cdots + f_m^.
The infimal convolution of two functions has a geometric interpretation: The (strict)
epigraph of the infimal convolution of two functions is the
Minkowski sum
In geometry, the Minkowski sum (also known as dilation) of two sets of position vectors ''A'' and ''B'' in Euclidean space is formed by adding each vector in ''A'' to each vector in ''B'', i.e., the set
: A + B = \.
Analogously, the Minkowsk ...
of the (strict) epigraphs of those functions.
Maximizing argument
If the function
f is differentiable, then its derivative is the maximizing argument in the computation of the convex conjugate:
:
f^\prime(x) = x^*(x):= \arg\sup_ -f^\left( x^ \right) and
:
f^\left( x^ \right) = x\left( x^ \right):= \arg\sup_x - f(x);
whence
:
x = \nabla f^\left( \nabla f(x) \right),
:
x^ = \nabla f\left( \nabla f^\left( x^ \right)\right),
and moreover
:
f^(x) \cdot f^\left( x^(x) \right) = 1,
:
f^\left( x^ \right) \cdot f^\left( x(x^) \right) = 1.
Scaling properties
If for some
\gamma>0, g(x) = \alpha + \beta x + \gamma \cdot f\left( \lambda x + \delta \right), then
:
g^\left( x^ \right)= - \alpha - \delta\frac \lambda + \gamma \cdot f^\left(\frac \right).
Behavior under linear transformations
Let
A : X \to Y be a
bounded linear operator. For any convex function
f on
X,
:
\left(A f\right)^ = f^ A^
where
:
(A f)(y) = \inf\
is the preimage of
f with respect to
A and
A^ is the
adjoint operator of
A.
A closed convex function
f is symmetric with respect to a given set
G of
orthogonal linear transformations,
:
f(A x) = f(x) for all
x and all
A \in G
if and only if its convex conjugate
f^ is symmetric with respect to
G.
Table of selected convex conjugates
The following table provides Legendre transforms for many common functions as well as a few useful properties.
See also
*
Dual problem
In mathematical optimization theory, duality or the duality principle is the principle that optimization problems may be viewed from either of two perspectives, the primal problem or the dual problem. If the primal is a minimization problem then t ...
*
Fenchel's duality theorem
*
Legendre transformation
In mathematics, the Legendre transformation (or Legendre transform), named after Adrien-Marie Legendre, is an involutive transformation on real-valued convex functions of one real variable. In physical problems, it is used to convert function ...
*
Young's inequality for products In mathematics, Young's inequality for products is a mathematical inequality about the product of two numbers. The inequality is named after William Henry Young and should not be confused with Young's convolution inequality.
Young's inequality f ...
References
*
*
*
Further reading
*
*
*
*
(271 pages)
*
(24 pages)
{{Convex analysis and variational analysis
Convex analysis
Duality theories
Theorems involving convexity
Transforms