In
mathematics
Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. There are many ar ...
and
mathematical optimization
Mathematical optimization (alternatively spelled ''optimisation'') or mathematical programming is the selection of a best element, with regard to some criteria, from some set of available alternatives. It is generally divided into two subfiel ...
, the convex conjugate of a function is a generalization of the
Legendre transformation which applies to non-convex functions. It is also known as Legendre–Fenchel transformation, Fenchel transformation, or Fenchel conjugate (after
Adrien-Marie Legendre and
Werner Fenchel). The convex conjugate is widely used for constructing the
dual problem
In mathematical optimization theory, duality or the duality principle is the principle that optimization problems may be viewed from either of two perspectives, the primal problem or the dual problem. If the primal is a minimization problem then th ...
in
optimization theory, thus generalizing
Lagrangian duality.
Definition
Let
be a
real topological vector space
In mathematics, a topological vector space (also called a linear topological space and commonly abbreviated TVS or t.v.s.) is one of the basic structures investigated in functional analysis.
A topological vector space is a vector space that is als ...
and let
be the
dual space
In mathematics, any vector space ''V'' has a corresponding dual vector space (or just dual space for short) consisting of all linear forms on ''V,'' together with the vector space structure of pointwise addition and scalar multiplication by cons ...
to
. Denote by
:
the canonical
dual pairing, which is defined by
For a function
taking values on the
extended real number line, its is the function
:
whose value at
is defined to be the
supremum
In mathematics, the infimum (abbreviated inf; : infima) of a subset S of a partially ordered set P is the greatest element in P that is less than or equal to each element of S, if such an element exists. If the infimum of S exists, it is unique, ...
:
:
or, equivalently, in terms of the
infimum:
:
This definition can be interpreted as an encoding of the
convex hull
In geometry, the convex hull, convex envelope or convex closure of a shape is the smallest convex set that contains it. The convex hull may be defined either as the intersection of all convex sets containing a given subset of a Euclidean space, ...
of the function's
epigraph in terms of its
supporting hyperplanes.
Examples
For more examples, see .
* The convex conjugate of an
affine function
In Euclidean geometry, an affine transformation or affinity (from the Latin, ''wikt:affine, affinis'', "connected with") is a geometric transformation that preserves line (geometry), lines and parallel (geometry), parallelism, but not necessarily ...
is
* The convex conjugate of a
power function is
absolute value
In mathematics, the absolute value or modulus of a real number x, is the non-negative value without regard to its sign. Namely, , x, =x if x is a positive number, and , x, =-x if x is negative (in which case negating x makes -x positive), ...
function
is
* The convex conjugate of the
exponential function is
The convex conjugate and Legendre transform of the exponential function agree except that the
domain of the convex conjugate is strictly larger as the Legendre transform is only defined for
positive real numbers.
Connection with expected shortfall (average value at risk)
Se
this article for example.
Let ''F'' denote a
cumulative distribution function
In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x.
Ever ...
of a
random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...
''X''. Then (
integrating by parts),
Ordering
A particular interpretation has the transform
f^\text(x):= \arg \sup_t t\cdot x-\int_0^1 \max\ \, du,
as this is a nondecreasing rearrangement of the initial function ''f''; in particular,
f^\text= f for ''f'' nondecreasing.
Properties
The convex conjugate of a
closed convex function is again a closed convex function. The convex conjugate of a
polyhedral convex function (a convex function with
polyhedral epigraph) is again a polyhedral convex function.
Order reversing
Declare that
f \le g if and only if
f(x) \le g(x) for all
x. Then convex-conjugation is
order-reversing, which by definition means that if
f \le g then
f^* \ge g^*.
For a family of functions
\left(f_\alpha\right)_\alpha it follows from the fact that supremums may be interchanged that
:
\left(\inf_\alpha f_\alpha\right)^*(x^*) = \sup_\alpha f_\alpha^*(x^*),
and from the
max–min inequality that
:
\left(\sup_\alpha f_\alpha\right)^*(x^*) \le \inf_\alpha f_\alpha^*(x^*).
Biconjugate
The convex conjugate of a function is always
lower semi-continuous. The biconjugate
f^ (the convex conjugate of the convex conjugate) is also the
closed convex hull, i.e. the largest
lower semi-continuous convex function with
f^ \le f.
For
proper functions f,
:
f = f^ if and only if
In logic and related fields such as mathematics and philosophy, "if and only if" (often shortened as "iff") is paraphrased by the biconditional, a logical connective between statements. The biconditional is true in two cases, where either bo ...
f is convex and lower semi-continuous, by the
Fenchel–Moreau theorem.
Fenchel's inequality
For any function and its convex conjugate , Fenchel's inequality (also known as the Fenchel–Young inequality) holds for every
x \in X and
:
\left\langle p,x \right\rangle \le f(x) + f^*(p).
Furthermore, the equality holds only when
p \in \partial f(x).
The proof follows from the definition of convex conjugate:
f^*(p) = \sup_ \left\ \ge \langle p,x \rangle - f(x).
Convexity
For two functions
f_0 and
f_1 and a number
0 \le \lambda \le 1 the convexity relation
:
\left((1-\lambda) f_0 + \lambda f_1\right)^ \le (1-\lambda) f_0^ + \lambda f_1^
holds. The
operation is a convex mapping itself.
Infimal convolution
The infimal convolution (or epi-sum) of two functions
f and
g is defined as
:
\left( f \operatorname g \right)(x) = \inf \left\.
Let
f_1, \ldots, f_ be
proper, convex and
lower semicontinuous functions on
\mathbb^. Then the infimal convolution is convex and lower semicontinuous (but not necessarily proper), and satisfies
:
\left( f_1 \operatorname \cdots \operatorname f_m \right)^ = f_1^ + \cdots + f_m^.
The infimal convolution of two functions has a geometric interpretation: The (strict)
epigraph of the infimal convolution of two functions is the
Minkowski sum of the (strict) epigraphs of those functions.
Maximizing argument
If the function
f is differentiable, then its derivative is the maximizing argument in the computation of the convex conjugate:
:
f^\prime(x) = x^*(x):= \arg\sup_ -f^\left( x^ \right) and
:
f^\left( x^ \right) = x\left( x^ \right):= \arg\sup_x - f(x);
hence
:
x = \nabla f^\left( \nabla f(x) \right),
:
x^ = \nabla f\left( \nabla f^\left( x^ \right)\right),
and moreover
:
f^(x) \cdot f^\left( x^(x) \right) = 1,
:
f^\left( x^ \right) \cdot f^\left( x(x^) \right) = 1.
Scaling properties
If for some
\gamma>0, g(x) = \alpha + \beta x + \gamma \cdot f\left( \lambda x + \delta \right), then
:
g^\left( x^ \right)= - \alpha - \delta\frac \lambda + \gamma \cdot f^\left(\frac \right).
Behavior under linear transformations
Let
A : X \to Y be a
bounded linear operator. For any convex function
f on
X,
:
\left(A f\right)^ = f^ A^
where
:
(A f)(y) = \inf\
is the preimage of
f with respect to
A and
A^ is the
adjoint operator of
A.
A closed convex function
f is symmetric with respect to a given set
G of
orthogonal linear transformations,
:
f(A x) = f(x) for all
x and all
A \in G
if and only if its convex conjugate
f^ is symmetric with respect to
G.
Table of selected convex conjugates
The following table provides Legendre transforms for many common functions as well as a few useful properties.
See also
*
Dual problem
In mathematical optimization theory, duality or the duality principle is the principle that optimization problems may be viewed from either of two perspectives, the primal problem or the dual problem. If the primal is a minimization problem then th ...
*
Fenchel's duality theorem
*
Legendre transformation
*
Young's inequality for products
References
*
*
*
Further reading
*
*
*
(271 pages)
*
(24 pages)
{{Convex analysis and variational analysis
Convex analysis
Duality theories
Theorems involving convexity
Transforms