
In
functional analysis
Functional analysis is a branch of mathematical analysis, the core of which is formed by the study of vector spaces endowed with some kind of limit-related structure (for example, Inner product space#Definition, inner product, Norm (mathematics ...
, a reproducing kernel Hilbert space (RKHS) is a
Hilbert space
In mathematics, a Hilbert space is a real number, real or complex number, complex inner product space that is also a complete metric space with respect to the metric induced by the inner product. It generalizes the notion of Euclidean space. The ...
of functions in which point evaluation is a continuous
linear functional
In mathematics, a linear form (also known as a linear functional, a one-form, or a covector) is a linear mapIn some texts the roles are reversed and vectors are defined as linear maps from covectors to scalars from a vector space to its field of ...
. Specifically, a Hilbert space
of functions from a set
(to
or
) is an RKHS if the point-evaluation functional
,
, is continuous for every
. Equivalently,
is an RKHS if there exists a function
such that, for all
,
The function
is then called the ''reproducing kernel'', and it reproduces the value of
at
via the inner product.
An immediate consequence of this property is that convergence in norm implies uniform convergence on any subset of
on which
is bounded. However, the converse does not necessarily hold. Often the set
carries a topology, and
depends continuously on
, in which case: convergence in norm implies uniform convergence on compact subsets of
.
It is not entirely straightforward to construct natural examples of a Hilbert space which are not an RKHS in a non-trivial fashion. Some examples, however, have been found.
While, formally,
''L''2 spaces are defined as Hilbert spaces of equivalence classes of functions, this definition can trivially be extended to a Hilbert space of functions by choosing a (total) function as a representative for each equivalence class. However, no choice of representatives can make this space an RKHS (
would need to be the non-existent Dirac delta function). However, there are RKHSs in which the norm is an ''L''
2-norm, such as the space of band-limited functions (see the example below).
An RKHS is associated with a kernel that reproduces every function in the space in the sense that for every
in the set on which the functions are defined, "evaluation at
" can be performed by taking an inner product with a function determined by the kernel. Such a ''reproducing kernel'' exists if and only if every evaluation functional is continuous.
The reproducing kernel was first introduced in the 1907 work of
Stanisław Zaremba concerning
boundary value problem
In the study of differential equations, a boundary-value problem is a differential equation subjected to constraints called boundary conditions. A solution to a boundary value problem is a solution to the differential equation which also satis ...
s for
harmonic
In physics, acoustics, and telecommunications, a harmonic is a sinusoidal wave with a frequency that is a positive integer multiple of the ''fundamental frequency'' of a periodic signal. The fundamental frequency is also called the ''1st har ...
and
biharmonic functions.
James Mercer simultaneously examined
functions which satisfy the reproducing property in the theory of
integral equation
In mathematical analysis, integral equations are equations in which an unknown function appears under an integral sign. In mathematical notation, integral equations may thus be expressed as being of the form: f(x_1,x_2,x_3,\ldots,x_n ; u(x_1,x_2 ...
s. The idea of the reproducing kernel remained untouched for nearly twenty years until it appeared in the dissertations of
Gábor Szegő
Gábor Szegő () (January 20, 1895 – August 7, 1985) was a Hungarian-American mathematician. He was one of the foremost mathematical analysts of his generation and made fundamental contributions to the theory of orthogonal polynomials and ...
,
Stefan Bergman
Stefan Bergman (5 May 1895 – 6 June 1977) was a Russian Poland, Poland-born American mathematician whose primary work was in complex analysis. He is known for the positive-definite kernel, kernel function he discovered in 1922 at Humboldt Univer ...
, and
Salomon Bochner
Salomon Bochner (20 August 1899 – 2 May 1982) was a Galizien-born mathematician, known for work in mathematical analysis, probability theory and differential geometry.
Life
He was born into a Jewish family in Podgórze (near Kraków), th ...
. The subject was eventually systematically developed in the early 1950s by
Nachman Aronszajn
Nachman Aronszajn (26 July 1907 – 5 February 1980) was a Polish American mathematician. Aronszajn's main field of study was mathematical analysis, where he systematically developed the concept of reproducing kernel Hilbert space. He also contrib ...
and Stefan Bergman.
These spaces have wide applications, including
complex analysis
Complex analysis, traditionally known as the theory of functions of a complex variable, is the branch of mathematical analysis that investigates functions of complex numbers. It is helpful in many branches of mathematics, including algebraic ...
,
harmonic analysis
Harmonic analysis is a branch of mathematics concerned with investigating the connections between a function and its representation in frequency. The frequency representation is found by using the Fourier transform for functions on unbounded do ...
, and
quantum mechanics
Quantum mechanics is the fundamental physical Scientific theory, theory that describes the behavior of matter and of light; its unusual characteristics typically occur at and below the scale of atoms. Reprinted, Addison-Wesley, 1989, It is ...
. Reproducing kernel Hilbert spaces are particularly important in the field of
statistical learning theory
Statistical learning theory is a framework for machine learning drawing from the fields of statistics and functional analysis. Statistical learning theory deals with the statistical inference problem of finding a predictive function based on da ...
because of the celebrated
representer theorem
For computer science, in statistical learning theory, a representer theorem is any of several related results stating that a minimizer f^ of a regularized Empirical risk minimization, empirical risk functional defined over a reproducing kernel Hi ...
which states that every function in an RKHS that minimises an empirical risk functional can be written as a
linear combination
In mathematics, a linear combination or superposition is an Expression (mathematics), expression constructed from a Set (mathematics), set of terms by multiplying each term by a constant and adding the results (e.g. a linear combination of ''x'' a ...
of the kernel function evaluated at the training points. This is a practically useful result as it effectively simplifies the
empirical risk minimization
In statistical learning theory, the principle of empirical risk minimization defines a family of learning algorithms based on evaluating performance over a known and fixed dataset. The core idea is based on an application of the law of large num ...
problem from an infinite dimensional to a finite dimensional optimization problem.
For ease of understanding, we provide the framework for real-valued Hilbert spaces. The theory can be easily extended to spaces of complex-valued functions and hence include the many important examples of reproducing kernel Hilbert spaces that are spaces of
analytic functions
In mathematics, an analytic function is a function that is locally given by a convergent power series. There exist both real analytic functions and complex analytic functions. Functions of each type are infinitely differentiable, but complex ...
.
Definition
Let
be an arbitrary
set
Set, The Set, SET or SETS may refer to:
Science, technology, and mathematics Mathematics
*Set (mathematics), a collection of elements
*Category of sets, the category whose objects and morphisms are sets and total functions, respectively
Electro ...
and
a
Hilbert space
In mathematics, a Hilbert space is a real number, real or complex number, complex inner product space that is also a complete metric space with respect to the metric induced by the inner product. It generalizes the notion of Euclidean space. The ...
of
real-valued function
In mathematics, a real-valued function is a function whose values are real numbers. In other words, it is a function that assigns a real number to each member of its domain.
Real-valued functions of a real variable (commonly called ''real ...
s on
, equipped with pointwise addition and pointwise scalar multiplication. The
evaluation
In common usage, evaluation is a systematic determination and assessment of a subject's merit, worth and significance, using criteria governed by a set of Standardization, standards. It can assist an organization, program, design, project or any o ...
functional over the Hilbert space of functions
is a linear functional that evaluates each function at a point
,
:
We say that ''H'' is a reproducing kernel Hilbert space if, for all
in
,
is
continuous
Continuity or continuous may refer to:
Mathematics
* Continuity (mathematics), the opposing concept to discreteness; common examples include
** Continuous probability distribution or random variable in probability and statistics
** Continuous ...
at every
in
or, equivalently, if
is a
bounded operator
In functional analysis and operator theory, a bounded linear operator is a linear transformation L : X \to Y between topological vector spaces (TVSs) X and Y that maps bounded subsets of X to bounded subsets of Y.
If X and Y are normed vector ...
on
, i.e. there exists some
such that
Although
is assumed for all
, it might still be the case that
.
While property () is the weakest condition that ensures both the existence of an inner product and the evaluation of every function in
at every point in the domain, it does not lend itself to easy application in practice. A more intuitive definition of the RKHS can be obtained by observing that this property guarantees that the evaluation functional can be represented by taking the inner product of
with a function
in
. This function is the so-called reproducing kernel for the Hilbert space
from which the RKHS takes its name. More formally, the
Riesz representation theorem
The Riesz representation theorem, sometimes called the Riesz–Fréchet representation theorem after Frigyes Riesz and Maurice René Fréchet, establishes an important connection between a Hilbert space and its continuous dual space. If the un ...
implies that for all
in
there exists a unique element
of
with the reproducing property,
Since
is itself a function defined on
with values in the field
(or
in the case of complex Hilbert spaces) and as
is in
we have that
:
where
is the element in
associated to
.
This allows us to define the reproducing kernel of
as a function
(or
in the complex case) by
:
From this definition it is easy to see that
(or
in the complex case) is both symmetric (resp. conjugate symmetric) and
positive definite In mathematics, positive definiteness is a property of any object to which a bilinear form
In mathematics, a bilinear form is a bilinear map on a vector space (the elements of which are called '' vectors'') over a field ''K'' (the elements of w ...
, i.e.
:
for every
The Moore–Aronszajn theorem (see below) is a sort of converse to this: if a function
satisfies these conditions then there is a Hilbert space of functions on
for which it is a reproducing kernel.
Examples
The simplest example of a reproducing kernel Hilbert space is the space
where
is a set and
is the
counting measure
In mathematics, specifically measure theory, the counting measure is an intuitive way to put a measure on any set – the "size" of a subset is taken to be the number of elements in the subset if the subset has finitely many elements, and infinit ...
on
. For
, the reproducing kernel
is the
indicator function
In mathematics, an indicator function or a characteristic function of a subset of a set is a function that maps elements of the subset to one, and all other elements to zero. That is, if is a subset of some set , then the indicator functio ...
of the one point set
.
Nontrivial reproducing kernel Hilbert spaces often involve
analytic function
In mathematics, an analytic function is a function that is locally given by a convergent power series. There exist both real analytic functions and complex analytic functions. Functions of each type are infinitely differentiable, but complex ...
s, as we now illustrate by example. Consider the Hilbert space of
bandlimited
Bandlimiting is the process of reducing a signal’s energy outside a specific frequency range, keeping only the desired part of the signal’s spectrum. This technique is crucial in signal processing and communications to ensure signals stay cl ...
continuous function
In mathematics, a continuous function is a function such that a small variation of the argument induces a small variation of the value of the function. This implies there are no abrupt changes in value, known as '' discontinuities''. More preci ...
s
. Fix some
cutoff frequency
In physics and electrical engineering, a cutoff frequency, corner frequency, or break frequency is a boundary in a system's frequency response at which energy flowing through the system begins to be reduced ( attenuated or reflected) rather than ...