The Kantorovich theorem, or Newton–Kantorovich theorem, is a mathematical statement on the semi-local
convergence
Convergence may refer to:
Arts and media Literature
*''Convergence'' (book series), edited by Ruth Nanda Anshen
*Convergence (comics), "Convergence" (comics), two separate story lines published by DC Comics:
**A four-part crossover storyline that ...
of
Newton's method
In numerical analysis, the Newton–Raphson method, also known simply as Newton's method, named after Isaac Newton and Joseph Raphson, is a root-finding algorithm which produces successively better approximations to the roots (or zeroes) of a ...
. It was first stated by
Leonid Kantorovich
Leonid Vitalyevich Kantorovich (, ; 19 January 19127 April 1986) was a Soviet mathematician and economist, known for his theory and development of techniques for the optimal allocation of resources. He is regarded as the founder of linear programm ...
in 1948.
It is similar to the form of the
Banach fixed-point theorem
In mathematics, the Banach fixed-point theorem (also known as the contraction mapping theorem or contractive mapping theorem or Banach–Caccioppoli theorem) is an important tool in the theory of metric spaces; it guarantees the existence and uniqu ...
, although it states existence and uniqueness of a
zero
0 (zero) is a number representing an empty quantity. Adding (or subtracting) 0 to any number leaves that number unchanged; in mathematical terminology, 0 is the additive identity of the integers, rational numbers, real numbers, and compl ...
rather than a
fixed point.
Newton's method constructs a sequence of points that under certain conditions will converge to a solution
of an equation
or a vector solution of a system of equation
. The Kantorovich theorem gives conditions on the initial point of this sequence. If those conditions are satisfied then a solution exists close to the initial point and the sequence converges to that point.
Assumptions
Let
be an open subset and
a
differentiable function
In mathematics, a differentiable function of one real variable is a function whose derivative exists at each point in its domain. In other words, the graph of a differentiable function has a non- vertical tangent line at each interior point in ...
with a
Jacobian that is locally
Lipschitz continuous
In mathematical analysis, Lipschitz continuity, named after Germany, German mathematician Rudolf Lipschitz, is a strong form of uniform continuity for function (mathematics), functions. Intuitively, a Lipschitz continuous function is limited in h ...
(for instance if
is twice differentiable). That is, it is assumed that for any
there is an open subset
such that
and there exists a constant
such that for any
:
holds. The norm on the left is the operator norm. In other words, for any vector
the inequality
:
must hold.
Now choose any initial point
. Assume that
is invertible and construct the Newton step
The next assumption is that not only the next point
but the entire ball
is contained inside the set
. Let
be the Lipschitz constant for the Jacobian over this ball (assuming it exists).
As a last preparation, construct recursively, as long as it is possible, the sequences
,
,
according to
:
Statement
Now if
then
#a solution
of
exists inside the closed ball
and
#the Newton iteration starting in
converges to
with at least linear order of convergence.
A statement that is more precise but slightly more difficult to prove uses the roots
of the quadratic polynomial
:
,
:
and their ratio
:
Then
#a solution
exists inside the closed ball
#it is unique inside the bigger ball
#and the convergence to the solution of
is dominated by the convergence of the Newton iteration of the quadratic polynomial
towards its smallest root
, if
, then
#:
#The quadratic convergence is obtained from the error estimate
#:
Corollary
In 1986, Yamamoto proved that the error evaluations of the Newton method such as Doring (1969), Ostrowski (1971, 1973), Gragg-Tapia (1974), Potra-Ptak (1980), Miel (1981), Potra (1984), can be derived from the Kantorovich theorem.
Generalizations
There is a
''q''-analog for the Kantorovich theorem. For other generalizations/variations, see Ortega & Rheinboldt (1970).
Applications
Oishi and Tanabe claimed that the Kantorovich theorem can be applied to obtain reliable solutions of
linear programming
Linear programming (LP), also called linear optimization, is a method to achieve the best outcome (such as maximum profit or lowest cost) in a mathematical model whose requirements and objective are represented by linear function#As a polynomia ...
.
References
Further reading
*
John H. Hubbard and
Barbara Burke Hubbard: ''Vector Calculus, Linear Algebra, and Differential Forms: A Unified Approach'', Matrix Editions,
preview of 3. edition and sample material including Kant.-thm.
* {{cite book , first=Tetsuro , last=Yamamoto , chapter=Historical Developments in Convergence Analysis for Newton's and Newton-like Methods , pages=241–263 , editor-first=C. , editor-last=Brezinski , editor2-first=L. , editor2-last=Wuytack , title=Numerical Analysis : Historical Developments in the 20th Century , publisher=North-Holland , year=2001 , isbn=0-444-50617-9
Functional analysis
Numerical analysis
Theorems in mathematical analysis
Optimization in vector spaces
Optimization algorithms and methods