
Interior-point methods (also referred to as barrier methods or IPMs) are a certain class of algorithms that solve linear and nonlinear convex optimization problems.
An interior point method was discovered by Soviet mathematician I. I. Dikin in 1967 and reinvented in the U.S. in the mid-1980s.
In 1984, Narendra Karmarkar developed a method for linear programming called Karmarkar's algorithm, which runs in provably polynomial time and is also very efficient in practice. It enabled solutions of linear programming problems that were beyond the capabilities of the simplex method. In contrast to the simplex method, it reaches an optimal solution by traversing the interior of the feasible region.
The method can be generalized to convex programming based on a self-concordant barrier function used to encode the convex set.
Any convex optimization problem can be transformed into minimizing (or maximizing) a linear function over a convex set by converting to the epigraph form.
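As an illustration of the epigraph reformulation (a generic example, with symbols chosen here for exposition): minimizing a convex function $f$ over a convex set $C$,
:$\min_{x \in C} f(x),$
is equivalent to minimizing the linear objective $t$ over the convex set $\{(x, t) : x \in C,\ f(x) \le t\}$:
:$\min_{(x, t)}\ t \quad \text{subject to } f(x) \le t,\ x \in C.$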
The idea of encoding the feasible set using a barrier and designing barrier methods was studied by Anthony V. Fiacco, Garth P. McCormick, and others in the early 1960s. These ideas were mainly developed for general nonlinear programming, but they were later abandoned due to the presence of more competitive methods for this class of problems (e.g. sequential quadratic programming).
Yurii Nesterov and Arkadi Nemirovski came up with a special class of such barriers that can be used to encode any convex set. They guarantee that the number of iterations of the algorithm is bounded by a polynomial in the dimension and accuracy of the solution.
Karmarkar's breakthrough revitalized the study of interior-point methods and barrier problems, showing that it was possible to create an algorithm for linear programming characterized by polynomial complexity and, moreover, one that was competitive with the simplex method. Khachiyan's ellipsoid method was already a polynomial-time algorithm, but it was too slow to be of practical interest.
The class of primal-dual path-following interior-point methods is considered the most successful.
Mehrotra's predictor–corrector algorithm provides the basis for most implementations of this class of methods.
Primal-dual interior-point method for nonlinear optimization
The primal-dual method's idea is easy to demonstrate for constrained nonlinear optimization.
For simplicity, consider the all-inequality version of a nonlinear optimization problem:
:minimize $f(x)$ subject to $c_i(x) \ge 0$ for $i = 1, \ldots, m$,   (1)
where $x \in \mathbb{R}^n$, $f : \mathbb{R}^n \to \mathbb{R}$, and $c_i : \mathbb{R}^n \to \mathbb{R}$.
This inequality-constrained optimization problem is then solved by converting it into an unconstrained objective function whose minimum we hope to find efficiently.
Specifically, the logarithmic barrier function associated with (1) is
:$B(x, \mu) = f(x) - \mu \sum_{i=1}^m \log\bigl(c_i(x)\bigr).$   (2)
Here $\mu$ is a small positive scalar, sometimes called the "barrier parameter". As $\mu$ converges to zero the minimum of $B(x, \mu)$ should converge to a solution of (1).
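For example, consider the one-dimensional problem of minimizing $f(x) = x$ subject to $x \ge 0$ (a toy instance chosen here for illustration). Its logarithmic barrier function is
:$B(x, \mu) = x - \mu \log x,$
whose unconstrained minimizer satisfies $B'(x) = 1 - \mu/x = 0$, i.e. $x^*(\mu) = \mu$. As $\mu \to 0$ this minimizer tends to the constrained solution $x^* = 0$ while remaining strictly feasible for every $\mu > 0$.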
The barrier function gradient is
:$g_B = g - \mu \sum_{i=1}^m \frac{1}{c_i(x)} \nabla c_i(x),$   (3)
where $g$ is the gradient of the original function $f(x)$, and $\nabla c_i$ is the gradient of $c_i$.
In addition to the original ("primal") variable $x$ we introduce a Lagrange multiplier-inspired dual variable $\lambda \in \mathbb{R}^m$ satisfying
:$c_i(x)\,\lambda_i = \mu, \quad \forall i = 1, \ldots, m.$   (4)
Equation (4) is sometimes called the "perturbed complementarity" condition, for its resemblance to "complementary slackness" in the KKT conditions.
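Indeed, the complementary-slackness part of the KKT conditions requires $c_i(x)\,\lambda_i = 0$ for every $i$; condition (4) replaces the zero by the small positive parameter $\mu$, which forces $c_i(x) > 0$ and $\lambda_i > 0$ and thus keeps the iterates strictly in the interior of the feasible region.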
We try to find those $(x_\mu, \lambda_\mu)$ for which the gradient of the barrier function is zero.
Applying (4) to (3), we get an equation for the gradient:
:$g - A^T \lambda = 0,$   (5)
where the matrix $A$ is the Jacobian of the constraints $c(x)$.
The intuition behind (5) is that the gradient of $f(x)$ should lie in the subspace spanned by the constraints' gradients. The "perturbed complementarity" with small $\mu$ (4) can be understood as the condition that the solution should either lie near the boundary $c_i(x) = 0$, or that the projection of the gradient $g$ on the constraint normal $\nabla c_i(x)$ should be almost zero.
Applying Newton's method to (4) and (5), we get an equation for the $(x, \lambda)$ update $(p_x, p_\lambda)$:
:$\begin{pmatrix} W & -A^T \\ \Lambda A & C \end{pmatrix} \begin{pmatrix} p_x \\ p_\lambda \end{pmatrix} = \begin{pmatrix} -g + A^T \lambda \\ \mu \mathbf{1} - C \lambda \end{pmatrix},$
where $W = \nabla^2 f(x) - \sum_{i=1}^m \lambda_i \nabla^2 c_i(x)$ is the Hessian matrix of the Lagrangian, $\Lambda$ is the diagonal matrix of $\lambda$, $C$ is the diagonal matrix with $C_{ii} = c_i(x)$, and $\mathbf{1}$ is the vector of ones.
Because of (1) and (4), the condition
:$\lambda \ge 0$
should be enforced at each step. This can be done by choosing an appropriate step size $\alpha$:
:$(x, \lambda) \to (x + \alpha p_x,\ \lambda + \alpha p_\lambda).$
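The iteration described above can be written compactly in code. The following NumPy sketch is illustrative only: the toy objective $f(x) = (x_0-2)^2 + (x_1-1)^2$, the linear constraints, the starting point, the simple halving rule for choosing $\alpha$, and the fixed schedule for shrinking $\mu$ are assumptions made for this example (as are helper names such as primal_dual_step); practical solvers, such as Mehrotra-style predictor–corrector implementations, add line searches, feasibility safeguards, and adaptive updates of $\mu$.

import numpy as np

# Toy problem: minimize f(x) = (x0 - 2)^2 + (x1 - 1)^2
# subject to c1(x) = x0 >= 0, c2(x) = x1 >= 0, c3(x) = 4 - x0 - x1 >= 0.
def f_grad(x):                      # gradient g of f
    return np.array([2.0 * (x[0] - 2.0), 2.0 * (x[1] - 1.0)])

def f_hess(x):                      # Hessian of f
    return 2.0 * np.eye(2)

def c(x):                           # constraint values c_i(x)
    return np.array([x[0], x[1], 4.0 - x[0] - x[1]])

def c_jac(x):                       # Jacobian A of the constraints
    return np.array([[1.0, 0.0],
                     [0.0, 1.0],
                     [-1.0, -1.0]])

def primal_dual_step(x, lam, mu):
    """One damped Newton step on equations (4)-(5)."""
    g, A = f_grad(x), c_jac(x)
    C, Lam = np.diag(c(x)), np.diag(lam)
    # The constraints are linear here, so their second derivatives vanish
    # and W reduces to the Hessian of f.
    W = f_hess(x)
    KKT = np.block([[W, -A.T],
                    [Lam @ A, C]])
    rhs = np.concatenate([-g + A.T @ lam, mu * np.ones(3) - C @ lam])
    p = np.linalg.solve(KKT, rhs)
    p_x, p_lam = p[:2], p[2:]
    # Shrink alpha until c(x + alpha p_x) > 0 and lambda + alpha p_lam > 0,
    # keeping the iterate strictly interior.
    alpha = 1.0
    while np.any(c(x + alpha * p_x) <= 0) or np.any(lam + alpha * p_lam <= 0):
        alpha *= 0.5
    return x + alpha * p_x, lam + alpha * p_lam

x, lam, mu = np.array([1.0, 1.0]), np.ones(3), 1.0
for _ in range(30):                 # shrink mu and take one Newton step per value
    x, lam = primal_dual_step(x, lam, mu)
    mu *= 0.5
print(x)                            # should approach the minimizer near (2, 1)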
See also
* Affine scaling
* Augmented Lagrangian method
* Penalty method
* Karush–Kuhn–Tucker conditions