In
mathematics
Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. There are many ar ...
and economics, transportation theory or transport theory is a name given to the study of optimal
transport
Transport (in British English) or transportation (in American English) is the intentional Motion, movement of humans, animals, and cargo, goods from one location to another. Mode of transport, Modes of transport include aviation, air, land tr ...
ation and
allocation of resources. The problem was formalized by the French
mathematician
A mathematician is someone who uses an extensive knowledge of mathematics in their work, typically to solve mathematical problems. Mathematicians are concerned with numbers, data, quantity, mathematical structure, structure, space, Mathematica ...
Gaspard Monge
Gaspard Monge, Comte de Péluse (; 9 May 1746 – 28 July 1818) was a French mathematician, commonly presented as the inventor of descriptive geometry, (the mathematical basis of) technical drawing, and the father of differential geometry. Dur ...
in 1781.
[G. Monge. ''Mémoire sur la théorie des déblais et des remblais. Histoire de l’Académie Royale des Sciences de Paris, avec les Mémoires de Mathématique et de Physique pour la même année'', pages 666–704, 1781.]
In the 1920s A.N. Tolstoi was one of the first to study the transportation problem
mathematically. In 1930, in the collection ''Transportation Planning Volume I'' for the National Commissariat of Transportation of the Soviet Union, he published a paper "Methods of Finding the Minimal Kilometrage in Cargo-transportation in space".
Major advances were made in the field during World War II by the
Soviet
The Union of Soviet Socialist Republics. (USSR), commonly known as the Soviet Union, was a List of former transcontinental countries#Since 1700, transcontinental country that spanned much of Eurasia from 1922 until Dissolution of the Soviet ...
mathematician and economist
Leonid Kantorovich.
[L. Kantorovich. ''On the translocation of masses.'' C.R. (Doklady) Acad. Sci. URSS (N.S.), 37:199–201, 1942.] Consequently, the problem as it is stated is sometimes known as the Monge–Kantorovich transportation problem.
The
linear programming
Linear programming (LP), also called linear optimization, is a method to achieve the best outcome (such as maximum profit or lowest cost) in a mathematical model whose requirements and objective are represented by linear function#As a polynomia ...
formulation of the transportation problem is also known as the
Hitchcock–
Koopmans transportation problem.
Motivation
Mines and factories
Suppose that we have a collection of
mines mining iron ore, and a collection of
factories which use the iron ore that the mines produce. Suppose for the sake of argument that these mines and factories form two
disjoint subset
In mathematics, a Set (mathematics), set ''A'' is a subset of a set ''B'' if all Element (mathematics), elements of ''A'' are also elements of ''B''; ''B'' is then a superset of ''A''. It is possible for ''A'' and ''B'' to be equal; if they a ...
s
and
of the
Euclidean plane
In mathematics, a Euclidean plane is a Euclidean space of Two-dimensional space, dimension two, denoted \textbf^2 or \mathbb^2. It is a geometric space in which two real numbers are required to determine the position (geometry), position of eac ...
. Suppose also that we have a ''cost function''
, so that
is the cost of transporting one shipment of iron from
to
. For simplicity, we ignore the time taken to do the transporting. We also assume that each mine can supply only one factory (no splitting of shipments) and that each factory requires precisely one shipment to be in operation (factories cannot work at half- or double-capacity). Having made the above assumptions, a ''transport plan'' is a bijection
.
In other words, each mine
supplies precisely one target factory
and each factory is supplied by precisely one mine.
We wish to find the ''optimal transport plan'', the plan
whose ''total cost''
:
is the least of all possible transport plans from
to
. This motivating special case of the transportation problem is an instance of the
assignment problem.
More specifically, it is equivalent to finding a minimum weight matching in a
bipartite graph.
Moving books: the importance of the cost function
The following simple example illustrates the importance of the
cost function in determining the optimal transport plan. Suppose that we have
books of equal width on a shelf (the
real line
A number line is a graphical representation of a straight line that serves as spatial representation of numbers, usually graduated like a ruler with a particular origin (geometry), origin point representing the number zero and evenly spaced mark ...
), arranged in a single contiguous block. We wish to rearrange them into another contiguous block, but shifted one book-width to the right. Two obvious candidates for the optimal transport plan present themselves:
# move all
books one book-width to the right ("many small moves");
# move the left-most book
book-widths to the right and leave all other books fixed ("one big move").
If the cost function is proportional to Euclidean distance (
for some
) then these two candidates are ''both'' optimal. If, on the other hand, we choose the
strictly convex cost function proportional to the square of Euclidean distance (
for some
), then the "many small moves" option becomes the unique minimizer.
Note that the above cost functions consider only the horizontal distance traveled by the books, not the horizontal distance traveled by a device used to pick each book up and move the book into position. If the latter is considered instead, then, of the two transport plans, the second is always optimal for the Euclidean distance, while, provided there are at least 3 books, the first transport plan is optimal for the squared Euclidean distance.
Hitchcock problem
The following transportation problem formulation is credited to
F. L. Hitchcock:
:Suppose there are
sources
for a commodity, with
units of supply at
and
sinks
for the commodity, with the demand
at
. If
is the unit cost of shipment from
to
, find a flow that satisfies demand from supplies and minimizes the flow cost. This challenge in logistics was taken up by
D. R. Fulkerson and in the book ''Flows in Networks'' (1962) written with
L. R. Ford Jr.
Tjalling Koopmans is also credited with formulations of
transport economics and allocation of resources.
Abstract formulation of the problem
Monge and Kantorovich formulations
The transportation problem as it is stated in modern or more technical literature looks somewhat different because of the development of
Riemannian geometry
Riemannian geometry is the branch of differential geometry that studies Riemannian manifolds, defined as manifold, smooth manifolds with a ''Riemannian metric'' (an inner product on the tangent space at each point that varies smooth function, smo ...
and
measure theory
In mathematics, the concept of a measure is a generalization and formalization of geometrical measures (length, area, volume) and other common notions, such as magnitude (mathematics), magnitude, mass, and probability of events. These seemingl ...
. The mines-factories example, simple as it is, is a useful reference point when thinking of the abstract case. In this setting, we allow the possibility that we may not wish to keep all mines and factories open for business, and allow mines to supply more than one factory, and factories to accept iron from more than one mine.
Let
and
be two
separable metric space
In mathematics, a metric space is a Set (mathematics), set together with a notion of ''distance'' between its Element (mathematics), elements, usually called point (geometry), points. The distance is measured by a function (mathematics), functi ...
s such that any
probability measure
In mathematics, a probability measure is a real-valued function defined on a set of events in a σ-algebra that satisfies Measure (mathematics), measure properties such as ''countable additivity''. The difference between a probability measure an ...
on
(or
) is a
Radon measure (i.e. they are
Radon spaces). Let
be a Borel-measurable function. Given probability measures
on
and
on
, Monge's formulation of the optimal transportation problem is to find a transport map
that realizes the infimum
:
where
denotes the pushforward measure, push forward of
by
. A map
that attains this infimum (''i.e.'' makes it a
minimum instead of an infimum) is called an "optimal transport map".
Monge's formulation of the optimal transportation problem can be ill-posed, because sometimes there is no
satisfying
: this happens, for example, when
is a
Dirac measure but
is not.
We can improve on this by adopting Kantorovich's formulation of the optimal transportation problem, which is to find a probability measure
on
that attains the infimum
:
where
denotes the collection of all probability measures on
with
marginals on
and
on
. It can be shown
[L. Ambrosio, N. Gigli & G. Savaré. ]
Gradient Flows in Metric Spaces and in the Space of Probability Measures
'' Lectures in Mathematics ETH Zürich, Birkhäuser Verlag, Basel. (2005) that a minimizer for this problem always exists when the cost function
is lower semi-continuous and
is a
tight collection of measures (which is guaranteed for Radon spaces
and
). (Compare this formulation with the definition of the
Wasserstein metric on the space of probability measures.) A gradient descent formulation for the solution of the Monge–Kantorovich problem was given by
Sigurd Angenent, Steven Haker, and
Allen Tannenbaum.
Duality formula
The minimum of the Kantorovich problem is equal to
:
where the
supremum runs over all pairs of
bounded and
continuous function
In mathematics, a continuous function is a function such that a small variation of the argument induces a small variation of the value of the function. This implies there are no abrupt changes in value, known as '' discontinuities''. More preci ...
s
and
such that
:
Economic interpretation
The economic interpretation is clearer if signs are flipped. Let
stand for the vector of characteristics of a worker,
for the vector of characteristics of a firm, and
for the economic output generated by worker
matched with firm
. Setting
and
, the Monge–Kantorovich problem
rewrites:
:
which has
dual:
:
where the infimum runs over bounded and continuous function
and
. If the dual problem has a solution, one can see that:
:
so that
interprets as the equilibrium wage of a worker of type
, and
interprets as the equilibrium profit of a firm of type
.
Solution of the problem
Optimal transportation on the real line
For
, let
denote the collection of
probability measure
In mathematics, a probability measure is a real-valued function defined on a set of events in a σ-algebra that satisfies Measure (mathematics), measure properties such as ''countable additivity''. The difference between a probability measure an ...
s on
that have finite
-th
moment. Let
and let
, where
is a convex function.
# If
has no atom (measure theory)">atom
Atoms are the basic particles of the chemical elements. An atom consists of a atomic nucleus, nucleus of protons and generally neutrons, surrounded by an electromagnetically bound swarm of electrons. The chemical elements are distinguished fr ...
is an optimal transport map. It is the unique optimal transport map if
The proof of this solution appears in Rachev & Rüschendorf (1998).
assignment. The objective function in the primal Kantorovich problem is then
:
this operation. In the
. As a result, setting
which can be readily inputted in a large-scale linear programming solver (see chapter 3.4 of Galichon (2016)
).
. In this case, we can see that the primal and dual Kantorovich problems respectively boil down to:
:
which is a finite-dimensional convex optimization problem that can be solved by standard techniques, such as
is a convex polyhedron. The resulting configuration is called a
.
is invertible. One then has
:
The proof of this solution appears in Galichon (2016).
. Let
. Then the Kantorovich problem has a unique solution
, and this solution is induced by an optimal transport map: i.e., there exists a Borel map
. (Here
Consider a variant of the discrete problem above, where we have added an entropic regularization term to the objective function of the primal problem
:
where, compared with the unregularized version, the "hard" constraint in the former dual (
terms). The optimality conditions in the dual problem can be expressed as
:
, solving the dual is therefore equivalent to looking for two diagonal positive matrices
. The existence of such matrices generalizes
to solve . Sinkhorn–Knopp's algorithm is therefore a
algorithm on the dual regularized problem.
The Monge–Kantorovich optimal transport has found applications in wide range in different fields. Among them are:
*
).