graph theory In mathematics and computer science, graph theory is the study of ''graph (discrete mathematics), graphs'', which are mathematical structures used to model pairwise relations between objects. A graph in this context is made up of ''Vertex (graph ...

, the shortest path problem is the problem of finding a

path A path is a route for physical travel – see Trail. Path or PATH may also refer to: Physical paths of different types * Bicycle path * Bridle path, used by people on horseback * Course (navigation), the intended path of a vehicle * Desir ...

between two vertices (or nodes) in a

graph Graph may refer to: Mathematics *Graph (discrete mathematics), a structure made of vertices and edges **Graph theory, the study of such graphs and their properties *Graph (topology), a topological space resembling a graph in the sense of discret ...

such that the sum of the weights of its constituent edges is minimized. The problem of finding the shortest path between two intersections on a road map may be modeled as a special case of the shortest path problem in graphs, where the vertices correspond to intersections and the edges correspond to road segments, each weighted by the length or distance of each segment.

Definition

The shortest path problem can be defined for graphs whether undirected,

directed Direct may refer to: Mathematics * Directed set, in order theory * Direct limit of (pre), sheaves * Direct sum of modules, a construction in abstract algebra which combines several vector spaces Computing * Direct access (disambiguation), a ...

, or mixed. The definition for undirected graphs states that every edge can be traversed in either direction. Directed graphs require that consecutive vertices be connected by an appropriate directed edge. Two vertices are adjacent when they are both incident to a common edge. A

in an undirected graph is a

sequence In mathematics, a sequence is an enumerated collection of objects in which repetitions are allowed and order matters. Like a set, it contains members (also called ''elements'', or ''terms''). The number of elements (possibly infinite) is cal ...

of vertices

P = ( v_1, v_2, \ldots, v_n ) \in V \times V \times \cdots \times V

such that

v_i

is adjacent to

v_

for

1 \leq i < n

. Such a path

P

is called a path of length

n-1

from

v_1

v_n

. (The

v_i

are variables; their numbering relates to their position in the sequence and need not relate to a canonical labeling.) Let

E = \

where

e_

is the edge incident to both

v_i

and

v_j

. Given a

real-valued In mathematics, value may refer to several, strongly related notions. In general, a mathematical value may be any definite mathematical object. In elementary mathematics, this is most often a number – for example, a real number such as or an ...

weight function

f: E \rightarrow \mathbb

, and an undirected (simple) graph

G

, the shortest path from

v

v'

is the path

P = ( v_1, v_2, \ldots, v_n )

(where

v_1 = v

and

v_n = v'

) that over all possible

n

minimizes the sum

\sum_^ f(e_).

When each edge in the graph has unit weight or

f: E \rightarrow \

, this is equivalent to finding the path with fewest edges. The problem is also sometimes called the single-pair shortest path problem, to distinguish it from the following variations: * The single-source shortest path problem, in which we have to find shortest paths from a source vertex ''v'' to all other vertices in the graph. * The single-destination shortest path problem, in which we have to find shortest paths from all vertices in the directed graph to a single destination vertex ''v''. This can be reduced to the single-source shortest path problem by reversing the arcs in the directed graph. * The all-pairs shortest path problem, in which we have to find shortest paths between every pair of vertices ''v'', ''v' '' in the graph. These generalizations have significantly more efficient algorithms than the simplistic approach of running a single-pair shortest path algorithm on all relevant pairs of vertices.

Algorithms

Several well-known algorithms exist for solving this problem and its variants. *

Dijkstra's algorithm Dijkstra's algorithm ( ) is an algorithm for finding the shortest paths between nodes in a weighted graph, which may represent, for example, a road network. It was conceived by computer scientist Edsger W. Dijkstra in 1956 and published three ...

solves the single-source shortest path problem with only non-negative edge weights. *

Bellman–Ford algorithm The Bellman–Ford algorithm is an algorithm that computes shortest paths from a single source vertex (graph theory), vertex to all of the other vertices in a weighted digraph. It is slower than Dijkstra's algorithm for the same problem, but more ...

solves the single-source problem if edge weights may be negative. * A* search algorithm solves for single-pair shortest path using heuristics to try to speed up the search. * Floyd–Warshall algorithm solves all pairs shortest paths. * Johnson's algorithm solves all pairs shortest paths, and may be faster than Floyd–Warshall on

sparse graph In mathematics, a dense graph is a Graph (discrete mathematics), graph in which the number of edges is close to the maximal number of edges (where every pair of Vertex (graph theory), vertices is connected by one edge). The opposite, a graph with ...

s. * Viterbi algorithm solves the shortest stochastic path problem with an additional probabilistic weight on each node. Additional algorithms and associated evaluations may be found in .

Single-source shortest paths

Undirected graphs

Unweighted graphs

Directed acyclic graphs

An algorithm using topological sorting can solve the single-source shortest path problem in time in arbitrarily-weighted directed acyclic graphs.

Directed graphs with nonnegative weights

The following table is taken from , with some corrections and additions. A green background indicates an asymptotically best bound in the table; ''L'' is the maximum length (or weight) among all edges, assuming integer edge weights.

Directed graphs with arbitrary weights without negative cycles

Directed graphs with arbitrary weights with negative cycles

Finds a negative cycle or calculates distances to all vertices.

Planar graphs with nonnegative weights

Applications

Network flows are a fundamental concept in graph theory and operations research, often used to model problems involving the transportation of goods, liquids, or information through a network. A network flow problem typically involves a directed graph where each edge represents a pipe, wire, or road, and each edge has a capacity, which is the maximum amount that can flow through it. The goal is to find a feasible flow that maximizes the flow from a source node to a sink node. Shortest Path Problems can be used to solve certain network flow problems, particularly when dealing with single-source, single-sink networks. In these scenarios, we can transform the network flow problem into a series of shortest path problems.

Transformation Steps

# Create a Residual Graph: #* For each edge (u, v) in the original graph, create two edges in the residual graph: #** (u, v) with capacity c(u, v) #** (v, u) with capacity 0 #* The residual graph represents the remaining capacity available in the network. # Find the Shortest Path: #* Use a shortest path algorithm (e.g., Dijkstra's algorithm, Bellman-Ford algorithm) to find the shortest path from the source node to the sink node in the residual graph. # Augment the Flow: #* Find the minimum capacity along the shortest path. #* Increase the flow on the edges of the shortest path by this minimum capacity. #* Decrease the capacity of the edges in the forward direction and increase the capacity of the edges in the backward direction. # Update the Residual Graph: #* Update the residual graph based on the augmented flow. # Repeat: #* Repeat steps 2-4 until no more paths can be found from the source to the sink.

All-pairs shortest paths

The all-pairs shortest path problem finds the shortest paths between every pair of vertices , in the graph. The all-pairs shortest paths problem for unweighted directed graphs was introduced by , who observed that it could be solved by a linear number of matrix multiplications that takes a total time of .

Undirected graph

Directed graph

Applications

Shortest path algorithms are applied to automatically find directions between physical locations, such as driving directions on

web mapping Web mapping or an online mapping is the process of using, creating, and distributing maps on the World Wide Web (the Web), usually through the use of Web GIS, Web geographic information systems (Web GIS). A web map or an online map is both served ...

websites like MapQuest or

Google Maps Google Maps is a web mapping platform and consumer application offered by Google. It offers satellite imagery, aerial photography, street maps, 360° interactive panorama, interactive panoramic views of streets (Google Street View, Street View ...

. For this application fast specialized algorithms are available. If one represents a nondeterministic

abstract machine In computer science, an abstract machine is a theoretical model that allows for a detailed and precise analysis of how a computer system functions. It is similar to a mathematical function in that it receives inputs and produces outputs based on p ...

as a graph where vertices describe states and edges describe possible transitions, shortest path algorithms can be used to find an optimal sequence of choices to reach a certain goal state, or to establish lower bounds on the time needed to reach a given state. For example, if vertices represent the states of a puzzle like a Rubik's Cube and each directed edge corresponds to a single move or turn, shortest path algorithms can be used to find a solution that uses the minimum possible number of moves. In a networking or

telecommunications Telecommunication, often used in its plural form or abbreviated as telecom, is the transmission of information over a distance using electronic means, typically through cables, radio waves, or other communication technologies. These means of ...

mindset, this shortest path problem is sometimes called the min-delay path problem and usually tied with a

widest path problem In graph algorithms, the widest path problem is the problem of finding a path between two designated vertices in a weighted graph, maximizing the weight of the minimum-weight edge in the path. The widest path problem is also known as the maxim ...

. For example, the algorithm may seek the shortest (min-delay) widest path, or widest shortest (min-delay) path. A more lighthearted application is the games of "

six degrees of separation Six degrees of separation is the idea that all people are six or fewer social connections away from each other. As a result, a chain of "friend of a friend" statements can be made to connect any two people in a maximum of six steps. It is al ...

" that try to find the shortest path in graphs like movie stars appearing in the same film. Other applications, often studied in

operations research Operations research () (U.S. Air Force Specialty Code: Operations Analysis), often shortened to the initialism OR, is a branch of applied mathematics that deals with the development and application of analytical methods to improve management and ...

, include plant and facility layout,

robotics Robotics is the interdisciplinary study and practice of the design, construction, operation, and use of robots. Within mechanical engineering, robotics is the design and construction of the physical structures of robots, while in computer s ...

transportation Transport (in British English) or transportation (in American English) is the intentional Motion, movement of humans, animals, and cargo, goods from one location to another. Mode of transport, Modes of transport include aviation, air, land tr ...

, and VLSI design.

Road networks

A road network can be considered as a graph with positive weights. The nodes represent road junctions and each edge of the graph is associated with a road segment between two junctions. The weight of an edge may correspond to the length of the associated road segment, the time needed to traverse the segment, or the cost of traversing the segment. Using directed edges it is also possible to model one-way streets. Such graphs are special in the sense that some edges are more important than others for long-distance travel (e.g. highways). This property has been formalized using the notion of highway dimension. There are a great number of algorithms that exploit this property and are therefore able to compute the shortest path a lot quicker than would be possible on general graphs. All of these algorithms work in two phases. In the first phase, the graph is preprocessed without knowing the source or target node. The second phase is the query phase. In this phase, source and target node are known. The idea is that the road network is static, so the preprocessing phase can be done once and used for a large number of queries on the same road network. The algorithm with the fastest known query time is called hub labeling and is able to compute shortest path on the road networks of Europe or the US in a fraction of a microsecond. Other techniques that have been used are: * ALT ( A* search, landmarks, and

triangle inequality In mathematics, the triangle inequality states that for any triangle, the sum of the lengths of any two sides must be greater than or equal to the length of the remaining side. This statement permits the inclusion of Degeneracy (mathematics)#T ...

) * Arc flags * Contraction hierarchies * Transit node routing * Reach-based pruning * Labeling * Hub labels

General algebraic framework on semirings: the algebraic path problem

Many problems can be framed as a form of the shortest path for some suitably substituted notions of addition along a path and taking the minimum. The general approach to these is to consider the two operations to be those of a

semiring In abstract algebra, a semiring is an algebraic structure. Semirings are a generalization of rings, dropping the requirement that each element must have an additive inverse. At the same time, semirings are a generalization of bounded distribu ...

. Semiring multiplication is done along the path, and the addition is between paths. This general framework is known as the algebraic path problem. Most of the classic shortest-path algorithms (and new ones) can be formulated as solving linear systems over such algebraic structures. More recently, an even more general framework for solving these (and much less obviously related problems) has been developed under the banner of valuation algebras.

Shortest path in stochastic time-dependent networks

In real-life, a transportation network is usually stochastic and time-dependent. The travel duration on a road segment depends on many factors such as the amount of traffic (origin-destination matrix), road work, weather, accidents and vehicle breakdowns. A more realistic model of such a road network is a stochastic time-dependent (STD) network. There is no accepted definition of optimal path under uncertainty (that is, in stochastic road networks). It is a controversial subject, despite considerable progress during the past decade. One common definition is a path with the minimum expected travel time. The main advantage of this approach is that it can make use of efficient shortest path algorithms for deterministic networks. However, the resulting optimal path may not be reliable, because this approach fails to address travel time variability. To tackle this issue, some researchers use travel duration distribution instead of its expected value. So, they find the probability distribution of total travel duration using different optimization methods such as dynamic programming and

. These methods use

stochastic optimization Stochastic optimization (SO) are optimization methods that generate and use random variables. For stochastic optimization problems, the objective functions or constraints are random. Stochastic optimization also include methods with random iter ...

, specifically stochastic dynamic programming to find the shortest path in networks with probabilistic arc length. The terms ''travel time reliability'' and ''travel time variability'' are used as opposites in the transportation research literature: the higher the variability, the lower the reliability of predictions. To account for variability, researchers have suggested two alternative definitions for an optimal path under uncertainty. The ''most reliable path'' is one that maximizes the probability of arriving on time given a travel time budget. An ''α-reliable path'' is one that minimizes the travel time budget required to arrive on time with a given probability.

References

Notes

Bibliography

* * * * * * * * * * * * * * * * * * * * * * * * * * * * Attributes Dijkstra's algorithm to Minty ("private communication") on p. 225. * * * * * * *

Definition

Algorithms

Single-source shortest paths

Undirected graphs

Unweighted graphs

Directed acyclic graphs

Directed graphs with nonnegative weights

Directed graphs with arbitrary weights without negative cycles

Directed graphs with arbitrary weights with negative cycles

Planar graphs with nonnegative weights

Applications

Transformation Steps

All-pairs shortest paths

Undirected graph

Directed graph

Applications

Road networks

Related problems

Paths with constraints

Partial observability

Strategic shortest paths

Negative cycle detection

General algebraic framework on semirings: the algebraic path problem

Shortest path in stochastic time-dependent networks

See also

References

Notes

Bibliography

Further reading