In
topological data analysis
In applied mathematics, topological data analysis (TDA) is an approach to the analysis of datasets using techniques from topology. Extraction of information from datasets that are high-dimensional, incomplete and noisy is generally challenging. TDA ...
, persistent homology is a method for computing topological features of a space at different spatial resolutions. More persistent features are detected over a wide range of spatial scales and are deemed more likely to represent true features of the underlying space rather than artifacts of sampling, noise, or particular choice of parameters.
To find the persistent homology of a space, the space must first be represented as a
simplicial complex
In mathematics, a simplicial complex is a structured Set (mathematics), set composed of Point (geometry), points, line segments, triangles, and their ''n''-dimensional counterparts, called Simplex, simplices, such that all the faces and intersec ...
. A distance function on the underlying space corresponds to a
filtration
Filtration is a physical separation process that separates solid matter and fluid from a mixture using a ''filter medium'' that has a complex structure through which only the fluid can pass. Solid particles that cannot pass through the filte ...
of the simplicial complex, that is a nested sequence of increasing subsets. One common method of doing this is via taking the sublevel filtration of the distance to a
point cloud
A point cloud is a discrete set of data Point (geometry), points in space. The points may represent a 3D shape or object. Each point Position (geometry), position has its set of Cartesian coordinates (X, Y, Z). Points may contain data other than ...
, or equivalently, the
offset filtration
The offset filtration (also called the "union-of-balls" or "union-of-disks" filtration) is a growing sequence of metric balls used to detect the size and scale of topological features of a data set. The offset filtration commonly arises in persis ...
on the point cloud and taking its
nerve
A nerve is an enclosed, cable-like bundle of nerve fibers (called axons). Nerves have historically been considered the basic units of the peripheral nervous system. A nerve provides a common pathway for the Electrochemistry, electrochemical nerv ...
in order to get the simplicial filtration known as
ÄŚech
Čech (feminine Čechová) is a Czech surname meaning Czech. It was used to distinguish an inhabitant of Bohemia from Slovaks, Moravians and other ethnic groups. Notable people with the surname include:
* Dana Čechová (born 1983), Czech tab ...
filtration. A similar construction uses a nested sequence of
Vietoris–Rips complex
In topology, the Vietoris–Rips complex, also called the Vietoris complex or Rips complex, is a way of forming a topological space from distances in a set of points. It is an abstract simplicial complex that can be defined from any metric space ...
es known as the
Vietoris–Rips filtration.
Definition
Formally, consider a real-valued function on a simplicial complex
that is non-decreasing on increasing sequences of faces, so
whenever
is a face of
in
. Then for every
the
sublevel set is a subcomplex of ''K'', and the ordering of the values of
on the simplices in
(which is in practice always finite) induces an ordering on the sublevel complexes that defines a filtration
:
When
, the inclusion
induces a
group homomorphism, homomorphism on the
simplicial homology
In algebraic topology, simplicial homology is the sequence of homology groups of a simplicial complex. It formalizes the idea of the number of holes of a given dimension in the complex. This generalizes the number of connected component (topology), ...
groups for each dimension
. The
persistent homology groups are the images of these homomorphisms, and the
persistent Betti numbers are the
ranks of those groups. Persistent Betti numbers for
coincide with
the
size function, a predecessor of persistent homology.
Any filtered complex over a field
can be brought by a linear transformation preserving the filtration to so called canonical form, a canonically defined direct sum of filtered complexes of two types: one-dimensional complexes with trivial differential
and two-dimensional complexes with trivial homology
.
A
persistence module over a
partially ordered set
is a set of vector spaces
indexed by
, with a linear map
whenever
, with
equal to the identity and
for
. Equivalently, we may consider it as a
functor
In mathematics, specifically category theory, a functor is a Map (mathematics), mapping between Category (mathematics), categories. Functors were first considered in algebraic topology, where algebraic objects (such as the fundamental group) ar ...
from
considered as a category to the category of vector spaces (or
-modules). There is a classification of persistence modules over a field
indexed by
:
Multiplication by
corresponds to moving forward one step in the persistence module. Intuitively, the free parts on the right side correspond to the homology generators that appear at filtration level
and never disappear, while the torsion parts correspond to those that appear at filtration level
and last for
steps of the filtration (or equivalently, disappear at filtration level
).
Each of these two theorems allows us to uniquely represent the persistent homology of a filtered simplicial complex with a persistence barcode or persistence diagram. A barcode represents each persistent generator with a horizontal line beginning at the first filtration level where it appears, and ending at the filtration level where it disappears, while a persistence diagram plots a point for each generator with its x-coordinate the birth time and its y-coordinate the death time.
Equivalently the same data is represented by Barannikov's canonical form,
where each generator is represented by a segment connecting the birth and the death values plotted on separate lines for each
.
Stability
Persistent homology is stable in a precise sense, which provides robustness against noise. The bottleneck distance is a natural metric on the space of persistence diagrams given by
where
ranges over bijections. A small perturbation in the input filtration leads to a small perturbation of its persistence diagram in the bottleneck distance. For concreteness, consider a filtration on a space
homeomorphic to a simplicial complex determined by the sublevel sets of a continuous tame function
. The map
taking
to the persistence diagram of its
th homology is 1-
Lipschitz with respect to the
-metric on functions and the bottleneck distance on persistence diagrams.
That is,
.
Computation
The principal algorithm is based on the bringing of the filtered complex to its canonical form by upper-triangular matrices and runs in worst-case cubical complexity in the number of simplices.
The fastest known algorithm for computing persistent homology runs in matrix multiplication time.
Since the number of simplices is highly relevant for computation time, finding filtered simplicial complexes with few simplexes is an active research area. Several approaches have been proposed to reduce the number of simplices in a filtered simplicial complex in order to approximate persistent homology.
Software
There are various software packages for computing persistence intervals of a finite filtration.
See also
*
Computational topology Algorithmic topology, or computational topology, is a subfield of topology with an overlap with areas of computer science, in particular, computational geometry and computational complexity theory.
A primary concern of algorithmic topology, as its ...
References
{{reflist
Homology theory
Computational topology