Topological deep learning (TDL) is a research field that extends

deep learning Deep learning is a subset of machine learning that focuses on utilizing multilayered neural networks to perform tasks such as classification, regression, and representation learning. The field takes inspiration from biological neuroscience a ...

to handle complex,

non-Euclidean In mathematics, non-Euclidean geometry consists of two geometries based on axioms closely related to those that specify Euclidean geometry. As Euclidean geometry lies at the intersection of metric geometry and affine geometry, non-Euclidean geo ...

data structures. Traditional deep learning models, such as

convolutional neural networks A convolutional neural network (CNN) is a type of feedforward neural network that learns features via filter (or kernel) optimization. This type of deep learning network has been applied to process and make predictions from many different type ...

(CNNs) and

recurrent neural networks Recurrent neural networks (RNNs) are a class of artificial neural networks designed for processing sequential data, such as text, speech, and time series, where the order of elements is important. Unlike feedforward neural networks, which proces ...

(RNNs), excel in processing data on regular grids and sequences. However, scientific and real-world data often exhibit more intricate data domains encountered in scientific computations , including

point cloud A point cloud is a discrete set of data Point (geometry), points in space. The points may represent a 3D shape or object. Each point Position (geometry), position has its set of Cartesian coordinates (X, Y, Z). Points may contain data other than ...

s, meshes,

time series In mathematics, a time series is a series of data points indexed (or listed or graphed) in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. ...

scalar field In mathematics and physics, a scalar field is a function associating a single number to each point in a region of space – possibly physical space. The scalar may either be a pure mathematical number ( dimensionless) or a scalar physical ...

graphs Graph may refer to: Mathematics *Graph (discrete mathematics), a structure made of vertices and edges **Graph theory, the study of such graphs and their properties * Graph (topology), a topological space resembling a graph in the sense of discre ...

, or general

topological spaces In mathematics, a topological space is, roughly speaking, a geometrical space in which closeness is defined but cannot necessarily be measured by a numeric distance. More specifically, a topological space is a set whose elements are called point ...

simplicial complex In mathematics, a simplicial complex is a structured Set (mathematics), set composed of Point (geometry), points, line segments, triangles, and their ''n''-dimensional counterparts, called Simplex, simplices, such that all the faces and intersec ...

es and

CW complex In mathematics, and specifically in topology, a CW complex (also cellular complex or cell complex) is a topological space that is built by gluing together topological balls (so-called ''cells'') of different dimensions in specific ways. It generali ...

es. TDL addresses this by incorporating topological concepts to process data with higher-order relationships, such as interactions among multiple entities and complex hierarchies. This approach leverages structures like

simplicial complexes In mathematics, a simplicial complex is a structured set composed of points, line segments, triangles, and their ''n''-dimensional counterparts, called simplices, such that all the faces and intersections of the elements are also included in ...

and

hypergraphs In mathematics, a hypergraph is a generalization of a graph in which an edge can join any number of vertices. In contrast, in an ordinary graph, an edge connects exactly two vertices. Formally, a directed hypergraph is a pair (X,E), where X is ...

to capture global dependencies and qualitative spatial properties, offering a more nuanced representation of data. TDL also encompasses methods from

computational A computation is any type of arithmetic or non-arithmetic calculation that is well-defined. Common examples of computation are mathematical equation solving and the execution of computer algorithms. Mechanical or electronic devices (or, historic ...

and

algebraic topology Algebraic topology is a branch of mathematics that uses tools from abstract algebra to study topological spaces. The basic goal is to find algebraic invariant (mathematics), invariants that classification theorem, classify topological spaces up t ...

that permit studying properties of

neural networks A neural network is a group of interconnected units called neurons that send signals to one another. Neurons can be either Cell (biology), biological cells or signal pathways. While individual neurons are simple, many of them together in a netwo ...

and their training process, such as their predictive performance or generalization properties. The mathematical foundations of TDL are

differential topology In mathematics, differential topology is the field dealing with the topological properties and smooth properties of smooth manifolds. In this sense differential topology is distinct from the closely related field of differential geometry, which ...

, and

geometric topology In mathematics, geometric topology is the study of manifolds and Map (mathematics)#Maps as functions, maps between them, particularly embeddings of one manifold into another. History Geometric topology as an area distinct from algebraic topo ...

. Therefore, TDL can be generalized for data on

differentiable manifolds In mathematics, a differentiable manifold (also differential manifold) is a type of manifold that is locally similar enough to a vector space to allow one to apply calculus. Any manifold can be described by a collection of charts (atlas). One may ...

knots A knot is a fastening in rope or interwoven lines. Knot or knots may also refer to: Other common meanings * Knot (unit), of speed * Knot (wood), a timber imperfection Arts, entertainment, and media Films * ''Knots'' (film), a 2004 film * ''Kn ...

, links, tangles, curves, etc.

History and motivation

Traditional techniques from

often operate under the assumption that a dataset is residing in a highly-structured space (like

image An image or picture is a visual representation. An image can be Two-dimensional space, two-dimensional, such as a drawing, painting, or photograph, or Three-dimensional space, three-dimensional, such as a carving or sculpture. Images may be di ...

s, where

convolutional neural network A convolutional neural network (CNN) is a type of feedforward neural network that learns features via filter (or kernel) optimization. This type of deep learning network has been applied to process and make predictions from many different ty ...

s exhibit outstanding performance over alternative methods) or a

Euclidean space Euclidean space is the fundamental space of geometry, intended to represent physical space. Originally, in Euclid's ''Elements'', it was the three-dimensional space of Euclidean geometry, but in modern mathematics there are ''Euclidean spaces ...

. The prevalence of new types of data, in particular

, meshes, and

molecule A molecule is a group of two or more atoms that are held together by Force, attractive forces known as chemical bonds; depending on context, the term may or may not include ions that satisfy this criterion. In quantum physics, organic chemi ...

s, resulted in the development of new techniques, culminating in the field of geometric deep learning, which originally proposed a signal-processing perspective for treating such data types. While originally confined to graphs, where connectivity is defined based on nodes and edges, follow-up work extended concepts to a larger variety of data types, including

es and

es, with recent work proposing a unified perspective of

message-passing In computer science, message passing is a technique for invoking behavior (i.e., running a program) on a computer. The invoking program sends a message to a process (which may be an actor or object) and relies on that process and its supporting ...

on general combinatorial complexes. An independent perspective on different types of data originated from

topological data analysis In applied mathematics, topological data analysis (TDA) is an approach to the analysis of datasets using techniques from topology. Extraction of information from datasets that are high-dimensional, incomplete and noisy is generally challenging. TDA ...

, which proposed a new framework for describing structural information of data, i.e., their "shape," that is inherently aware of multiple scales in data, ranging from ''local'' information to ''global'' information. While at first restricted to smaller datasets, subsequent work developed new descriptors that efficiently summarized topological information of datasets to make them available for traditional machine-learning techniques, such as

support vector machine In machine learning, support vector machines (SVMs, also support vector networks) are supervised max-margin models with associated learning algorithms that analyze data for classification and regression analysis. Developed at AT&T Bell Laborato ...

s or

random forest Random forests or random decision forests is an ensemble learning method for statistical classification, classification, regression analysis, regression and other tasks that works by creating a multitude of decision tree learning, decision trees ...

s. Such descriptors ranged from new techniques for feature engineering over new ways of providing suitable

coordinates In geometry, a coordinate system is a system that uses one or more numbers, or coordinates, to uniquely determine and standardize the Position (geometry), position of the Point (geometry), points or other geometric elements on a manifold such as ...

for topological descriptors, or the creation of more efficient dissimilarity measures. Contemporary research in this field is largely concerned with either integrating information about the underlying data topology into existing deep-learning models or obtaining novel ways of training on topological domains.

Learning on topological spaces

Focusing on

topology Topology (from the Greek language, Greek words , and ) is the branch of mathematics concerned with the properties of a Mathematical object, geometric object that are preserved under Continuous function, continuous Deformation theory, deformat ...

in the sense of

point set topology In mathematics, general topology (or point set topology) is the branch of topology that deals with the basic set-theoretic definitions and constructions used in topology. It is the foundation of most other branches of topology, including differ ...

, an active branch of TDL is concerned with learning ''on'' topological spaces, that is, on different topological domains.

An introduction to topological domains

One of the core concepts in topological deep learning is the domain upon which this data is defined and supported. In case of Euclidean data, such as images, this domain is a grid, upon which the pixel value of the image is supported. In a more general setting this domain might be a topological domain. Next, we introduce the most common topological domains that are encountered in a deep learning setting. These domains include, but not limited to, graphs, simplicial complexes, cell complexes, combinatorial complexes and hypergraphs. Given a finite set S of abstract entities, a neighborhood function

\mathcal

on S is an assignment that attach to every point

x

in S a subset of S or a relation. Such a function can be induced by equipping S with an ''auxiliary structure''. Edges provide one way of defining relations among the entities of S. More specifically, edges in a graph allow one to define the notion of neighborhood using, for instance, the one hop neighborhood notion. Edges however, limited in their modeling capacity as they can only be used to model ''binary relations'' among entities of S since every edge is connected typically to two entities. In many applications, it is desirable to permit relations that incorporate more than two entities. The idea of using relations that involve more than two entities is central to topological domains. Such higher-order relations allow for a broader range of neighborhood functions to be defined on S to capture multi-way interactions among entities of S. Next we review the main properties, advantages, and disadvantages of some commonly studied topological domains in the context of deep learning, including (abstract) simplicial complexes, regular cell complexes, hypergraphs, and combinatorial complexes. Higher order networks

Comparisons among topological domains

Each of the enumerated topological domains has its own characteristics, advantages, and limitations: *

Simplicial complex In mathematics, a simplicial complex is a structured Set (mathematics), set composed of Point (geometry), points, line segments, triangles, and their ''n''-dimensional counterparts, called Simplex, simplices, such that all the faces and intersec ...

es ** Simplest form of higher-order domains. ** Extensions of graph-based models. ** Admit hierarchical structures, making them suitable for various applications. ** Hodge theory can be naturally defined on simplicial complexes. ** Require relations to be subsets of larger relations, imposing constraints on the structure. * Cell Complexes ** Generalize simplicial complexes. ** Provide more flexibility in defining higher-order relations. ** Each cell in a cell complex is homeomorphic to an open ball, attached together via attaching maps. ** Boundary cells of each cell in a cell complex are also cells in the complex. ** Represented combinatorially via incidence matrices. *

Hypergraphs In mathematics, a hypergraph is a generalization of a graph in which an edge can join any number of vertices. In contrast, in an ordinary graph, an edge connects exactly two vertices. Formally, a directed hypergraph is a pair (X,E), where X is ...

** Allow arbitrary set-type relations among entities. ** Relations are not imposed by other relations, providing more flexibility. ** Do not explicitly encode the dimension of cells or relations. ** Useful when relations in the data do not adhere to constraints imposed by other models like simplicial and cell complexes. * Combinatorial Complexes : ** Generalize and bridge the gaps between simplicial complexes, cell complexes, and hypergraphs. ** Allow for hierarchical structures and set-type relations. ** Combine features of other complexes while providing more flexibility in modeling relations. ** Can be represented combinatorially, similar to cell complexes.

Hierarchical structure and set-type relations

The properties of simplicial complexes, cell complexes, and hypergraphs give rise to two main features of relations on higher-order domains, namely hierarchies of relations and set-type relations.

= Rank function

= A rank function on a higher-order domain X is an order-preserving function rk: X → Z, where rk(''x'') attaches a non-negative integer value to each relation ''x'' in X, preserving set inclusion in X. Cell and simplicial complexes are common examples of higher-order domains equipped with rank functions and therefore with hierarchies of relations.

= Set-type relations

= Relations in a higher-order domain are called set-type relations if the existence of a relation is not implied by another relation in the domain. Hypergraphs constitute examples of higher-order domains equipped with set-type relations. Given the modeling limitations of simplicial complexes, cell complexes, and hypergraphs, we develop the combinatorial complex, a higher-order domain that features both hierarchies of relations and set-type relations. The learning tasks in TDL can be broadly classified into three categories: * Cell classification: Predict targets for each cell in a complex. Examples include triangular mesh segmentation, where the task is to predict the class of each face or edge in a given mesh. * Complex classification: Predict targets for an entire complex. For example, predict the class of each input mesh. * Cell prediction: Predict properties of cell-cell interactions in a complex, and in some cases, predict whether a cell exists in the complex. An example is the prediction of linkages among entities in hyperedges of a hypergraph. In practice, to perform the aforementioned tasks, deep learning models designed for specific topological spaces must be constructed and implemented. These models, known as topological neural networks, are tailored to operate effectively within these spaces.

Topological neural networks

Central to TDL are topological neural networks (TNNs), specialized architectures designed to operate on data structured in topological domains. Unlike traditional neural networks tailored for grid-like structures, TNNs are adept at handling more intricate data representations, such as graphs, simplicial complexes, and cell complexes. By harnessing the inherent topology of the data, TNNs can capture both local and global relationships, enabling nuanced analysis and interpretation.

Message passing topological neural networks

In a general topological domain, higher-order message passing involves exchanging messages among entities and cells using a set of neighborhood functions. Definition: Higher-Order Message Passing on a General Topological Domain Higher order message passing

Let

\mathcal

be a topological domain. We define a set of neighborhood functions

\mathcal=\

\mathcal

. Consider a cell

x

and let

y\in \mathcal_k(x)

for some

\mathcal_k \in \mathcal

. A message

m_

between cells

x

and

y

is a computation dependent on these two cells or the data supported on them. Denote

\mathcal(x)

as the multi-set

\

, and let

\mathbf_x^

represent some data supported on cell

x

at layer

l

. Higher-order message passing on

\mathcal

, induced by

\mathcal

, is defined by the following four update rules: #

m_ = \alpha_(\mathbf_x^,\mathbf_y^)

m_^k =  \bigoplus_  m_

, where

\bigoplus

is the intra-neighborhood aggregation function. #

m_ =  \bigotimes_ m_x^k

, where

\bigotimes

is the inter-neighborhood aggregation function. #

\mathbf_x^ = \beta (\mathbf_x^, m_x)

, where

\alpha_,\beta

are differentiable functions. Some remarks on Definition above are as follows. First, Equation 1 describes how messages are computed between cells

x

and

y

. The message

m_

is influenced by both the data

\mathbf_x^

and

\mathbf_y^

associated with cells

x

and

y

, respectively. Additionally, it incorporates characteristics specific to the cells themselves, such as orientation in the case of cell complexes. This allows for a richer representation of spatial relationships compared to traditional graph-based message passing frameworks. Second, Equation 2 defines how messages from neighboring cells are aggregated within each neighborhood. The function

\bigoplus

aggregates these messages, allowing information to be exchanged effectively between adjacent cells within the same neighborhood. Third, Equation 3 outlines the process of combining messages from different neighborhoods. The function

\bigotimes

aggregates messages across various neighborhoods, facilitating communication between cells that may not be directly connected but share common neighborhood relationships. Fourth, Equation 4 specifies how the aggregated messages influence the state of a cell in the next layer. Here, the function

\beta

updates the state of cell

x

based on its current state

\mathbf_x^

and the aggregated message

m_x

obtained from neighboring cells.

Non-message passing topological neural networks

While the majority of TNNs follow the message passing paradigm from graph learning, several models have been suggested that do not follow this approach. For instance, Maggs et al. leverage geometric information from embedded simplicial complexes, i.e., simplicial complexes with high-dimensional features attached to their vertices.This offers interpretability and geometric consistency ''without'' relying on message passing. Furthermore, in a contrastive loss-based method was suggested to learn the simplicial representation.

Learning on topological descriptors

Motivated by the modular nature of

deep neural networks Deep learning is a subset of machine learning that focuses on utilizing multilayered neural networks to perform tasks such as classification, regression, and representation learning. The field takes inspiration from biological neuroscience a ...

, initial work in TDL drew inspiration from

, and aimed to make the resulting descriptors amenable to integration into deep-learning models. This led to work defining new

layers Layer or layered may refer to: Arts, entertainment, and media * ''Layers'' (Kungs album) * ''Layers'' (Les McCann album) * ''Layers'' (Royce da 5′9″ album) *“Layers”, the title track of Royce da 5′9″’s sixth studio album * Layer, a ...

for deep neural networks. Pioneering work by Hofer et al., for instance, introduced a layer that permitted topological descriptors like persistence diagrams or persistence barcodes to be integrated into a deep neural network. This was achieved by means of end-to-end-trainable projection functions, permitting topological features to be used to solve shape classification tasks, for instance. Follow-up work expanded more on the theoretical properties of such descriptors and integrated them into the field of

representation learning In machine learning (ML), feature learning or representation learning is a set of techniques that allow a system to automatically discover the representations needed for feature detection or classification from raw data. This replaces manual fea ...

. Other such ''topological layers'' include layers based on extended persistent homology descriptors, persistence landscapes, or coordinate functions. In parallel,

persistent homology In topological data analysis, persistent homology is a method for computing topological features of a space at different spatial resolutions. More persistent features are detected over a wide range of spatial scales and are deemed more likely to ...

also found applications in graph-learning tasks. Noteworthy examples include new algorithms for learning task-specific filtration functions for graph classification or node classification tasks.

Applications

TDL is rapidly finding new applications across different domains, including data compression, enhancing the expressivity and predictive performance of graph neural networks, action recognition, and trajectory prediction.{{citation , last1=Roddenberry , first1=T. M. , title=Principled simplicial neural networks for trajectory prediction , date=July 2021 , pages=9020–9029 , publisher=PMLR , last2=Glaze , first2=N. , last3=Segarra , first3=S., arxiv=2102.10058

References

Deep learning Topology