Elastic maps provide a tool for

nonlinear dimensionality reduction Nonlinear dimensionality reduction, also known as manifold learning, is any of various related techniques that aim to project high-dimensional data, potentially existing across non-linear manifolds which cannot be adequately captured by linear de ...

. By their construction, they are a system of elastic springs embedded in the data space. This system approximates a low-dimensional manifold. The elastic coefficients of this system allow the switch from completely unstructured

k-means clustering ''k''-means clustering is a method of vector quantization, originally from signal processing, that aims to partition of a set, partition ''n'' observations into ''k'' clusters in which each observation belongs to the cluster (statistics), cluste ...

(zero elasticity) to the estimators located closely to linear PCA manifolds (for high bending and low stretching modules). With some intermediate values of the

elasticity coefficient In chemistry, the Reaction rate, rate of a chemical reaction is influenced by many different factors, such as temperature, pH, reactant, the concentration of Product (chemistry), products, and other effectors. The degree to which these factors c ...

s, this system effectively approximates non-linear principal manifolds. This approach is based on a

mechanical Mechanical may refer to: Machine * Machine (mechanical), a system of mechanisms that shape the actuator input to achieve a specific application of output forces and movement * Mechanical calculator, a device used to perform the basic operations o ...

analogy between principal manifolds, that are passing through "the middle" of the data distribution, and elastic membranes and plates. The method was developed by A.N. Gorban
A.Y. Zinovyev
and A.A. Pitenko in 1996–1998.

Energy of elastic map

Let

be a data set in a finite-dimensional

Euclidean space Euclidean space is the fundamental space of geometry, intended to represent physical space. Originally, in Euclid's ''Elements'', it was the three-dimensional space of Euclidean geometry, but in modern mathematics there are ''Euclidean spaces ...

. Elastic map is represented by a set of nodes

_j

in the same space. Each datapoint

s \in

has a ''host node'', namely the closest node

_j

(if there are several closest nodes then one takes the node with the smallest number). The data set

is divided into classes

K_j=\

. The ''approximation energy'' D is the distortion :

D=\frac\sum_^k \sum_\, s-_j\, ^2

, which is the energy of the springs with unit elasticity which connect each data point with its host node. It is possible to apply weighting factors to the terms of this sum, for example to reflect the

standard deviation In statistics, the standard deviation is a measure of the amount of variation of the values of a variable about its Expected value, mean. A low standard Deviation (statistics), deviation indicates that the values tend to be close to the mean ( ...

of the

probability density function In probability theory, a probability density function (PDF), density function, or density of an absolutely continuous random variable, is a Function (mathematics), function whose value at any given sample (or point) in the sample space (the s ...

of any subset of data points

\

. On the set of nodes an additional structure is defined. Some pairs of nodes,

(_i,_j)

, are connected by ''elastic edges''. Call this set of pairs

E

. Some triplets of nodes,

(_i,_j,_k)

, form ''bending ribs''. Call this set of triplets

G

. : The stretching energy is

U_=\frac\lambda \sum_ \, _i -_j\, ^2

, : The bending energy is

U_G=\frac\mu \sum_ \, _i -2_j+_k\, ^2

, where

\lambda

and

\mu

are the stretching and bending moduli respectively. The stretching energy is sometimes referred to as the ''membrane'', while the bending energy is referred to as the ''thin plate'' term. For example, on the 2D rectangular grid the elastic edges are just vertical and horizontal edges (pairs of closest vertices) and the bending ribs are the vertical or horizontal triplets of consecutive (closest) vertices. : The total energy of the elastic map is thus

U=D+U_E+U_G.

The position of the nodes

\

is determined by the

mechanical equilibrium In classical mechanics, a particle is in mechanical equilibrium if the net force on that particle is zero. By extension, a physical system made up of many parts is in mechanical equilibrium if the net force on each of its individual parts is ze ...

of the elastic map, i.e. its location is such that it minimizes the total energy

U

Expectation-maximization algorithm

For a given splitting of dataset

in classes

K_j

, minimization of the quadratic functional

U

is a linear problem with the

sparse matrix In numerical analysis and scientific computing, a sparse matrix or sparse array is a matrix in which most of the elements are zero. There is no strict definition regarding the proportion of zero-value elements for a matrix to qualify as sparse ...

of coefficients. Therefore, similar to

principal component analysis Principal component analysis (PCA) is a linear dimensionality reduction technique with applications in exploratory data analysis, visualization and data preprocessing. The data is linearly transformed onto a new coordinate system such that th ...

k-means ''k''-means clustering is a method of vector quantization, originally from signal processing, that aims to partition of a set, partition ''n'' observations into ''k'' clusters in which each observation belongs to the cluster (statistics), cluste ...

, a splitting method is used: * For given

\

find

\

; * For given

\

minimize

U

and find

\

; * If no change, terminate. This expectation-maximization algorithm guarantees a local minimum of

U

. For improving the approximation various additional methods are proposed. For example, the ''softening'' strategy is used. This strategy starts with a rigid grids (small length, small bending and large elasticity modules

\lambda

and

\mu

coefficients) and finishes with soft grids (small

\lambda

and

\mu

). The training goes in several epochs, each epoch with its own grid rigidness. Another adaptive strategy is ''growing net'': one starts from a small number of nodes and gradually adds new nodes. Each epoch goes with its own number of nodes.

Applications

Most important applications of the method and free software are in bioinformatics for

exploratory data analysis In statistics, exploratory data analysis (EDA) is an approach of data analysis, analyzing data sets to summarize their main characteristics, often using statistical graphics and other data visualization methods. A statistical model can be used or ...

and visualisation of multidimensional data, for data visualisation in economics, social and political sciences, as an auxiliary tool for

data mapping In computing and data management, data mapping is the process of creating data element mappings between two distinct data models. Data mapping is used as a first step for a wide variety of data integration tasks, including: * Data transforma ...

in geographic informational systems and for visualisation of data of various nature. The method is applied in quantitative biology for reconstructing the curved surface of a tree leaf from a stack of light microscopy images. This reconstruction is used for quantifying the

geodesic In geometry, a geodesic () is a curve representing in some sense the locally shortest path ( arc) between two points in a surface, or more generally in a Riemannian manifold. The term also has meaning in any differentiable manifold with a conn ...

distances between

trichomes Trichomes (; ) are fine outgrowths or appendages on plants, algae, lichens, and certain protists. They are of diverse structure and function. Examples are hairs, glandular hairs, scales, and papillae. A covering of any kind of hair on a plant ...

and their patterning, which is a marker of the capability of a plant to resist to pathogenes. Recently, the method is adapted as a support tool in the decision process underlying the selection, optimization, and management of

financial portfolio In finance, a portfolio is a collection of investments. Definition The term "portfolio" refers to any combination of financial assets such as stocks, bonds and cash. Portfolios may be held by individual investors or managed by financial profess ...

s. The method of elastic maps has been systematically tested and compared with several

machine learning Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of Computational statistics, statistical algorithms that can learn from data and generalise to unseen data, and thus perform Task ( ...

methods on the applied problem of identification of the flow regime of a gas-liquid flow in a pipe. There are various regimes: Single phase water or air flow, Bubbly flow, Bubbly-slug flow, Slug flow, Slug-churn flow, Churn flow, Churn-annular flow, and Annular flow. The simplest and most common method used to identify the flow regime is visual observation. This approach is, however, subjective and unsuitable for relatively high gas and liquid flow rates. Therefore, the machine learning methods are proposed by many authors. The methods are applied to differential pressure data collected during a calibration process. The method of elastic maps provided a 2D map, where the area of each regime is represented. The comparison with some other machine learning methods is presented in Table 1 for various pipe diameters and pressure. Here, ANN stands for the

backpropagation In machine learning, backpropagation is a gradient computation method commonly used for training a neural network to compute its parameter updates. It is an efficient application of the chain rule to neural networks. Backpropagation computes th ...

artificial neural network In machine learning, a neural network (also artificial neural network or neural net, abbreviated ANN or NN) is a computational model inspired by the structure and functions of biological neural networks. A neural network consists of connected ...

s, SVM stands for the

support vector machine In machine learning, support vector machines (SVMs, also support vector networks) are supervised max-margin models with associated learning algorithms that analyze data for classification and regression analysis. Developed at AT&T Bell Laborato ...

, SOM for the

self-organizing map A self-organizing map (SOM) or self-organizing feature map (SOFM) is an unsupervised machine learning technique used to produce a low-dimensional (typically two-dimensional) representation of a higher-dimensional data set while preserving the t ...

s. The hybrid technology was developed for engineering applications. In this technology, elastic maps are used in combination with

Principal Component Analysis Principal component analysis (PCA) is a linear dimensionality reduction technique with applications in exploratory data analysis, visualization and data preprocessing. The data is linearly transformed onto a new coordinate system such that th ...

(PCA),

Independent Component Analysis In signal processing, independent component analysis (ICA) is a computational method for separating a multivariate statistics, multivariate signal into additive subcomponents. This is done by assuming that at most one subcomponent is Gaussian and ...

(ICA) and backpropagation ANN. The textbookM. Resta
Computational Intelligence Paradigms in Economic and Financial Decision Making
Series Intelligent Systems Reference Library, Volume 99, Springer International Publishing, Switzerland 2016. provides a systematic comparison of elastic maps and

s (SOMs) in applications to economic and financial decision-making.

References

{{reflist Data mining Dimension reduction