In statistics, compositional data are quantitative descriptions of the parts of some whole, conveying relative information. Mathematically, compositional data are represented by points on a simplex. Measurements involving probabilities, proportions, percentages, and ppm can all be thought of as compositional data.
Ternary plot
Compositional data in three variables can be plotted via ternary plots. The use of a barycentric plot on three variables graphically depicts the ratios of the three variables as positions in an equilateral triangle.
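As an illustration of how a barycentric plot places a point, the following minimal sketch maps a three-part composition to Cartesian coordinates inside an equilateral triangle. The vertex layout (unit side length, first vertex at the origin) and the function name `ternary_to_cartesian` are assumptions made for this example, not part of any standard.

```python
import math

# Vertices of an equilateral triangle with unit side length.
# This layout is an arbitrary choice for the sketch: each part of the
# composition "pulls" the point towards its own vertex.
VERTICES = (
    (0.0, 0.0),                 # vertex for the first part
    (1.0, 0.0),                 # vertex for the second part
    (0.5, math.sqrt(3) / 2),    # vertex for the third part
)


def ternary_to_cartesian(a, b, c):
    """Return the (x, y) position of the composition (a, b, c).

    The parts are first rescaled to sum to 1, so only their ratios matter;
    the point is then the weighted average of the triangle's vertices.
    """
    if min(a, b, c) < 0:
        raise ValueError("parts must be non-negative")
    total = a + b + c
    if total <= 0:
        raise ValueError("at least one part must be positive")
    weights = (a / total, b / total, c / total)
    x = sum(w * vx for w, (vx, _) in zip(weights, VERTICES))
    y = sum(w * vy for w, (_, vy) in zip(weights, VERTICES))
    return x, y


if __name__ == "__main__":
    # Equal parts land at the centre of the triangle.
    print(ternary_to_cartesian(1, 1, 1))
    # Percentages and proportions with the same ratios give the same point.
    print(ternary_to_cartesian(20, 30, 50))
    print(ternary_to_cartesian(0.2, 0.3, 0.5))
```

Because the parts are rescaled before plotting, the position depends only on the ratios between the three variables, which is the property the ternary plot is meant to display.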
Simplicial sample space
In general, John Aitchison defined compositional data in 1982 to be proportions of some whole. In particular, a compositional data point (or ''composition'' for short) can be represented by a real vector with positive components. The sample space of compositional data is a simplex:
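:: <math>\mathcal{S}^D=\left\{\mathbf{x}=[x_1,x_2,\dots,x_D]\in\mathbb{R}^D \,\left|\, x_i>0,\ i=1,2,\dots,D;\ \sum_{i=1}^D x_i=\kappa \right.\right\}</math>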

The only information is given by the ratios between components, so the information of a composition is preserved under multiplication by any positive constant. Therefore, the sample space of compositional data can always be assumed to be a standard simplex, i.e. one in which the components sum to 1. In this context, normalization to the standard simplex is called closure and is denoted by <math>\mathcal{C}[\,\cdot\,]</math>.
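The normalization described above can be sketched in a few lines. This is a minimal illustration, not a reference implementation: the function name `closure` and the optional `kappa` argument (the constant the parts should sum to, 1 for the standard simplex) are names chosen for this example.

```python
def closure(parts, kappa=1.0):
    """Rescale a vector of positive parts so that it sums to `kappa`.

    With the default kappa=1.0 the result lies on the standard simplex.
    The ratios between parts are unchanged, so no compositional
    information is lost.
    """
    parts = list(parts)
    if not parts:
        raise ValueError("a composition needs at least one part")
    if any(p <= 0 for p in parts):
        raise ValueError("all parts must be strictly positive")
    total = sum(parts)
    return [kappa * p / total for p in parts]


if __name__ == "__main__":
    # Two vectors that differ only by a positive factor close to the same point.
    print(closure([1, 1, 2]))
    print(closure([2, 2, 4]))
    # The same composition expressed as percentages.
    print(closure([1, 1, 2], kappa=100))
```

Applying `closure` to any positive multiple of a vector gives the same result, which is why the choice of total (1, 100 for percentages, or one million for ppm) does not affect the composition itself.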