In
mathematics
Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. There are many ar ...
, especially the usage of
linear algebra
Linear algebra is the branch of mathematics concerning linear equations such as
:a_1x_1+\cdots +a_nx_n=b,
linear maps such as
:(x_1, \ldots, x_n) \mapsto a_1x_1+\cdots +a_nx_n,
and their representations in vector spaces and through matrix (mathemat ...
in
mathematical physics
Mathematical physics is the development of mathematics, mathematical methods for application to problems in physics. The ''Journal of Mathematical Physics'' defines the field as "the application of mathematics to problems in physics and the de ...
and
differential geometry
Differential geometry is a Mathematics, mathematical discipline that studies the geometry of smooth shapes and smooth spaces, otherwise known as smooth manifolds. It uses the techniques of Calculus, single variable calculus, vector calculus, lin ...
, Einstein notation (also known as the Einstein summation convention or Einstein summation notation) is a notational convention that implies
summation
In mathematics, summation is the addition of a sequence of numbers, called ''addends'' or ''summands''; the result is their ''sum'' or ''total''. Beside numbers, other types of values can be summed as well: functions, vectors, matrices, pol ...
over a set of indexed terms in a formula, thus achieving brevity. As part of mathematics it is a notational subset of
Ricci calculus Ricci () is an Italian surname. Notable Riccis Arts and entertainment
* Antonio Ricci (painter) (c.1565–c.1635), Spanish Baroque painter of Italian origin
* Christina Ricci (born 1980), American actress
* Clara Ross Ricci (1858-1954), British ...
; however, it is often used in physics applications that do not distinguish between
tangent
In geometry, the tangent line (or simply tangent) to a plane curve at a given point is, intuitively, the straight line that "just touches" the curve at that point. Leibniz defined it as the line through a pair of infinitely close points o ...
and
cotangent space
In differential geometry, the cotangent space is a vector space associated with a point x on a smooth (or differentiable) manifold \mathcal M; one can define a cotangent space for every point on a smooth manifold. Typically, the cotangent space, T ...
s. It was introduced to physics by
Albert Einstein
Albert Einstein (14 March 187918 April 1955) was a German-born theoretical physicist who is best known for developing the theory of relativity. Einstein also made important contributions to quantum mechanics. His mass–energy equivalence f ...
in 1916.
Introduction
Statement of convention
According to this convention, when an index variable appears twice in a single
term and is not otherwise defined (see
Free and bound variables), it implies summation of that term over all the values of the index. So where the indices can range over the
set
Set, The Set, SET or SETS may refer to:
Science, technology, and mathematics Mathematics
*Set (mathematics), a collection of elements
*Category of sets, the category whose objects and morphisms are sets and total functions, respectively
Electro ...
,
is simplified by the convention to:
The upper indices are not
exponents but are indices of coordinates,
coefficient
In mathematics, a coefficient is a Factor (arithmetic), multiplicative factor involved in some Summand, term of a polynomial, a series (mathematics), series, or any other type of expression (mathematics), expression. It may be a Dimensionless qu ...
s or
basis vector
In mathematics, a set of elements of a vector space is called a basis (: bases) if every element of can be written in a unique way as a finite linear combination of elements of . The coefficients of this linear combination are referred to as ...
s. That is, in this context should be understood as the second component of rather than the square of (this can occasionally lead to ambiguity). The upper index position in is because, typically, an index occurs once in an upper (superscript) and once in a lower (subscript) position in a term (see ' below). Typically, would be equivalent to the traditional .
In
general relativity
General relativity, also known as the general theory of relativity, and as Einstein's theory of gravity, is the differential geometry, geometric theory of gravitation published by Albert Einstein in 1915 and is the current description of grav ...
, a common convention is that
* the
Greek alphabet
The Greek alphabet has been used to write the Greek language since the late 9th or early 8th century BC. It was derived from the earlier Phoenician alphabet, and is the earliest known alphabetic script to systematically write vowels as wel ...
is used for space and time components, where indices take on values 0, 1, 2, or 3 (frequently used letters are ),
* the
Latin alphabet
The Latin alphabet, also known as the Roman alphabet, is the collection of letters originally used by the Ancient Rome, ancient Romans to write the Latin language. Largely unaltered except several letters splitting—i.e. from , and from � ...
is used for spatial components only, where indices take on values 1, 2, or 3 (frequently used letters are ),
In general, indices can range over any
indexing set, including an
infinite set
In set theory, an infinite set is a set that is not a finite set. Infinite sets may be countable or uncountable.
Properties
The set of natural numbers (whose existence is postulated by the axiom of infinity) is infinite. It is the only set ...
. This should not be confused with a typographically similar convention used to distinguish between
tensor index notation
In mathematics, a tensor is an algebraic object that describes a multilinear relationship between sets of algebraic objects associated with a vector space. Tensors may map between different objects such as vectors, scalars, and even other ...
and the closely related but distinct basis-independent
abstract index notation
Abstract index notation (also referred to as slot-naming index notation) is a mathematical notation for tensors and spinors that uses indices to indicate their types, rather than their components in a particular basis. The indices are mere placeh ...
.
An index that is summed over is a ''summation index'', in this case "". It is also called a
dummy index since any symbol can replace "" without changing the meaning of the expression (provided that it does not collide with other index symbols in the same term).
An index that is not summed over is a
''free index'' and should appear only once per term. If such an index does appear, it usually also appears in every other term in an equation. An example of a free index is the "" in the equation
, which is equivalent to the equation
.
Application
Einstein notation can be applied in slightly different ways. Typically, each index occurs once in an upper (superscript) and once in a lower (subscript) position in a term; however, the convention can be applied more generally to any repeated indices within a term.
When dealing with
covariant and contravariant vectors, where the position of an index indicates the type of vector, the first case usually applies; a covariant vector can only be contracted with a contravariant vector, corresponding to summation of the products of coefficients. On the other hand, when there is a fixed coordinate basis (or when not considering coordinate vectors), one may choose to use only subscripts; see ' below.
Vector representations
Superscripts and subscripts versus only subscripts
In terms of
covariance and contravariance of vectors
In physics, especially in multilinear algebra and tensor analysis, covariance and contravariance describe how the quantitative description of certain geometric or physical entities changes with a change of basis. Briefly, a contravariant vecto ...
,
* upper indices represent components of
contravariant vectors (
vector
Vector most often refers to:
* Euclidean vector, a quantity with a magnitude and a direction
* Disease vector, an agent that carries and transmits an infectious pathogen into another living organism
Vector may also refer to:
Mathematics a ...
s),
* lower indices represent components of
covariant vectors (
covectors).
They transform contravariantly or covariantly, respectively, with respect to
change of basis
In mathematics, an ordered basis of a vector space of finite dimension allows representing uniquely any element of the vector space by a coordinate vector, which is a sequence of scalars called coordinates. If two different bases are conside ...
.
In recognition of this fact, the following notation uses the same symbol both for a vector or covector and its ''components'', as in:
where
is the vector and
are its components (not the
th covector
),
is the covector and
are its components. The basis vector elements
are each column vectors, and the covector basis elements
are each row covectors. (See also ;
duality, below and the
examples
Example may refer to:
* ''exempli gratia'' (e.g.), usually read out in English as "for example"
* .example, reserved as a domain name that may not be installed as a top-level domain of the Internet
** example.com, example.net, example.org, a ...
)
In the presence of a
non-degenerate form (an
isomorphism
In mathematics, an isomorphism is a structure-preserving mapping or morphism between two structures of the same type that can be reversed by an inverse mapping. Two mathematical structures are isomorphic if an isomorphism exists between the ...
, for instance a
Riemannian metric
In differential geometry, a Riemannian manifold is a geometric space on which many geometric notions such as distance, angles, length, volume, and curvature are defined. Euclidean space, the N-sphere, n-sphere, hyperbolic space, and smooth surf ...
or
Minkowski metric
In physics, Minkowski space (or Minkowski spacetime) () is the main mathematical description of spacetime in the absence of general_relativity, gravitation. It combines inertial space and time manifolds into a four-dimensional model.
The model ...
), one can
raise and lower indices.
A basis gives such a form (via the
dual basis
In linear algebra, given a vector space V with a basis B of vectors indexed by an index set I (the cardinality of I is the dimension of V), the dual set of B is a set B^* of vectors in the dual space V^* with the same index set I such that B and ...
), hence when working on with a
Euclidean metric
In mathematics, the Euclidean distance between two points in Euclidean space is the length of the line segment between them. It can be calculated from the Cartesian coordinates of the points using the Pythagorean theorem, and therefore is oc ...
and a fixed
orthonormal basis
In mathematics, particularly linear algebra, an orthonormal basis for an inner product space V with finite Dimension (linear algebra), dimension is a Basis (linear algebra), basis for V whose vectors are orthonormal, that is, they are all unit vec ...
, one has the option to work with only subscripts.
However, if one changes coordinates, the way that coefficients change depends on the variance of the object, and one cannot ignore the distinction; see
Covariance and contravariance of vectors
In physics, especially in multilinear algebra and tensor analysis, covariance and contravariance describe how the quantitative description of certain geometric or physical entities changes with a change of basis. Briefly, a contravariant vecto ...
.
Mnemonics
In the above example, vectors are represented as
matrices
Matrix (: matrices or matrixes) or MATRIX may refer to:
Science and mathematics
* Matrix (mathematics), a rectangular array of numbers, symbols or expressions
* Matrix (logic), part of a formula in prenex normal form
* Matrix (biology), the ...
(column vectors), while covectors are represented as matrices (row covectors).
When using the column vector convention:
* "Upper indices go up to down; lower indices go left to right."
* "Covariant tensors are row vectors that have indices that are below (co-row-below)."
* Covectors are row vectors:
Hence the lower index indicates which ''column'' you are in.
* Contravariant vectors are column vectors:
Hence the upper index indicates which ''row'' you are in.
Abstract description
The virtue of Einstein notation is that it represents the
invariant quantities with a simple notation.
In physics, a
scalar is invariant under transformations of basis. In particular, a
Lorentz scalar
In a relativistic theory of physics, a Lorentz scalar is a scalar expression whose value is invariant under any Lorentz transformation. A Lorentz scalar may be generated from, e.g., the scalar product of vectors, or by contracting tensors. Whil ...
is invariant under a
Lorentz transformation
In physics, the Lorentz transformations are a six-parameter family of Linear transformation, linear coordinate transformation, transformations from a Frame of Reference, coordinate frame in spacetime to another frame that moves at a constant vel ...
. The individual terms in the sum are not. When the basis is changed, the ''components'' of a vector change by a
linear transformation
In mathematics, and more specifically in linear algebra, a linear map (also called a linear mapping, linear transformation, vector space homomorphism, or in some contexts linear function) is a mapping V \to W between two vector spaces that pr ...
described by a matrix. This led Einstein to propose the convention that repeated indices imply the summation is to be done.
As for covectors, they change by the
inverse matrix. This is designed to guarantee that the linear function associated with the covector, the sum above, is the same no matter what the basis is.
The value of the Einstein convention is that it applies to other
vector space
In mathematics and physics, a vector space (also called a linear space) is a set (mathematics), set whose elements, often called vector (mathematics and physics), ''vectors'', can be added together and multiplied ("scaled") by numbers called sc ...
s built from using the
tensor product
In mathematics, the tensor product V \otimes W of two vector spaces V and W (over the same field) is a vector space to which is associated a bilinear map V\times W \rightarrow V\otimes W that maps a pair (v,w),\ v\in V, w\in W to an element of ...
and
duality. For example, , the tensor product of with itself, has a basis consisting of tensors of the form . Any tensor in can be written as:
, the dual of , has a basis , , ..., which obeys the rule
where is the
Kronecker delta
In mathematics, the Kronecker delta (named after Leopold Kronecker) is a function of two variables, usually just non-negative integers. The function is 1 if the variables are equal, and 0 otherwise:
\delta_ = \begin
0 &\text i \neq j, \\
1 &\ ...
. As
the row/column coordinates on a matrix correspond to the upper/lower indices on the tensor product.
Common operations in this notation
In Einstein notation, the usual element reference
for the
-th row and
-th column of matrix
becomes
. We can then write the following operations in Einstein notation as follows.
Inner product
The
inner product
In mathematics, an inner product space (or, rarely, a Hausdorff pre-Hilbert space) is a real vector space or a complex vector space with an operation called an inner product. The inner product of two vectors in the space is a scalar, ofte ...
of two vectors is the sum of the products of their corresponding components, with the indices of one vector lowered (see
#Raising and lowering indices):
In the case of an
orthonormal basis
In mathematics, particularly linear algebra, an orthonormal basis for an inner product space V with finite Dimension (linear algebra), dimension is a Basis (linear algebra), basis for V whose vectors are orthonormal, that is, they are all unit vec ...
, we have
, and the expression simplifies to:
Vector cross product
In three dimensions, the
cross product
In mathematics, the cross product or vector product (occasionally directed area product, to emphasize its geometric significance) is a binary operation on two vectors in a three-dimensional oriented Euclidean vector space (named here E), and ...
of two vectors with respect to a
positively oriented orthonormal basis, meaning that
, can be expressed as:
Here,
is the
Levi-Civita symbol
In mathematics, particularly in linear algebra, tensor analysis, and differential geometry, the Levi-Civita symbol or Levi-Civita epsilon represents a collection of numbers defined from the sign of a permutation of the natural numbers , for some ...
. Since the basis is orthonormal, raising the index
does not alter the value of
, when treated as a tensor.
Matrix-vector multiplication
The product of a matrix with a column vector is:
equivalent to
This is a special case of matrix multiplication.
Matrix multiplication
The
matrix product
In mathematics, specifically in linear algebra, matrix multiplication is a binary operation that produces a matrix from two matrices. For matrix multiplication, the number of columns in the first matrix must be equal to the number of rows in the s ...
of two matrices and is:
equivalent to
Trace
For a
square matrix
In mathematics, a square matrix is a Matrix (mathematics), matrix with the same number of rows and columns. An ''n''-by-''n'' matrix is known as a square matrix of order Any two square matrices of the same order can be added and multiplied.
Squ ...
, the
trace is the sum of the diagonal elements, hence the sum over a common index .
Outer product
The
outer product
In linear algebra, the outer product of two coordinate vectors is the matrix whose entries are all products of an element in the first vector with an element in the second vector. If the two coordinate vectors have dimensions ''n'' and ''m'', the ...
of the column vector by the row vector yields an matrix :
Since and represent two ''different'' indices, there is no summation and the indices are not eliminated by the multiplication.
Raising and lowering indices
Given a
tensor
In mathematics, a tensor is an algebraic object that describes a multilinear relationship between sets of algebraic objects associated with a vector space. Tensors may map between different objects such as vectors, scalars, and even other ...
, one can
raise an index or lower an index by contracting the tensor with the
metric tensor
In the mathematical field of differential geometry, a metric tensor (or simply metric) is an additional structure on a manifold (such as a surface) that allows defining distances and angles, just as the inner product on a Euclidean space allows ...
, . For example, taking the tensor , one can lower an index:
Or one can raise an index:
See also
*
Tensor
In mathematics, a tensor is an algebraic object that describes a multilinear relationship between sets of algebraic objects associated with a vector space. Tensors may map between different objects such as vectors, scalars, and even other ...
*
Abstract index notation
Abstract index notation (also referred to as slot-naming index notation) is a mathematical notation for tensors and spinors that uses indices to indicate their types, rather than their components in a particular basis. The indices are mere placeh ...
*
Bra–ket notation
Bra–ket notation, also called Dirac notation, is a notation for linear algebra and linear operators on complex vector spaces together with their dual space both in the finite-dimensional and infinite-dimensional case. It is specifically de ...
*
Penrose graphical notation
In mathematics and physics, Penrose graphical notation or tensor diagram notation is a (usually handwritten) visual depiction of multilinear functions or tensors proposed by Roger Penrose in 1971. A diagram in the notation consists of several sh ...
*
Levi-Civita symbol
In mathematics, particularly in linear algebra, tensor analysis, and differential geometry, the Levi-Civita symbol or Levi-Civita epsilon represents a collection of numbers defined from the sign of a permutation of the natural numbers , for some ...
*
DeWitt notation
Notes
#This applies only for numerical indices. The situation is the opposite for
abstract indices. Then, vectors themselves carry upper abstract indices and covectors carry lower abstract indices, as per the example in the
introduction
Introduction, The Introduction, Intro, or The Intro may refer to:
General use
* Introduction (music), an opening section of a piece of music
* Introduction (writing), a beginning section to a book, article or essay which states its purpose and g ...
of this article. Elements of a basis of vectors may carry a lower ''numerical'' index and an upper ''abstract'' index.
References
Bibliography
* .
External links
*
*
*
{{tensors
Mathematical notation
Multilinear algebra
Tensors
Riemannian geometry
Mathematical physics
Albert Einstein