mathematics Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. There are many ar ...

, every

analytic function In mathematics, an analytic function is a function that is locally given by a convergent power series. There exist both real analytic functions and complex analytic functions. Functions of each type are infinitely differentiable, but complex ...

can be used for defining a matrix function that maps

square matrices In mathematics, a square matrix is a matrix with the same number of rows and columns. An ''n''-by-''n'' matrix is known as a square matrix of order Any two square matrices of the same order can be added and multiplied. Square matrices are often ...

with complex entries to square matrices of the same size. This is used for defining the exponential of a matrix, which is involved in the

closed-form Closed form may refer to: Mathematics * Closed-form expression, a finitary expression * Closed differential form In mathematics, especially vector calculus and differential topology, a closed form is a differential form ''α'' whose exterior deri ...

solution of systems of

linear differential equation In mathematics, a linear differential equation is a differential equation that is linear equation, linear in the unknown function and its derivatives, so it can be written in the form a_0(x)y + a_1(x)y' + a_2(x)y'' \cdots + a_n(x)y^ = b(x) wher ...

Extending scalar function to matrix functions

There are several techniques for lifting a real function to a

square matrix In mathematics, a square matrix is a Matrix (mathematics), matrix with the same number of rows and columns. An ''n''-by-''n'' matrix is known as a square matrix of order Any two square matrices of the same order can be added and multiplied. Squ ...

function such that interesting properties are maintained. All of the following techniques yield the same matrix function, but the domains on which the function is defined may differ.

Power series

If the

has the

Taylor expansion In mathematics, the Taylor series or Taylor expansion of a function is an infinite sum of terms that are expressed in terms of the function's derivatives at a single point. For most common functions, the function and the sum of its Taylor ser ...

f(x) = c_0 + c_1 x + c_2 x^2 + \cdots

then a matrix function

A\mapsto f(A)

can be defined by substituting by a

: powers become matrix powers, additions become matrix sums and multiplications by coefficients become

scalar multiplication In mathematics, scalar multiplication is one of the basic operations defining a vector space in linear algebra (or more generally, a module in abstract algebra). In common geometrical contexts, scalar multiplication of a real Euclidean vector ...

s. If the series converges for

, x,  < r

, then the corresponding matrix series converges for matrices such that

\, A\,  < r

for some

matrix norm In the field of mathematics, norms are defined for elements within a vector space. Specifically, when the vector space comprises matrices, such norms are referred to as matrix norms. Matrix norms differ from vector norms in that they must also ...

that satisfies

\, AB\, \leq \, A\, \, B\,

Diagonalizable matrices

A square matrix is

diagonalizable In linear algebra, a square matrix A is called diagonalizable or non-defective if it is similar to a diagonal matrix. That is, if there exists an invertible matrix P and a diagonal matrix D such that . This is equivalent to (Such D are not ...

, if there is an

invertible matrix In linear algebra, an invertible matrix (''non-singular'', ''non-degenarate'' or ''regular'') is a square matrix that has an inverse. In other words, if some other matrix is multiplied by the invertible matrix, the result can be multiplied by a ...

such that

D = P^\,A\,P

is a

diagonal matrix In linear algebra, a diagonal matrix is a matrix in which the entries outside the main diagonal are all zero; the term usually refers to square matrices. Elements of the main diagonal can either be zero or nonzero. An example of a 2×2 diagon ...

, that is, has the shape

D=\begin
d_1 & \cdots & 0 \\
\vdots & \ddots & \vdots \\
0      & \cdots & d_n
\end.

A = P\,D\,P^,

it is natural to set

f(A)=P\, \begin
f(d_1) & \cdots & 0 \\
\vdots & \ddots & \vdots \\
0      & \cdots & f(d_n)
\end\,P^.

It can be verified that the matrix does not depend on a particular choice of . For example, suppose one is seeking

\Gamma(A) = (A-1)!

for

A = \begin
1&3\\
2&1
\end .

One has

A = P \begin
1-\sqrt& 0 \\
0 & 1+ \sqrt
\end P^~,

for

P= \begin
1/2 & 1/2 \\
-\frac &\frac
\end ~.

Application of the formula then simply yields

\Gamma(A) = \begin
1/2 & 1/2 \\
-\frac & \frac
\end \cdot
\begin
\Gamma(1-\sqrt) & 0\\
0&\Gamma(1+\sqrt)
\end \cdot
\begin
1 & -\sqrt/2 \\
 1 & \sqrt/2
\end \approx
\begin 2.8114 & 0.4080 \\
0.2720 & 2.8114
\end ~.

Likewise,

A^4 = \begin
1/2 & 1/2 \\
-\frac & \frac
\end \cdot
\begin
(1-\sqrt)^4 & 0\\
0&(1+\sqrt)^4
\end \cdot
\begin
1 & -\sqrt/2 \\
 1 & \sqrt/2
\end =
\begin 73 & 84\\
56 & 73
\end ~.

Jordan decomposition

All complex matrices, whether they are diagonalizable or not, have a

Jordan normal form \begin \lambda_1 1\hphantom\hphantom\\ \hphantom\lambda_1 1\hphantom\\ \hphantom\lambda_1\hphantom\\ \hphantom\lambda_2 1\hphantom\hphantom\\ \hphantom\hphantom\lambda_2\hphantom\\ \hphantom\lambda_3\hphantom\\ \hphantom\ddots\hphantom\\ ...

A = P\,J\,P^

, where the matrix ''J'' consists of

Jordan block In the mathematical discipline of matrix theory, a Jordan matrix, named after Camille Jordan, is a block diagonal matrix over a ring (whose identities are the zero 0 and one 1), where each block along the diagonal, called a Jordan block, has th ...

s. Consider these blocks separately and apply the

power series In mathematics, a power series (in one variable) is an infinite series of the form \sum_^\infty a_n \left(x - c\right)^n = a_0 + a_1 (x - c) + a_2 (x - c)^2 + \dots where ''a_n'' represents the coefficient of the ''n''th term and ''c'' is a co ...

to a Jordan block:

f \left( \begin
\lambda & 1       & 0      & \cdots & 0 \\
0       & \lambda & 1      & \vdots & \vdots \\
0       & 0       & \ddots & \ddots & \vdots \\
\vdots  & \cdots  & \ddots & \lambda & 1 \\
0       & \cdots  & \cdots & 0 & \lambda
\end \right) =

\begin
\frac & \frac & \frac & \cdots                & \frac \\
0                     & \frac  & \frac  & \vdots                & \frac \\
0                     & 0                      & \ddots                  & \ddots                & \vdots \\
\vdots                & \cdots                 & \ddots                  & \frac & \frac \\
0                     & \cdots                 & \cdots                  & 0                     & \frac
\end.

This definition can be used to extend the domain of the matrix function beyond the set of matrices with

spectral radius ''Spectral'' is a 2016 Hungarian-American military science fiction action film co-written and directed by Nic Mathieu. Written with Ian Fried (screenwriter), Ian Fried & George Nolfi, the film stars James Badge Dale as DARPA research scientist Ma ...

smaller than the radius of convergence of the power series. Note that there is also a connection to

divided differences In mathematics, divided differences is an algorithm, historically used for computing tables of logarithms and trigonometric functions. Charles Babbage's difference engine, an early mechanical calculator, was designed to use this algorithm in its ...

. A related notion is the

Jordan–Chevalley decomposition In mathematics, specifically linear algebra, the Jordan–Chevalley decomposition, named after Camille Jordan and Claude Chevalley, expresses a linear operator in a unique way as the sum of two other linear operators which are simpler to understand ...

which expresses a matrix as a sum of a diagonalizable and a nilpotent part.

Hermitian matrices

Hermitian matrix In mathematics, a Hermitian matrix (or self-adjoint matrix) is a complex square matrix that is equal to its own conjugate transpose—that is, the element in the -th row and -th column is equal to the complex conjugate of the element in the ...

has all real eigenvalues and can always be diagonalized by a

unitary matrix In linear algebra, an invertible complex square matrix is unitary if its matrix inverse equals its conjugate transpose , that is, if U^* U = UU^* = I, where is the identity matrix. In physics, especially in quantum mechanics, the conjugate ...

P, according to the

spectral theorem In linear algebra and functional analysis, a spectral theorem is a result about when a linear operator or matrix can be diagonalized (that is, represented as a diagonal matrix in some basis). This is extremely useful because computations involvin ...

. In this case, the Jordan definition is natural. Moreover, this definition allows one to extend standard inequalities for real functions: If

f(a) \leq g(a)

for all eigenvalues of

A

, then

f(A) \preceq g(A)

. (As a convention,

X \preceq Y \Leftrightarrow Y - X

is a

positive-semidefinite matrix In mathematics, a symmetric matrix M with real entries is positive-definite if the real number \mathbf^\mathsf M \mathbf is positive for every nonzero real column vector \mathbf, where \mathbf^\mathsf is the row vector transpose of \mathbf. Mo ...

.) The proof follows directly from the definition.

Cauchy integral

Cauchy's integral formula In mathematics, Cauchy's integral formula, named after Augustin-Louis Cauchy, is a central statement in complex analysis. It expresses the fact that a holomorphic function defined on a disk is completely determined by its values on the boundary o ...

from

complex analysis Complex analysis, traditionally known as the theory of functions of a complex variable, is the branch of mathematical analysis that investigates functions of complex numbers. It is helpful in many branches of mathematics, including algebraic ...

can also be used to generalize scalar functions to matrix functions. Cauchy's integral formula states that for any

defined on a set , one has

f(x) = \frac \oint_\! \, \mathrmz ~,

where is a closed simple curve inside the domain enclosing . Now, replace by a matrix and consider a path inside that encloses all

eigenvalue In linear algebra, an eigenvector ( ) or characteristic vector is a vector that has its direction unchanged (or reversed) by a given linear transformation. More precisely, an eigenvector \mathbf v of a linear transformation T is scaled by a ...

s of . One possibility to achieve this is to let be a circle around the

origin Origin(s) or The Origin may refer to: Arts, entertainment, and media Comics and manga * ''Origin'' (comics), a Wolverine comic book mini-series published by Marvel Comics in 2002 * ''The Origin'' (Buffy comic), a 1999 ''Buffy the Vampire Sl ...

with

radius In classical geometry, a radius (: radii or radiuses) of a circle or sphere is any of the line segments from its Centre (geometry), center to its perimeter, and in more modern usage, it is also their length. The radius of a regular polygon is th ...

larger than for an arbitrary

. Then, is definable by

f(A) = \frac \oint_C f(z)\left(z I - A\right)^ \mathrmz \,.

This integral can readily be evaluated numerically using the

trapezium rule In calculus, the trapezoidal rule (or trapezium rule in British English) is a technique for numerical integration, i.e., approximating the definite integral: \int_a^b f(x) \, dx. The trapezoidal rule works by approximating the region under the ...

, which converges exponentially in this case. That means that the

precision Precision, precise or precisely may refer to: Arts and media * ''Precision'' (march), the official marching music of the Royal Military College of Canada * "Precision" (song), by Big Sean * ''Precisely'' (sketch), a dramatic sketch by the Eng ...

of the result doubles when the number of nodes is doubled. In routine cases, this is bypassed by

Sylvester's formula In matrix theory, Sylvester's formula or Sylvester's matrix theorem (named after J. J. Sylvester) or Lagrange−Sylvester interpolation expresses an analytic function of a matrix as a polynomial in , in terms of the eigenvalues and eigenvectors ...

. This idea applied to

bounded linear operator In functional analysis and operator theory, a bounded linear operator is a linear transformation L : X \to Y between topological vector spaces (TVSs) X and Y that maps bounded subsets of X to bounded subsets of Y. If X and Y are normed vector ...

s on a

Banach space In mathematics, more specifically in functional analysis, a Banach space (, ) is a complete normed vector space. Thus, a Banach space is a vector space with a metric that allows the computation of vector length and distance between vectors and ...

, which can be seen as infinite matrices, leads to the

holomorphic functional calculus In mathematics, holomorphic functional calculus is functional calculus with holomorphic functions. That is to say, given a holomorphic function ''f'' of a complex argument ''z'' and an operator ''T'', the aim is to construct an operator, ''f''(''T ...

Matrix perturbations

The above Taylor power series allows the scalar

x

to be replaced by the matrix. This is not true in general when expanding in terms of

A(\eta) = A+\eta B

about

\eta = 0

unless

,B 0

. A counterexample is

f(x) = x^

, which has a finite length

Taylor series In mathematics, the Taylor series or Taylor expansion of a function is an infinite sum of terms that are expressed in terms of the function's derivatives at a single point. For most common functions, the function and the sum of its Taylor ser ...

. We compute this in two ways, * Distributive law:

f(A + \eta B) = (A+\eta B)^ = A^ + \eta(A^B + ABA + BA^) + \eta^(AB^ + BAB + B^A) + \eta^B^

* Using scalar Taylor expansion for

f(a+\eta b)

and replacing scalars with matrices at the end:

&\to A^3 = + 3A^2(\eta B) + 3A(\eta B)^2 + (\eta B)^3 \end

The scalar expression assumes commutativity while the matrix expression does not, and thus they cannot be equated directly unless

,B 0

. For some ''f''(''x'') this can be dealt with using the same method as scalar Taylor series. For example,

f(x) = \frac

. If

A^

exists then

f(A+\eta B) = f(\mathbb + \eta A^B)f(A)

. The expansion of the first term then follows the power series given above,

f(\mathbb + \eta A^B) = \mathbb - \eta A^B + (-\eta A^B)^2 + \cdots = \sum_^\infty (-\eta A^B)^n

The convergence criteria of the power series then apply, requiring

\Vert \eta A^B \Vert

to be sufficiently small under the appropriate matrix norm. For more general problems, which cannot be rewritten in such a way that the two matrices commute, the ordering of matrix products produced by repeated application of the Leibniz rule must be tracked.

Arbitrary function of a 2×2 matrix

An arbitrary function ''f''(''A'') of a 2×2 matrix A has its

simplify to

f(A) = \frac I + \frac \frac ~,

where

\lambda_\pm

are the eigenvalues of its characteristic equation, , and are given by

\lambda_\pm = \frac \pm \sqrt .

However, if there is degeneracy, the following formula is used, where f' is the derivative of f.

f(A) = f \left( \frac \right) I + \mathrm \left( \fracI - A \right ) f' \left( \frac \right) .

Examples

Matrix polynomial In mathematics, a matrix polynomial is a polynomial with square matrices as variables. Given an ordinary, scalar-valued polynomial : P(x) = \sum_^n =a_0 + a_1 x+ a_2 x^2 + \cdots + a_n x^n, this polynomial evaluated at a matrix A is :P(A) = \s ...

* Matrix root *

Matrix logarithm In mathematics, a logarithm of a matrix is another matrix such that the matrix exponential of the latter matrix equals the original matrix. It is thus a generalization of the scalar logarithm and in some sense an inverse function of the matrix exp ...

Matrix exponential In mathematics, the matrix exponential is a matrix function on square matrix, square matrices analogous to the ordinary exponential function. It is used to solve systems of linear differential equations. In the theory of Lie groups, the matrix exp ...

* Matrix sign function

Classes of matrix functions

Using the semidefinite ordering (

X \preceq Y \Leftrightarrow Y - X

is positive-semidefinite and

X \prec Y \Leftrightarrow Y - X

positive definite In mathematics, positive definiteness is a property of any object to which a bilinear form In mathematics, a bilinear form is a bilinear map on a vector space (the elements of which are called '' vectors'') over a field ''K'' (the elements of w ...

), some of the classes of scalar functions can be extended to matrix functions of

Hermitian matrices In mathematics, a Hermitian matrix (or self-adjoint matrix) is a complex square matrix that is equal to its own conjugate transpose—that is, the element in the -th row and -th column is equal to the complex conjugate of the element in the -th ...

Operator monotone

A function is called operator monotone if and only if

0 \prec A \preceq H \Rightarrow f(A) \preceq f(H)

for all self-adjoint matrices with spectra in the domain of . This is analogous to

monotone function In mathematics, a monotonic function (or monotone function) is a function between ordered sets that preserves or reverses the given order. This concept first arose in calculus, and was later generalized to the more abstract setting of ord ...

in the scalar case.

Operator concave/convex

A function is called operator concave if and only if

\tau f(A) + (1-\tau) f(H) \preceq f \left ( \tau A + (1-\tau)H \right )

for all self-adjoint matrices with spectra in the domain of and

\tau \in,1 /math>. This definition is analogous to a concave scalar function . An operator convex function can be defined be switching \preceq to \succeq in the definition above.

Examples

The matrix log is both operator monotone and operator concave. The matrix square is operator convex. The matrix exponential is none of these. Loewner's theorem states that a function on an ''open'' interval is operator monotone if and only if it has an analytic extension to the upper and lower complex half planes so that the upper half plane is mapped to itself.

Notes

References

* {{Authority control Matrix theory Mathematical physics

Extending scalar function to matrix functions

Power series

Diagonalizable matrices

Jordan decomposition

Hermitian matrices

Cauchy integral

Matrix perturbations

Arbitrary function of a 2×2 matrix

Examples

Classes of matrix functions

Operator monotone

Operator concave/convex

Examples

See also

Notes

References