mathematics Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. There are many ar ...

, majorization is a

preorder In mathematics, especially in order theory, a preorder or quasiorder is a binary relation that is reflexive relation, reflexive and Transitive relation, transitive. The name is meant to suggest that preorders are ''almost'' partial orders, ...

on vectors of

real numbers In mathematics, a real number is a number that can be used to measurement, measure a continuous variable, continuous one-dimensional quantity such as a time, duration or temperature. Here, ''continuous'' means that pairs of values can have arbi ...

. For two such vectors,

\mathbf,\ \mathbf \in \mathbb^n

, we say that

\mathbf

weakly majorizes (or dominates)

\mathbf

from below, commonly denoted

\mathbf \succ_w \mathbf,

when :

\sum_^k x_i^ \geq \sum_^k y_i^

for all

k=1,\,\dots,\,n

, where

x_i^

denotes

i

^th largest entry of

x

. If

\mathbf, \mathbf

further satisfy

\sum_^n x_i = \sum_^n y_i

, we say that

\mathbf

majorizes (or dominates)

\mathbf

, commonly denoted

\mathbf \succ \mathbf

. Both weak majorization and majorization are partial orders for vectors whose entries are non-decreasing, but only a

for general vectors, since majorization is agnostic to the ordering of the entries in vectors, e.g., the statement

(1,2)\prec (0,3)

is simply equivalent to

(2,1)\prec (3,0)

. Specifically,

\mathbf \succ \mathbf \wedge \mathbf \succ \mathbf

if and only if

\mathbf, \mathbf

are permutations of each other. Similarly for

\succ_w

. Majorizing also sometimes refers to entrywise ordering, e.g. the real-valued function ''f'' majorizes the real-valued function ''g'' when

f(x) \geq g(x)

for all

x

in the domain, or other technical definitions, such as majorizing measures in

probability theory Probability theory or probability calculus is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expre ...

Equivalent conditions

Geometric definition

For

\mathbf,\ \mathbf \in \mathbb^n,

we have

\mathbf \prec \mathbf

if and only if

\mathbf

is in the convex hull of all vectors obtained by permuting the coordinates of

\mathbf

. This is equivalent to saying that

\mathbf = \mathbf\mathbf

for some

doubly stochastic matrix In mathematics, especially in probability and combinatorics, a doubly stochastic matrix (also called bistochastic matrix) is a square matrix X=(x_) of nonnegative real numbers, each of whose rows and columns sums to 1, i.e., :\sum_i x_=\sum_j x_ ...

\mathbf

.Barry C. Arnold. "Majorization and the Lorenz Order: A Brief Introduction". Springer-Verlag Lecture Notes in Statistics, vol. 43, 1987. In particular,

\mathbf

can be written as a

convex combination In convex geometry and Vector space, vector algebra, a convex combination is a linear combination of point (geometry), points (which can be vector (geometric), vectors, scalar (mathematics), scalars, or more generally points in an affine sp ...

n

permutations of

\mathbf

. In other words,

\mathbf

is in the

permutahedron In mathematics, the permutohedron (also spelled permutahedron) of order is an -dimensional polytope embedded in an -dimensional space. Its vertex (geometry), vertex coordinates (labels) are the permutations of the first natural numbers. The edg ...

\mathbf

. Figure 1 displays the convex hull in 2D for the vector

\mathbf=(3,\,1)

. Notice that the center of the convex hull, which is an interval in this case, is the vector

\mathbf=(2,\,2)

. This is the "smallest" vector satisfying

\mathbf \prec \mathbf

for this given vector

\mathbf

. Figure 2 shows the convex hull in 3D. The center of the convex hull, which is a 2D polygon in this case, is the "smallest" vector

\mathbf

satisfying

\mathbf \prec \mathbf

for this given vector

\mathbf

Other definitions

Each of the following statements is true if and only if

\mathbf\succ \mathbf

. * From

\mathbf

we can produce

\mathbf

by a finite sequence of "Robin Hood operations" where we replace two elements

x_i

and

x_j < x_i

with

x_i-\varepsilon

and

x_j+\varepsilon

, respectively, for some

\varepsilon \in (0, x_i-x_j)

. * For every convex function

h:\mathbb\to \mathbb

\sum_^d h(x_i) \geq \sum_^d h(y_i)

. **In fact, a special case suffices:

\sum_i=\sum_i

and, for every ,

\sum_^d \max(0,x_i-t) \geq\sum_^d \max(0,y_i-t)

. *For every

t \in \mathbb

\sum_^d , x_j-t,  \geq \sum_^d , y_j-t,

. *

Each vector

\mathbf

can be plotted as a concave curve by connecting

(0,0), (1, x_1^), (2, x_1^+x_2^), \dots, (n, x_1^+x_2^ + \dots +x_n^)

. Then

\mathbf\succ \mathbf

is equivalent to the curve of

\mathbf

being higher than that of

\mathbf

Examples

Among non-negative vectors with three components,

(1, 0, 0)

and permutations of it majorize all other vectors

(p_1, p_2, p_3)

such that

p_1 + p_2 + p_3 = 1

. For example,

(1, 0, 0) \succ (1/2, 0, 1/2)

. Similarly,

(1/3, 1/3, 1/3)

is majorized by all other such vectors, so

(1/2, 0, 1/2) \succ (1/3, 1/3, 1/3)

. This behavior extends to general-length

probability vector In mathematics and statistics, a probability vector or stochastic vector is a vector with non-negative entries that add up to one. The positions (indices) of a probability vector represent the possible outcomes of a discrete random variable, and ...

s: the singleton vector majorizes all other probability vectors, and the uniform distribution is majorized by all probability vectors.

Schur convexity

A function

f:\mathbb^n \to \mathbb

is said to be Schur convex when

\mathbf \succ \mathbf

implies

f(\mathbf) \geq  f(\mathbf)

. Hence, Schur-convex functions translate the ordering of vectors to a standard ordering in

\mathbb

. Similarly,

f(\mathbf)

is Schur concave when

\mathbf \succ \mathbf

implies

f(\mathbf) \leq f(\mathbf).

An example of a Schur-convex function is the max function,

\max(\mathbf)=x_^

. Schur convex functions are necessarily symmetric that the entries of it argument can be switched without modifying the value of the function. Therefore, linear functions, which are convex, are not Schur-convex unless they are symmetric. If a function is symmetric and convex, then it is Schur-convex.

Generalizations

Majorization can be generalized to the Lorenz ordering, a partial order on distribution functions. For example, a

wealth distribution The distribution of wealth is a comparison of the wealth of various members or groups in a society. It shows one aspect of economic inequality or economic heterogeneity. The distribution of wealth differs from the income distribution in that ...

is Lorenz-greater than another if its

Lorenz curve In economics, the Lorenz curve is a graphical representation of the distribution of income or of wealth. It was developed by Max O. Lorenz in 1905 for representing Economic inequality, inequality of the wealth distribution. The curve is a graph ...

lies below the other. As such, a Lorenz-greater wealth distribution has a higher

Gini coefficient In economics, the Gini coefficient ( ), also known as the Gini index or Gini ratio, is a measure of statistical dispersion intended to represent the income distribution, income inequality, the wealth distribution, wealth inequality, or the ...

, and has more

income disparity In economics, income distribution covers how a country's total GDP is distributed amongst its population. Economic theory and economic policy have long seen income and its distribution as a central concern. Unequal distribution of income causes e ...

. The majorization preorder can be naturally extended to

density matrices In quantum mechanics, a density matrix (or density operator) is a matrix used in calculating the probabilities of the outcomes of measurements performed on physical systems. It is a generalization of the state vectors or wavefunctions: while th ...

in the context of

quantum information Quantum information is the information of the state of a quantum system. It is the basic entity of study in quantum information theory, and can be manipulated using quantum information processing techniques. Quantum information refers to both t ...

. In particular,

\rho\succ\rho'

exactly when

\mathrm

rho Rho (; uppercase Ρ, lowercase ρ or ; or ) is the seventeenth letter of the Greek alphabet. In the system of Greek numerals it has a value of 100. It is derived from Phoenician alphabet, Phoenician letter resh . Its uppercase form uses the same ...

succ\mathrm rho'/math> (where

\mathrm

denotes the state's

spectrum A spectrum (: spectra or spectrums) is a set of related ideas, objects, or properties whose features overlap such that they blend to form a continuum. The word ''spectrum'' was first used scientifically in optics to describe the rainbow of co ...

). Similarly, one can say a

Hermitian operator In mathematics, a self-adjoint operator on a complex vector space ''V'' with inner product \langle\cdot,\cdot\rangle is a linear map ''A'' (from ''V'' to itself) that is its own adjoint. That is, \langle Ax,y \rangle = \langle x,Ay \rangle for al ...

\mathbf

, majorizes another,

\mathbf

, if the set of eigenvalues of

\mathbf

majorizes that of

\mathbf

Notes

References

* J. Karamata. "Sur une inegalite relative aux fonctions convexes." ''Publ. Math. Univ. Belgrade'' 1, 145–158, 1932. * G. H. Hardy, J. E. Littlewood and G. Pólya, ''Inequalities'', 2nd edition, 1952, Cambridge University Press, London. * ''Inequalities: Theory of Majorization and Its Applications'' Albert W. Marshall,

Ingram Olkin Ingram Olkin (July 23, 1924 – April 28, 2016) was a professor emeritus and chair of statistics and education at Stanford University and the Stanford Graduate School of Education. He is known for developing statistical analysis for evaluatin ...

, Barry Arnold, Second edition. Springer Series in Statistics. Springer, New York, 2011.
A tribute to Marshall and Olkin's book "Inequalities: Theory of Majorization and its Applications"
* ''Matrix Analysis'' (1996) Rajendra Bhatia, Springer, * ''Topics in Matrix Analysis'' (1994) Roger A. Horn and Charles R. Johnson, Cambridge University Press, * ''Majorization and Matrix Monotone Functions in Wireless Communications'' (2007) Eduard Jorswieck and Holger Boche, Now Publishers, * ''The Cauchy Schwarz Master Class'' (2004) J. Michael Steele, Cambridge University Press, {{ISBN, 978-0-521-54677-5

External links

Software

OCTAVE In music, an octave (: eighth) or perfect octave (sometimes called the diapason) is an interval between two notes, one having twice the frequency of vibration of the other. The octave relationship is a natural phenomenon that has been referr ...

MATLAB MATLAB (an abbreviation of "MATrix LABoratory") is a proprietary multi-paradigm programming language and numeric computing environment developed by MathWorks. MATLAB allows matrix manipulations, plotting of functions and data, implementat ...

br>code to check majorization
Order theory Linear algebra