Majorization
   HOME

TheInfoList



OR:

In
mathematics Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. There are many ar ...
, majorization is a
preorder In mathematics, especially in order theory, a preorder or quasiorder is a binary relation that is reflexive relation, reflexive and Transitive relation, transitive. The name is meant to suggest that preorders are ''almost'' partial orders, ...
on vectors of
real numbers In mathematics, a real number is a number that can be used to measurement, measure a continuous variable, continuous one-dimensional quantity such as a time, duration or temperature. Here, ''continuous'' means that pairs of values can have arbi ...
. For two such vectors, \mathbf,\ \mathbf \in \mathbb^n, we say that \mathbf weakly majorizes (or dominates) \mathbf from below, commonly denoted \mathbf \succ_w \mathbf, when : \sum_^k x_i^ \geq \sum_^k y_i^ for all k=1,\,\dots,\,n, where x_i^ denotes ith largest entry of x. If \mathbf, \mathbf further satisfy \sum_^n x_i = \sum_^n y_i, we say that \mathbf majorizes (or dominates) \mathbf , commonly denoted \mathbf \succ \mathbf. Both weak majorization and majorization are partial orders for vectors whose entries are non-decreasing, but only a
preorder In mathematics, especially in order theory, a preorder or quasiorder is a binary relation that is reflexive relation, reflexive and Transitive relation, transitive. The name is meant to suggest that preorders are ''almost'' partial orders, ...
for general vectors, since majorization is agnostic to the ordering of the entries in vectors, e.g., the statement (1,2)\prec (0,3) is simply equivalent to (2,1)\prec (3,0). Specifically, \mathbf \succ \mathbf \wedge \mathbf \succ \mathbf if and only if \mathbf, \mathbf are permutations of each other. Similarly for \succ_w. Majorizing also sometimes refers to entrywise ordering, e.g. the real-valued function ''f'' majorizes the real-valued function ''g'' when f(x) \geq g(x) for all x in the domain, or other technical definitions, such as majorizing measures in
probability theory Probability theory or probability calculus is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expre ...
.


Equivalent conditions


Geometric definition

For \mathbf,\ \mathbf \in \mathbb^n, we have \mathbf \prec \mathbf if and only if \mathbf is in the convex hull of all vectors obtained by permuting the coordinates of \mathbf. This is equivalent to saying that \mathbf = \mathbf\mathbf for some
doubly stochastic matrix In mathematics, especially in probability and combinatorics, a doubly stochastic matrix (also called bistochastic matrix) is a square matrix X=(x_) of nonnegative real numbers, each of whose rows and columns sums to 1, i.e., :\sum_i x_=\sum_j x_ ...
\mathbf.Barry C. Arnold. "Majorization and the Lorenz Order: A Brief Introduction". Springer-Verlag Lecture Notes in Statistics, vol. 43, 1987. In particular, \mathbf can be written as a
convex combination In convex geometry and Vector space, vector algebra, a convex combination is a linear combination of point (geometry), points (which can be vector (geometric), vectors, scalar (mathematics), scalars, or more generally points in an affine sp ...
of n permutations of \mathbf. In other words, \mathbf is in the
permutahedron In mathematics, the permutohedron (also spelled permutahedron) of order is an -dimensional polytope embedded in an -dimensional space. Its vertex (geometry), vertex coordinates (labels) are the permutations of the first natural numbers. The edg ...
of \mathbf. Figure 1 displays the convex hull in 2D for the vector \mathbf=(3,\,1). Notice that the center of the convex hull, which is an interval in this case, is the vector \mathbf=(2,\,2). This is the "smallest" vector satisfying \mathbf \prec \mathbf for this given vector \mathbf. Figure 2 shows the convex hull in 3D. The center of the convex hull, which is a 2D polygon in this case, is the "smallest" vector \mathbf satisfying \mathbf \prec \mathbf for this given vector \mathbf.


Other definitions

Each of the following statements is true if and only if \mathbf\succ \mathbf. * From \mathbf we can produce \mathbf by a finite sequence of "Robin Hood operations" where we replace two elements x_i and x_j < x_i with x_i-\varepsilon and x_j+\varepsilon, respectively, for some \varepsilon \in (0, x_i-x_j). * For every convex function h:\mathbb\to \mathbb, \sum_^d h(x_i) \geq \sum_^d h(y_i). **In fact, a special case suffices: \sum_i=\sum_i and, for every , \sum_^d \max(0,x_i-t) \geq\sum_^d \max(0,y_i-t). *For every t \in \mathbb, \sum_^d , x_j-t, \geq \sum_^d , y_j-t, . *Each vector \mathbf can be plotted as a concave curve by connecting (0,0), (1, x_1^), (2, x_1^+x_2^), \dots, (n, x_1^+x_2^ + \dots +x_n^). Then \mathbf\succ \mathbf is equivalent to the curve of \mathbf being higher than that of \mathbf.


Examples

Among non-negative vectors with three components, (1, 0, 0) and permutations of it majorize all other vectors (p_1, p_2, p_3) such that p_1 + p_2 + p_3 = 1. For example, (1, 0, 0) \succ (1/2, 0, 1/2). Similarly, (1/3, 1/3, 1/3) is majorized by all other such vectors, so (1/2, 0, 1/2) \succ (1/3, 1/3, 1/3). This behavior extends to general-length
probability vector In mathematics and statistics, a probability vector or stochastic vector is a vector with non-negative entries that add up to one. The positions (indices) of a probability vector represent the possible outcomes of a discrete random variable, and ...
s: the singleton vector majorizes all other probability vectors, and the uniform distribution is majorized by all probability vectors.


Schur convexity

A function f:\mathbb^n \to \mathbb is said to be Schur convex when \mathbf \succ \mathbf implies f(\mathbf) \geq f(\mathbf). Hence, Schur-convex functions translate the ordering of vectors to a standard ordering in \mathbb. Similarly, f(\mathbf) is Schur concave when \mathbf \succ \mathbf implies f(\mathbf) \leq f(\mathbf). An example of a Schur-convex function is the max function, \max(\mathbf)=x_^. Schur convex functions are necessarily symmetric that the entries of it argument can be switched without modifying the value of the function. Therefore, linear functions, which are convex, are not Schur-convex unless they are symmetric. If a function is symmetric and convex, then it is Schur-convex.


Generalizations

Majorization can be generalized to the Lorenz ordering, a partial order on distribution functions. For example, a
wealth distribution The distribution of wealth is a comparison of the wealth of various members or groups in a society. It shows one aspect of economic inequality or economic heterogeneity. The distribution of wealth differs from the income distribution in that ...
is Lorenz-greater than another if its
Lorenz curve In economics, the Lorenz curve is a graphical representation of the distribution of income or of wealth. It was developed by Max O. Lorenz in 1905 for representing Economic inequality, inequality of the wealth distribution. The curve is a graph ...
lies below the other. As such, a Lorenz-greater wealth distribution has a higher
Gini coefficient In economics, the Gini coefficient ( ), also known as the Gini index or Gini ratio, is a measure of statistical dispersion intended to represent the income distribution, income inequality, the wealth distribution, wealth inequality, or the ...
, and has more
income disparity In economics, income distribution covers how a country's total GDP is distributed amongst its population. Economic theory and economic policy have long seen income and its distribution as a central concern. Unequal distribution of income causes e ...
. The majorization preorder can be naturally extended to
density matrices In quantum mechanics, a density matrix (or density operator) is a matrix used in calculating the probabilities of the outcomes of measurements performed on physical systems. It is a generalization of the state vectors or wavefunctions: while th ...
in the context of
quantum information Quantum information is the information of the state of a quantum system. It is the basic entity of study in quantum information theory, and can be manipulated using quantum information processing techniques. Quantum information refers to both t ...
. In particular, \rho\succ\rho' exactly when \mathrm
rho Rho (; uppercase Ρ, lowercase ρ or ; or ) is the seventeenth letter of the Greek alphabet. In the system of Greek numerals it has a value of 100. It is derived from Phoenician alphabet, Phoenician letter resh . Its uppercase form uses the same ...
succ\mathrm rho'/math> (where \mathrm denotes the state's
spectrum A spectrum (: spectra or spectrums) is a set of related ideas, objects, or properties whose features overlap such that they blend to form a continuum. The word ''spectrum'' was first used scientifically in optics to describe the rainbow of co ...
). Similarly, one can say a
Hermitian operator In mathematics, a self-adjoint operator on a complex vector space ''V'' with inner product \langle\cdot,\cdot\rangle is a linear map ''A'' (from ''V'' to itself) that is its own adjoint. That is, \langle Ax,y \rangle = \langle x,Ay \rangle for al ...
, \mathbf, majorizes another, \mathbf, if the set of eigenvalues of \mathbf majorizes that of \mathbf.


See also

*
Muirhead's inequality In mathematics, Muirhead's inequality, named after Robert Franklin Muirhead, also known as the "bunching" method, generalizes the inequality of arithmetic and geometric means. Preliminary definitions ''a''-mean For any real vector :a=(a_1,\do ...
*
Karamata's Inequality In mathematics, Karamata's inequality, named after Jovan Karamata, also known as the majorization inequality, is a theorem in elementary algebra for convex and concave real-valued functions, defined on an interval of the real line. It generalizes t ...
*
Schur-convex function In mathematics, a Schur-convex function, also known as S-convex, isotonic function and order-preserving function is a function f: \mathbb^d\rightarrow \mathbb that for all x,y\in \mathbb^d such that x is majorized by y, one has that f(x)\le f(y). ...
* Schur–Horn theorem relating diagonal entries of a matrix to its eigenvalues. * For positive
integer number An integer is the number zero ( 0), a positive natural number (1, 2, 3, ...), or the negation of a positive natural number ( −1, −2, −3, ...). The negations or additive inverses of the positive natural numbers are referred to as negative ...
s, weak majorization is called
Dominance order In discrete mathematics, dominance order (synonyms: dominance ordering, majorization order, natural ordering) is a partial order on the set of partition (number theory), partitions of a positive integer ''n'' that plays an important role in algeb ...
. *
Leximin order In mathematics, leximin order is a total preorder on finite-dimensional vectors. A more accurate but less common term is leximin preorder. The leximin order is particularly important in social choice theory and fair division. Definition A vec ...


Notes


References

* J. Karamata. "Sur une inegalite relative aux fonctions convexes." ''Publ. Math. Univ. Belgrade'' 1, 145–158, 1932. * G. H. Hardy, J. E. Littlewood and G. Pólya, ''Inequalities'', 2nd edition, 1952, Cambridge University Press, London. * ''Inequalities: Theory of Majorization and Its Applications'' Albert W. Marshall,
Ingram Olkin Ingram Olkin (July 23, 1924 – April 28, 2016) was a professor emeritus and chair of statistics and education at Stanford University and the Stanford Graduate School of Education. He is known for developing statistical analysis for evaluatin ...
, Barry Arnold, Second edition. Springer Series in Statistics. Springer, New York, 2011.
A tribute to Marshall and Olkin's book "Inequalities: Theory of Majorization and its Applications"
* ''Matrix Analysis'' (1996) Rajendra Bhatia, Springer, * ''Topics in Matrix Analysis'' (1994) Roger A. Horn and Charles R. Johnson, Cambridge University Press, * ''Majorization and Matrix Monotone Functions in Wireless Communications'' (2007) Eduard Jorswieck and Holger Boche, Now Publishers, * ''The Cauchy Schwarz Master Class'' (2004) J. Michael Steele, Cambridge University Press, {{ISBN, 978-0-521-54677-5


External links






Software

*
OCTAVE In music, an octave (: eighth) or perfect octave (sometimes called the diapason) is an interval between two notes, one having twice the frequency of vibration of the other. The octave relationship is a natural phenomenon that has been referr ...
/
MATLAB MATLAB (an abbreviation of "MATrix LABoratory") is a proprietary multi-paradigm programming language and numeric computing environment developed by MathWorks. MATLAB allows matrix manipulations, plotting of functions and data, implementat ...
br>code to check majorization
Order theory Linear algebra