The Theil index is a statistic primarily used to measure
economic inequality
There are wide varieties of economic inequality, most notably income inequality measured using the distribution of income (the amount of money people are paid) and wealth inequality measured using the distribution of wealth (the amount of ...
and other economic phenomena, though it has also been used to measure racial segregation.
The Theil index ''T''
T is the same as
redundancy in
information theory
Information theory is the scientific study of the quantification, storage, and communication of information. The field was originally established by the works of Harry Nyquist and Ralph Hartley, in the 1920s, and Claude Shannon in the 1940s. ...
which is the maximum possible
entropy
Entropy is a scientific concept, as well as a measurable physical property, that is most commonly associated with a state of disorder, randomness, or uncertainty. The term and the concept are used in diverse fields, from classical thermodyna ...
of the data minus the observed entropy. It is a special case of the
generalized entropy index
The generalized entropy index has been proposed as a measure of income inequality in a population. It is derived from information theory as a measure of redundancy in data. In information theory a measure of redundancy can be interpreted as no ...
. It can be viewed as a measure of redundancy, lack of diversity, isolation, segregation, inequality, non-randomness, and compressibility. It was proposed by a Dutch
econometrician
Econometrics is the application of statistical methods to economic data in order to give empirical content to economic relationships. M. Hashem Pesaran (1987). "Econometrics," '' The New Palgrave: A Dictionary of Economics'', v. 2, p. 8 p. 8� ...
Henri Theil (1924-2000) at the
Erasmus University Rotterdam
Erasmus University Rotterdam (abbreviated as ''EUR'', nl, Erasmus Universiteit Rotterdam ) is a public research university located in Rotterdam, Netherlands. The university is named after Desiderius Erasmus Roterodamus, a 15th-century humani ...
.
Henri Theil himself said (1967): "The (Theil) index can be interpreted as the expected information content of the indirect message which transforms the population shares as prior probabilities into the income shares as posterior probabilities."
Amartya Sen
Amartya Kumar Sen (; born 3 November 1933) is an Indian economist and philosopher, who since 1972 has taught and worked in the United Kingdom and the United States. Sen has made contributions to welfare economics, social choice theory, economi ...
noted, "But the fact remains that the Theil index is an arbitrary formula, and the average of the logarithms of the reciprocals of income shares weighted by income is not a measure that is exactly overflowing with intuitive sense."
Formula
For a population of ''N'' "agents" each with characteristic ''x'', the situation may be represented by the list ''x''
''i'' (''i'' = 1,...,''N'') where ''x''
''i'' is the characteristic of agent ''i''. For example, if the characteristic is income, then ''x
i'' is the income of agent ''i''.
The Theil ''T'' index is defined as
:
and the Theil ''L'' index is defined as
:
where
is the mean income:
:
Theil-L is an income-distribution's dis-entropy per person, measured with respect to maximum entropy (...which is achieved with complete equality).
(In an alternative interpretation of it, Theil-L is the natural-logarithm of the geometric-mean of the ratio: (mean income)/(income i), over all the incomes. The related Atkinson(1) is just 1 minus the geometric-mean of (income i)/(mean income),over the income distribution.)
Because a transfer between a larger income & a smaller one will change the smaller income's ratio more than it changes the larger income's ratio, the transfer-principle is satisfied by this index.
Equivalently, if the situation is characterized by a discrete distribution function ''f''
''k'' (''k'' = 0,...,''W'') where ''f''
''k'' is the fraction of the population with income ''k'' and ''W'' = ''Nμ'' is the total income, then
and the Theil index is:
:
where
is again the mean income:
:
Note that in this case income ''k'' is an integer and ''k=1'' represents the smallest increment of income possible (e.g., cents).
if the situation is characterized by a continuous distribution function ''f''(''k'') (supported from 0 to infinity) where ''f''(''k'') ''dk'' is the fraction of the population with income ''k'' to ''k'' + ''dk'', then the Theil index is:
:
where the mean is:
:
Theil indices for some common continuous probability distributions are given in the table below:
:
If everyone has the same income, then ''T''
T equals 0. If one person has all the income, then ''T''
T gives the result
, which is maximum inequality. Dividing ''T''
T by
can normalize the equation to range from 0 to 1, but then the
independence axiom
The independence of irrelevant alternatives (IIA), also known as binary independence or the independence axiom, is an axiom of decision theory and various social sciences. The term is used in different connotation in several contexts. Although it ...
is violated: