In
economics
Economics () is a behavioral science that studies the Production (economics), production, distribution (economics), distribution, and Consumption (economics), consumption of goods and services.
Economics focuses on the behaviour and interac ...
, the Gini coefficient ( ), also known as the Gini index or Gini ratio, is a
measure of statistical dispersion intended to represent the
income inequality
In economics, income distribution covers how a country's total GDP is distributed amongst its population. Economic theory and economic policy have long seen income and its distribution as a central concern. Unequal distribution of income causes ...
, the
wealth inequality
The distribution of wealth is a comparison of the wealth of various members or groups in a society. It shows one aspect of economic inequality or economic heterogeneity.
The distribution of wealth differs from the income distribution in that ...
, or the
consumption inequality within a nation or a
social group
In the social sciences, a social group is defined as two or more people who interact with one another, share similar characteristics, and collectively have a sense of unity. Regardless, social groups come in a myriad of sizes and varieties. F ...
. It was developed by Italian
statistician
A statistician is a person who works with Theory, theoretical or applied statistics. The profession exists in both the private sector, private and public sectors.
It is common to combine statistical knowledge with expertise in other subjects, a ...
and
sociologist Corrado Gini.
The Gini coefficient measures the
inequality among the values of a
frequency distribution
In statistics, the frequency or absolute frequency of an Event (probability theory), event i is the number n_i of times the observation has occurred/been recorded in an experiment or study. These frequencies are often depicted graphically or tabu ...
, such as
income
Income is the consumption and saving opportunity gained by an entity within a specified timeframe, which is generally expressed in monetary terms. Income is difficult to define conceptually and the definition may be different across fields. F ...
levels. A Gini coefficient of 0 reflects perfect equality, where all income or wealth values are the same. In contrast, a Gini coefficient of 1 (or 100%) reflects maximal inequality among values, where a single individual has all the income while all others have none.
Corrado Gini proposed the Gini coefficient as a measure of
inequality of
income
Income is the consumption and saving opportunity gained by an entity within a specified timeframe, which is generally expressed in monetary terms. Income is difficult to define conceptually and the definition may be different across fields. F ...
or
wealth
Wealth is the abundance of valuable financial assets or physical possessions which can be converted into a form that can be used for transactions. This includes the core meaning as held in the originating Old English word , which is from an ...
. For
OECD countries in the late 20th century, considering the effect of
tax
A tax is a mandatory financial charge or levy imposed on an individual or legal entity by a governmental organization to support government spending and public expenditures collectively or to regulate and reduce negative externalities. Tax co ...
es and
transfer payments
In macroeconomics and finance, a transfer payment (also called a government transfer or simply fiscal transfer) is a redistribution of income and wealth by means of the government making a payment, without goods or services being received in r ...
, the income Gini coefficient ranged between 0.24 and 0.49, with
Slovakia
Slovakia, officially the Slovak Republic, is a landlocked country in Central Europe. It is bordered by Poland to the north, Ukraine to the east, Hungary to the south, Austria to the west, and the Czech Republic to the northwest. Slovakia's m ...
being the lowest and
Mexico
Mexico, officially the United Mexican States, is a country in North America. It is the northernmost country in Latin America, and borders the United States to the north, and Guatemala and Belize to the southeast; while having maritime boundar ...
the highest.
African countries had the highest pre-tax Gini coefficients in 2008–2009, with
South Africa
South Africa, officially the Republic of South Africa (RSA), is the Southern Africa, southernmost country in Africa. Its Provinces of South Africa, nine provinces are bounded to the south by of coastline that stretches along the Atlantic O ...
having the world's highest, estimated to be 0.63 to 0.7. However, this figure drops to 0.52 after social assistance is taken into account and drops again to 0.47 after taxation.
Slovakia has the lowest Gini coefficient, with a Gini coefficient of 0.232. Various sources have estimated the Gini coefficient of the global income in 2005 to be between 0.61 and 0.68.
There are multiple issues in interpreting a Gini coefficient, as the same value may result from many different distribution curves. The demographic structure should be taken into account to mitigate this. Countries with an aging population or those with an increased birth rate experience an increasing pre-tax Gini coefficient even if real income distribution for working adults remains constant. Many scholars have devised over a dozen variants of the Gini coefficient.
History
The Italian statistician
Corrado Gini developed the Gini coefficient and published it in his 1912 paper ''Variabilità e mutabilità'' (). Building on the work of American economist
Max Lorenz, Gini proposed using the difference between the hypothetical straight line depicting perfect equality and the actual line depicting people's incomes as a measure of inequality. In this paper, he introduced the concept of simple mean difference as a measure of variability.
He then applied the simple mean difference of observed variables to income and wealth inequality in his work ''On the measurement of concentration and variability of characters'' in 1914. Here, he presented the concentration
ratio
In mathematics, a ratio () shows how many times one number contains another. For example, if there are eight oranges and six lemons in a bowl of fruit, then the ratio of oranges to lemons is eight to six (that is, 8:6, which is equivalent to the ...
, which further developed into today's Gini coefficient. Secondly, Gini observed that improving methods introduced by Lorenz, Chatelain, or Séailles could also achieve his proposed ratio.
In 1915,
Gaetano Pietra
Gaetano Pietra (10 August 1879, in Castiglione delle Stiviere, Italy – 1961, in Villanova dello Iudrio, Udine, Italy) was an Italian statistician.
Life and career
Gaetano Pietra was a professor at the University of Ferrara. In 1927, he f ...
introduced a geometrical interpretation between Gini's proposed ratio and between the observed area of concentration and maximum concentration. This altered version of the Gini coefficient became the most commonly used inequality index in upcoming years.
According to data from the
OECD
The Organisation for Economic Co-operation and Development (OECD; , OCDE) is an international organization, intergovernmental organization with 38 member countries, founded in 1961 to stimulate economic progress and international trade, wor ...
, the Gini coefficient was first officially used country-wide in
Canada
Canada is a country in North America. Its Provinces and territories of Canada, ten provinces and three territories extend from the Atlantic Ocean to the Pacific Ocean and northward into the Arctic Ocean, making it the world's List of coun ...
in the 1970s. Canadian index of income inequality ranged from 0.303 to 0.284 from 1976 to the end of the 1980s. The OECD has published more data on countries since the start of the 21st century. The Central European countries of
Slovenia
Slovenia, officially the Republic of Slovenia, is a country in Central Europe. It borders Italy to the west, Austria to the north, Hungary to the northeast, Croatia to the south and southeast, and a short (46.6 km) coastline within the Adriati ...
,
Czechia, and
Slovakia
Slovakia, officially the Slovak Republic, is a landlocked country in Central Europe. It is bordered by Poland to the north, Ukraine to the east, Hungary to the south, Austria to the west, and the Czech Republic to the northwest. Slovakia's m ...
have had the lowest inequality index of all OECD countries ever since the 2000s.
Scandinavia
Scandinavia is a subregion#Europe, subregion of northern Europe, with strong historical, cultural, and linguistic ties between its constituent peoples. ''Scandinavia'' most commonly refers to Denmark, Norway, and Sweden. It can sometimes also ...
n countries also frequently appeared at the top of the equality list in recent decades.
Definition

The Gini coefficient is an index for the degree of inequality in the distribution of income/wealth, used to estimate how far a country's wealth or income distribution deviates from an equal distribution.
The Gini coefficient is usually defined
mathematically based on the
Lorenz curve
In economics, the Lorenz curve is a graphical representation of the distribution of income or of wealth. It was developed by Max O. Lorenz in 1905 for representing Economic inequality, inequality of the wealth distribution.
The curve is a graph ...
, which plots the proportion of the total income of the population (y-axis) that is cumulatively earned by the bottom ''x'' of the population (see diagram). The line at 45 degrees thus represents perfect equality of incomes. The Gini coefficient can then be thought of as the ratio of the area that lies between the line of equality and the Lorenz curve (marked ''A'' in the diagram) over the total area under the line of equality (marked ''A'' and ''B'' in the diagram); i.e., . If there are no negative incomes, it is also equal to 2''A'' and due to the fact that .
Assuming non-negative income or wealth for all, the Gini coefficient's theoretical range is from 0 (total equality) to 1 (absolute inequality). This measure is often rendered as a percentage, spanning 0 to 100. However, if negative values are factored in, as in cases of debt, the Gini index could exceed 1. Typically, we presuppose a positive mean or total, precluding a Gini coefficient below zero.
An alternative approach is to define the Gini coefficient as half of the
relative mean absolute difference, which is equivalent to the definition based on the
Lorenz curve
In economics, the Lorenz curve is a graphical representation of the distribution of income or of wealth. It was developed by Max O. Lorenz in 1905 for representing Economic inequality, inequality of the wealth distribution.
The curve is a graph ...
. The mean absolute difference is the average
absolute difference
The absolute difference of two real numbers x and y is given by , x-y, , the absolute value of their difference. It describes the distance on the real line between the points corresponding to x and y, and is a special case of the Lp distance fo ...
of all pairs of items of the population, and the relative mean
absolute difference
The absolute difference of two real numbers x and y is given by , x-y, , the absolute value of their difference. It describes the distance on the real line between the points corresponding to x and y, and is a special case of the Lp distance fo ...
is the mean absolute difference divided by the
average
In colloquial, ordinary language, an average is a single number or value that best represents a set of data. The type of average taken as most typically representative of a list of numbers is the arithmetic mean the sum of the numbers divided by ...
,
, to normalize for scale. If ''x''
''i'' is the wealth or income of person ''i'', and there are ''n'' persons, then the Gini coefficient ''G'' is given by:
:
When the income (or wealth) distribution is given as a continuous
probability density function
In probability theory, a probability density function (PDF), density function, or density of an absolutely continuous random variable, is a Function (mathematics), function whose value at any given sample (or point) in the sample space (the s ...
''p''(''x''), the Gini coefficient is again half of the relative mean absolute difference:
:
where
is the mean of the distribution, and the lower limits of integration may be replaced by zero when all incomes are positive.
Calculation

While the income distribution of any particular country
will not correspond perfectly to the theoretical models, these models can provide a qualitative explanation of the income distribution in a nation given the Gini coefficient.
Example: Two levels of income
The extreme cases are represented by the most equal possible society in which every person receives the same income (), and the most unequal society (with ''N'' individuals) where a single person receives 100% of the total income and the remaining people receive none ().
A simple case assumes just two levels of income, low and high. If the high income group is a proportion ''u'' of the population and earns a proportion ''f'' of all income, then the Gini coefficient is . A more graded distribution with these same values ''u'' and ''f'' will always have a higher Gini coefficient than .
For example, if the wealthiest ''u ='' 20% of the population has ''f ='' 80% of all income (see
Pareto principle), the income Gini coefficient is at least 60%. In another example, if ''u ='' 1% of the world's population owns ''f ='' 50% of all wealth, the wealth Gini coefficient is at least 49%.
Alternative expressions
In some cases, this equation can be applied to calculate the Gini coefficient without direct reference to the
Lorenz curve
In economics, the Lorenz curve is a graphical representation of the distribution of income or of wealth. It was developed by Max O. Lorenz in 1905 for representing Economic inequality, inequality of the wealth distribution.
The curve is a graph ...
. For example, (taking ''y'' to indicate the income or wealth of a person or household):
* For a population of ''n'' individuals with values
,
::
:This may be simplified to:
::
The Gini coefficient can also be considered as half the
relative mean absolute difference. For a random sample ''S'' with values
, the sample Gini coefficient
:
is a
consistent estimator
In statistics, a consistent estimator or asymptotically consistent estimator is an estimator—a rule for computing estimates of a parameter ''θ''0—having the property that as the number of data points used increases indefinitely, the result ...
of the population Gini coefficient, but is not in general
unbiased
Bias is a disproportionate weight ''in favor of'' or ''against'' an idea or thing, usually in a way that is inaccurate, closed-minded, prejudicial, or unfair. Biases can be innate or learned. People may develop biases for or against an individ ...
. In simplified form:
:
There does not exist a sample statistic that is always an unbiased estimator of the population Gini coefficient.
Discrete probability distribution
For a
discrete probability distribution
In probability theory and statistics, a probability distribution is a function that gives the probabilities of occurrence of possible events for an experiment. It is a mathematical description of a random phenomenon in terms of its sample spa ...
with probability mass function
, where
is the fraction of the population with income or wealth
, the Gini coefficient is:
:
where
:
If the points with non-zero probabilities are indexed in increasing order
, then:
:
where
:
and
These formulas are also applicable in the limit, as
Continuous probability distribution
When the population is large, the income distribution may be represented by a continuous
probability density function
In probability theory, a probability density function (PDF), density function, or density of an absolutely continuous random variable, is a Function (mathematics), function whose value at any given sample (or point) in the sample space (the s ...
''f''(''x'') where ''f''(''x'') ''dx'' is the fraction of the population with wealth or income in the interval ''dx'' about ''x''. If ''F''(''x'') is the
cumulative distribution function
In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x.
Ever ...
for ''f''(''x''):
:
and ''L''(''x'') is the Lorenz function:
:
then the
Lorenz curve
In economics, the Lorenz curve is a graphical representation of the distribution of income or of wealth. It was developed by Max O. Lorenz in 1905 for representing Economic inequality, inequality of the wealth distribution.
The curve is a graph ...
''L''(''F'') may then be represented as a function parametric in ''L''(''x'') and ''F''(''x'') and the value of ''B'' can be found by
integration:
:
The Gini coefficient can also be calculated directly from the
cumulative distribution function
In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x.
Ever ...
of the distribution ''F''(''y''). Defining ''μ'' as the mean of the distribution, then specifying that ''F''(''y'') is zero for all negative values, the Gini coefficient is given by:
:
The latter result comes from
integration by parts
In calculus, and more generally in mathematical analysis, integration by parts or partial integration is a process that finds the integral of a product of functions in terms of the integral of the product of their derivative and antiderivati ...
. ''(Note that this formula can be applied when there are negative values if the integration is taken from minus infinity to plus infinity.)''
The Gini coefficient may be expressed in terms of the
quantile function ''Q''(''F'') ''(inverse of the cumulative distribution function: Q(F(x)) = x)''
:
Since the Gini coefficient is
independent of scale, if the distribution function can be expressed in the form ''f(x,φ,a,b,c...)'' where ''φ'' is a scale factor and ''a, b, c...'' are dimensionless parameters, then the Gini coefficient will be a function only of ''a, b, c...''.
For example, for the
exponential distribution
In probability theory and statistics, the exponential distribution or negative exponential distribution is the probability distribution of the distance between events in a Poisson point process, i.e., a process in which events occur continuousl ...
, which is a function of only ''x'' and a scale parameter, the Gini coefficient is a constant, equal to 1/2.
For some functional forms, the Gini index can be calculated explicitly. For example, if ''y'' follows a
log-normal distribution
In probability theory, a log-normal (or lognormal) distribution is a continuous probability distribution of a random variable whose logarithm is normal distribution, normally distributed. Thus, if the random variable is log-normally distributed ...
with the standard deviation of logs equal to
, then
where
is the
error function
In mathematics, the error function (also called the Gauss error function), often denoted by , is a function \mathrm: \mathbb \to \mathbb defined as:
\operatorname z = \frac\int_0^z e^\,\mathrm dt.
The integral here is a complex Contour integrat ...
( since
, where
is the cumulative distribution function of a standard normal distribution).
[Crow, E. L., & Shimizu, K. (Eds.). (1988). Lognormal distributions: Theory and applications (Vol. 88). New York: M. Dekker, page 11.] In the table below, some examples for probability density functions with support on
are shown. The Dirac delta distribution represents the case where everyone has the same wealth (or income); it implies no variations between incomes.
:
*
is the Gamma function
*
is the Beta function
*
is the Beta function, Regularized incomplete beta function
Other approaches
Sometimes the entire Lorenz curve is not known, and only values at certain intervals are given. In that case, the Gini coefficient can be approximated using various techniques for
interpolating the missing values of the Lorenz curve. If (''X''
''k'', ''Y''
''k'') are the known points on the Lorenz curve, with the ''X''
''k'' indexed in increasing order (''X''
''k'' – 1 < ''X''
''k''), so that:
* ''X''
''k'' is the cumulated proportion of the population variable, for ''k'' = 0,...,''n'', with ''X''
0 = 0, ''X''
''n'' = 1.
* ''Y''
''k'' is the cumulated proportion of the income variable, for ''k'' = 0,...,''n'', with ''Y''
0 = 0, ''Y''
''n'' = 1.
* ''Y''
''k'' should be indexed in non-decreasing order (''Y''
''k'' > ''Y''
''k'' – 1)
If the Lorenz curve is approximated on each interval as a line between consecutive points, then the area B can be approximated with
trapezoids
In geometry, a trapezoid () in North American English, or trapezium () in British English, is a quadrilateral that has at least one pair of parallel sides.
The parallel sides are called the ''bases'' of the trapezoid. The other two sides are ...
and:
:
is the resulting approximation for G. More accurate results can be obtained using other methods to
approximate the area B, such as approximating the Lorenz curve with a
quadratic function
In mathematics, a quadratic function of a single variable (mathematics), variable is a function (mathematics), function of the form
:f(x)=ax^2+bx+c,\quad a \ne 0,
where is its variable, and , , and are coefficients. The mathematical expression, e ...
across pairs of intervals or building an appropriately smooth approximation to the underlying distribution function that matches the known data. If the population mean and boundary values for each interval are also known, these can also often be used to improve the accuracy of the approximation.
The Gini coefficient calculated from a sample is a statistic, and its standard error, or confidence intervals for the population Gini coefficient, should be reported. These can be calculated using
bootstrap techniques, mathematically complicated and computationally demanding even in an era of fast computers. Economist
Tomson Ogwang made the process more efficient by setting up a "trick regression model" in which respective income variables in the sample are ranked, with the lowest income being allocated rank 1. The model then expresses the rank (dependent variable) as the sum of a constant ''A'' and a
normal error term whose variance is inversely proportional to ''y''
''k'':
:
Thus, ''G'' can be expressed as a function of the weighted
least squares estimate of the constant ''A'' and that this can be used to speed up the calculation of the
jackknife estimate for the standard error. Economist David Giles argued that the
standard error
The standard error (SE) of a statistic (usually an estimator of a parameter, like the average or mean) is the standard deviation of its sampling distribution or an estimate of that standard deviation. In other words, it is the standard deviati ...
of the estimate of ''A'' can be used to derive the estimate of ''G'' directly without using a jackknife. This method only requires using ordinary least squares regression after ordering the sample data. The results compare favorably with the estimates from the
jackknife with agreement improving with increasing sample size.
However, it has been argued that this depends on the model's assumptions about the error distributions and the independence of error terms. These assumptions are often not valid for real data sets. There is still ongoing debate surrounding this topic.
Guillermina Jasso and
Angus Deaton
Sir Angus Stewart Deaton (born 19 October 1945) is a British-American economist and academic. Deaton is currently a Senior Scholar and the Dwight D. Eisenhower Professor of Economics and International Affairs Emeritus at the Princeton School ...
independently proposed the following formula for the Gini coefficient:
:
where
is mean income of the population, P
i is the income rank P of person i, with income X, such that the richest person receives a rank of 1 and the poorest a rank of ''N''. This effectively gives higher weight to poorer people in the income distribution, which allows the Gini to meet the
Transfer Principle
In model theory, a transfer principle states that all statements of some language that are true for some structure are true for another structure. One of the first examples was the Lefschetz principle, which states that any sentence in the firs ...
. Note that the Jasso-Deaton formula rescales the coefficient so that its value is one if all the
are zero except one. Note however Allison's reply on the need to divide by N² instead.
FAO
The Food and Agriculture Organization of the United Nations; . (FAO) is a List of specialized agencies of the United Nations, specialized agency of the United Nations that leads international efforts to defeat hunger and improve nutrition ...
explains another version of the formula.
Generalized inequality indices
The Gini coefficient and other standard inequality indices reduce to a common form. Perfect equality—the absence of inequality—exists when and only when the inequality ratio,
, equals 1 for all j units in some population (for example, there is perfect income equality when everyone's income
equals the mean income
, so that
for everyone). Measures of inequality, then, are measures of the average deviations of the
from 1; the greater the average deviation, the greater the inequality. Based on these observations the inequality indices have this common form:
:
where ''p''
''j'' weights the units by their population share, and ''f''(''r''
''j'') is a function of the deviation of each unit's ''r''
''j'' from 1, the point of equality. The insight of this generalized inequality index is that inequality indices differ because they employ different functions of the distance of the inequality ratios (the ''r''
''j'') from 1.
Of income distributions
Gini coefficients of income are calculated on a market income and a disposable income basis. The Gini coefficient on market income—sometimes referred to as a pre-tax Gini coefficient—is calculated on income before taxes and transfers. It measures inequality in income without considering the effect of taxes and social spending already in place in a country. The Gini coefficient on disposable income—sometimes referred to as the after-tax Gini coefficient—is calculated on income after taxes and transfers. It measures inequality in income after considering the effect of taxes and social spending already in place in a country.
For
OECD
The Organisation for Economic Co-operation and Development (OECD; , OCDE) is an international organization, intergovernmental organization with 38 member countries, founded in 1961 to stimulate economic progress and international trade, wor ...
countries over the 2008–2009 period, the Gini coefficient (pre-taxes and transfers) for a total population ranged between 0.34 and 0.53, with South Korea the lowest and Italy the highest. The Gini coefficient (after-taxes and transfers) for a total population ranged between 0.25 and 0.48, with Denmark the lowest and Mexico the highest. For the United States, the country with the largest population among OECD countries, the pre-tax Gini index was 0.49, and the after-tax Gini index was 0.38 in 2008–2009. The OECD average for total populations in OECD countries was 0.46 for the pre-tax income Gini index and 0.31 for the after-tax income Gini index.
Taxes and social spending that were in place in 2008–2009 period in OECD countries significantly lowered effective income inequality, and in general, "European countries—especially Nordic and Continental
welfare states—achieve lower levels of income inequality than other countries."
Using the Gini can help quantify differences in
welfare
Welfare may refer to:
Philosophy
*Well-being (happiness, prosperity, or flourishing) of a person or group
* Utility in utilitarianism
* Value in value theory
Economics
* Utility, a general term for individual well-being in economics and decision ...
and
compensation policies and philosophies. However, it should be borne in mind that the Gini coefficient can be misleading when used to make political comparisons between large and small countries or those with different immigration policies (see
limitations section).
The Gini coefficient for the entire world has been estimated by various parties to be between 0.61 and 0.68.
The graph shows the values expressed as a percentage in their historical development for a number of countries.
Regional income Gini indices
According to UNICEF, Latin America and the Caribbean region had the highest net income Gini index in the world at 48.3, on an unweighted average basis in 2008. The remaining regional averages were: sub-Saharan Africa (44.2), Asia (40.4), Middle East and North Africa (39.2), Eastern Europe and Central Asia (35.4), and High-income Countries (30.9). Using the same method, the United States is claimed to have a Gini index of 36, while South Africa had the highest income Gini index score of 67.8.
World income Gini index since 1800s
Taking income distribution of all human beings, worldwide income inequality has been constantly increasing since the early 19th century (and will keep on increasing over the years) . There was a steady increase in the global income inequality Gini score from 1820 to 2002, with a significant increase between 1980 and 2002. This trend appears to have peaked and begun a reversal with rapid economic growth in emerging economies, particularly in the large populations of
BRIC countries.
The table below presents the estimated world income Gini coefficients over the last 200 years, as calculated by Milanovic.
More detailed data from similar sources plots a continuous decline since 1988. This is attributed to
globalization
Globalization is the process of increasing interdependence and integration among the economies, markets, societies, and cultures of different countries worldwide. This is made possible by the reduction of barriers to international trade, th ...
increasing incomes for billions of poor people, mostly in countries like China and India. Developing countries like Brazil have also improved basic services like health care, education, and sanitation; others like Chile and Mexico have enacted more
progressive tax
A progressive tax is a tax in which the tax rate increases as the taxable amount increases. The term ''progressive'' refers to the way the tax rate progresses from low to high, with the result that a taxpayer's average tax rate is less than the ...
policies.
Of social development
The Gini coefficient is widely used in fields as diverse as sociology, economics, health science, ecology, engineering, and agriculture.
For example, in social sciences and economics, in addition to income Gini coefficients, scholars have published education Gini coefficients and opportunity Gini coefficients.
Education
Education Gini index estimates the inequality in education for a given population. It is used to discern trends in social development through educational attainment over time. A study across 85 countries by three
World Bank
The World Bank is an international financial institution that provides loans and Grant (money), grants to the governments of Least developed countries, low- and Developing country, middle-income countries for the purposes of economic development ...
economists, Vinod Thomas, Yan Wang, and Xibo Fan, estimated Mali had the highest education Gini index of 0.92 in 1990 (implying very high inequality in educational attainment across the population), while the United States had the lowest education inequality Gini index of 0.14. Between 1960 and 1990, China, India and South Korea had the fastest drop in education inequality Gini Index. They also claim education Gini index for the United States slightly increased over the 1980–1990 period.
Though India's education Gini Index has been falling from 1960 through 1990, most of the population still has not received any education, while 10 percent of the population received more than 40% of the total educational hours in the nation. This means that a large portion of capable children in the country are not receiving the support necessary to allow them to become positive contributors to society. This will lead to a deadweight loss to the national society because there are many people who are underdeveloped and underutilized.
Opportunity
Similar in concept to the Gini income coefficient, the Gini opportunity coefficient measures inequality in opportunities.
The concept builds on
Amartya Sen
Amartya Kumar Sen (; born 3 November 1933) is an Indian economist and philosopher. Sen has taught and worked in England and the United States since 1972. In 1998, Sen received the Nobel Memorial Prize in Economic Sciences for his contributions ...
's suggestion that inequality coefficients of social development should be premised on the process of enlarging people's choices and enhancing their capabilities, rather than on the process of reducing income inequality. Kovacevic, in a review of the Gini opportunity coefficient, explained that the coefficient estimates how well a society enables its citizens to achieve success in life where the success is based on a person's choices, efforts and talents, not their background defined by a set of predetermined circumstances at birth, such as gender, race, place of birth, parent's income and circumstances beyond the control of that individual.
In 2003, Roemer
reported Italy and Spain exhibited the largest opportunity inequality Gini index amongst advanced economies.
Income mobility
In 1978,
Anthony Shorrocks introduced a measure based on income Gini coefficients to estimate income mobility. This measure, generalized by Maasoumi and Zandvakili, is now generally referred to as
Shorrocks index, sometimes as Shorrocks mobility index or Shorrocks rigidity index. It attempts to estimate whether the income inequality Gini coefficient is permanent or temporary and to what extent a country or region enables economic mobility to its people so that they can move from one (e.g., bottom 20%) income quantile to another (e.g., middle 20%) over time. In other words, the Shorrocks index compares inequality of short-term earnings, such as the annual income of households, to inequality of long-term earnings, such as 5-year or 10-year total income for the same households.
Shorrocks index is calculated in several different ways, a common approach being from the ratio of income Gini coefficients between short-term and long-term for the same region or country.
A 2010 study using social security income data for the United States since 1937 and Gini-based Shorrock's indices concludes that income mobility in the United States has had a complicated history, primarily due to the mass influx of women into the American labor force after World War II. Income inequality and income mobility trends have been different for men and women workers between 1937 and the 2000s. When men and women are considered together, the Gini coefficient-based Shorrocks index trends imply long-term income inequality has been substantially reduced among all workers, in recent decades for the United States.
Other scholars, using just 1990s data or other short periods have come to different conclusions. For example, Sastre and Ayala conclude from their study of income Gini coefficient data between 1993 and 1998 for six developed economies that France had the least income mobility, Italy the highest, and the United States and Germany intermediate levels of income mobility over those five years.
Features
The Gini coefficient has features that make it useful as a measure of dispersion in a population, and inequalities in particular.
The coefficient ranges from 0, for perfect equality, to 1, indicating perfect inequality. The Gini is based on the comparison of cumulative proportions of the population against cumulative proportions of income they receive.
Limitations
Relative, not absolute
The Gini coefficient is a relative measure. The Gini coefficient of a developing country can rise (due to increasing inequality of income) even when the number of people in absolute poverty decreases. This is because the Gini coefficient measures relative, not absolute, wealth.
Gini coefficients are simple, and this simplicity can lead to oversights and can confuse the comparison of different populations; for example, while both Bangladesh (per capita income of $1,693) and the Netherlands (per capita income of $42,183) had an income Gini coefficient of 0.31 in 2010,
the quality of life, economic opportunity and absolute income in these countries are very different, i.e. countries may have identical Gini coefficients, but differ greatly in wealth. Basic necessities may be available to all in a developed economy, while in an undeveloped economy with the same Gini coefficient, basic necessities may be unavailable to most or unequally available due to lower absolute wealth.
Mathematical limitations
Gini has some mathematical limitations as well. It is not additive and different sets of people cannot be averaged to obtain the Gini coefficient of all the people in the sets.
Even when the total income of a population is the same, in certain situations two countries with different income distributions can have the same Gini index (e.g. cases when income Lorenz Curves cross).
Table A illustrates one such situation. Both countries have a Gini coefficient of 0.2, but the average income distributions for household groups are different. As another example, in a population where the lowest 50% of individuals have no income, and the other 50% have equal income, the Gini coefficient is 0.5; whereas for another population where the lowest 75% of people have 25% of income and the top 25% have 75% of the income, the Gini index is also 0.5. Economies with similar incomes and Gini coefficients can have very different income distributions. Bellù and Liberati claim that ranking income inequality between two populations is not always possible based on their Gini indices.
Similarly, computational social scientist Fabian Stephany illustrates that income inequality within the population, e.g., in specific socioeconomic groups of same age and education, also remains undetected by conventional Gini indices.
Income Gini can conceal wealth inequality
A Gini index does not contain information about absolute national or personal incomes. Populations can simultaneously have very low income Gini indices and very high wealth Gini indexes. By measuring inequality in income, the Gini ignores the differential efficiency of the use of household income. By ignoring wealth (except as it contributes to income), the Gini can create the appearance of inequality when the people compared are at different stages in their life. Wealthy countries such as Sweden can show a low Gini coefficient for the disposable income of 0.31, thereby appearing equal, yet have a very high Gini coefficient for wealth of 0.79 to 0.86, suggesting an extremely unequal wealth distribution in its society. These factors are not assessed in income-based Gini.
Country size and granularity bias
Gini index has a downward-bias for small populations. Counties or states or countries with small populations and less diverse economies will tend to report small Gini coefficients. For economically diverse large population groups, a much higher coefficient is expected than for each of its regions. For example, taking the world economy as a whole and income distribution for all human beings, different scholars estimate the global Gini index to range between 0.61 and 0.68.
As with other inequality coefficients, the Gini coefficient is influenced by the
granularity
Granularity (also called graininess) is the degree to which a material or system is composed of distinguishable pieces, "granules" or "grains" (metaphorically).
It can either refer to the extent to which a larger entity is subdivided, or the ...
of the measurements. For example, five 20% quantiles (low granularity) will usually yield a lower Gini coefficient than twenty 5% quantiles (high granularity) for the same distribution. Philippe Monfort has shown that using inconsistent or unspecified granularity limits the usefulness of Gini coefficient measurements.
Changes in population
Changing income inequality, measured by Gini coefficients, can be due to structural changes in a society such as growing population (increased birth rates, aging populations, emigration, immigration) and income mobility.
Another limitation of the Gini coefficient is that it is not a proper measure of
egalitarianism
Egalitarianism (; also equalitarianism) is a school of thought within political philosophy that builds on the concept of social equality, prioritizing it for all people. Egalitarian doctrines are generally characterized by the idea that all hum ...
, as it only measures income dispersion. For example, suppose two equally egalitarian countries pursue different
immigration policies. In that case, the country accepting a higher proportion of low-income or impoverished migrants will report a higher Gini coefficient and, therefore, may exhibit more income inequality.
Household vs individual
The Gini coefficient measure gives different results when applied to individuals instead of households, for the same economy and same income distributions. If household data is used, the measured value of income Gini depends on how the household is defined. The comparison is not meaningful when different populations are not measured with consistent definitions. Furthermore, changes to the household income Gini can be driven by changes in household formation, such as increased divorce rates or
extended family
An extended family is a family that extends beyond the nuclear family of parents and their children to include aunts, uncles, grandparents, cousins or other relatives, all living nearby or in the same household. Particular forms include the stem ...
households splitting into
nuclear families.
Deininger and
Squire
In the Middle Ages, a squire was the shield- or armour-bearer of a knight. Boys served a knight as an attendant, doing simple but important tasks such as saddling a horse or caring for the knight's weapons and armour.
Terminology
''Squire'' ...
(1996) show that the income Gini coefficient based on individual income rather than household income is different. For example, for the United States, they found that the individual income-based Gini index was 0.35, while for France, 0.43. According to their individual-focused method, in the 108 countries they studied, South Africa had the world's highest Gini coefficient at 0.62, Malaysia had Asia's highest Gini coefficient at 0.5, Brazil the highest at 0.57 in Latin America and the Caribbean region, and Turkey the highest at 0.5 in OECD countries.
Billionaire
Thomas Kwok claimed the income Gini coefficient for Hong Kong has been high (0.434 in 2010
), in part because of structural changes in its population. Over recent decades, Hong Kong has witnessed increasing numbers of small households, elderly households, and elderly living alone. The combined income is now split into more households. Many older people live separately from their children in Hong Kong. These social changes have caused substantial changes in household income distribution. The income Gini coefficient, claims Kwok, does not discern these structural changes in its society.
Household money income distribution for the United States, summarized in Table C of this section, confirms that this issue is not limited to just Hong Kong. According to the US Census Bureau, between 1979 and 2010, the population of the United States experienced structural changes in overall households; the income for all income brackets increased in inflation-adjusted terms, household income distributions shifted into higher income brackets over time, while the income Gini coefficient increased.
[Congressional Budget Office: Trends in the Distribution of Household Income Between 1979 and 2007]
October 2011. see pp. i–x, with definitions on ii–iii
Instantaneous inequality vs lifetime inequality
The Gini coefficient is unable to discern the effects of structural changes in populations.
Expanding on the importance of life-span measures, the Gini coefficient as a point-estimate of equality at a certain time ignores life-span changes in income. Typically, increases in the proportion of young or old members of a society will drive apparent changes in equality simply because people generally have lower incomes and wealth when they are young than when they are old. Because of this, factors such as age distribution within a population and mobility within income classes can create the appearance of inequality when none exist, taking into account demographic effects. Thus a given economy may have a higher Gini coefficient at any timepoint compared to another, while the Gini coefficient calculated over individuals' lifetime income is lower than the apparently more equal (at a given point in time) economy's.
Essentially, what matters is not just inequality in any particular year but the distribution composition over time.
Benefits and income in kind
Inaccuracies in assign monetary value to
income in kind
Income in kind, or in-kind income, is income other than money income. It includes many employee benefits and government-provided goods and services, such as toll-free roads, food stamps, public schooling, or socialized medicine.
Types of Inc ...
reduce the accuracy of Gini as a measurement of true inequality.
While taxes and cash transfers are relatively straightforward to account for, other government benefits can be difficult to value. Benefits such as subsidized housing, medical care, and education are difficult to value objectively, as it depends on the quality and extent of the benefit. In absence of a free market, valuing these income transfers as household income is subjective. The theoretical model of the Gini coefficient is limited to accepting correct or incorrect subjective assumptions.
In subsistence-driven and
informal economies, people may have significant income in other forms than money, for example, through
subsistence farming
Subsistence agriculture occurs when farmers grow crops on smallholdings to meet the needs of themselves and their families. Subsistence agriculturalists target farm output for survival and for mostly local requirements. Planting decisions occ ...
or
barter
In trade, barter (derived from ''bareter'') is a system of exchange (economics), exchange in which participants in a financial transaction, transaction directly exchange good (economics), goods or service (economics), services for other goods ...
ing. These forms of income tend to accrue to poor segments of populations in emerging and transitional economy countries such as those in sub-Saharan Africa, Latin America, Asia, and Eastern Europe. Informal economy accounts for over half of global employment and as much as 90 percent of employment in some of the poorer sub-Saharan countries with high official Gini inequality coefficients. Schneider et al., in their 2010 study of 162 countries, report about 31.2%, or about $20 trillion, of world's
GDP is informal. In developing countries, the informal economy predominates for all income brackets except the richer, urban upper-income bracket populations. Even in developed economies, 8% (United States) to 27% (Italy) of each nation's GDP is informal. The resulting informal income predominates as a livelihood activity for those in the lowest income brackets. The value and distribution of the incomes from informal or underground economy is difficult to quantify, making true income Gini coefficients estimates difficult.
Different assumptions and quantifications of these incomes will yield different Gini coefficients.
Alternatives
Given the limitations of the Gini coefficient, other statistical methods are used in combination or as an alternative measure of population dispersity. For example, ''entropy measures'' are frequently used (e.g. the
Atkinson index The Atkinson index (also known as the Atkinson measure or Atkinson inequality measure) is a measure of income inequality developed by British economist Anthony Barnes Atkinson. The measure is useful in determining which end of the distribution cont ...
or the
Theil Index
The Theil index is a statistic primarily used to measure economic inequality and other economic phenomena, though it has also been used to measure racial segregation. The Theil index ''T''T is the same as redundancy in information theory which i ...
and
Mean log deviation as special cases of the
generalized entropy index
The generalized entropy index has been proposed as a measure of income inequality in a population. It is derived from information theory as a measure of redundancy in data. In information theory a measure of redundancy can be interpreted as no ...
). These measures attempt to compare the distribution of resources by intelligent agents in the market with a maximum
entropy
Entropy is a scientific concept, most commonly associated with states of disorder, randomness, or uncertainty. The term and the concept are used in diverse fields, from classical thermodynamics, where it was first recognized, to the micros ...
random distribution, which would occur if these agents acted like non-interacting particles in a closed system following the laws of statistical physics.
The Ortego two-parameter model
may be superior to the GINI index.
Relation to other statistical measures
There is a summary measure of the diagnostic ability of a binary classifier system that is also called the ''Gini coefficient'', which is defined as twice the area between the
receiver operating characteristic
A receiver operating characteristic curve, or ROC curve, is a graph of a function, graphical plot that illustrates the performance of a binary classifier model (can be used for multi class classification as well) at varying threshold values. ROC ...
(ROC) curve and its diagonal. It is related to the
AUC (
Area Under the ROC Curve) measure of performance given by
and to
Mann–Whitney U. Although both Gini coefficients are defined as areas between certain curves and share certain properties, there is no simple direct relationship between the Gini coefficient of statistical dispersion and the Gini coefficient of a classifier.
The Gini index is also related to the Pietra index — both of which measure statistical heterogeneity and are derived from the Lorenz curve and the diagonal line.
In certain fields such as ecology, inverse Simpson's index
is used to quantify diversity, and this should not be confused with the
Simpson index . These indicators are related to Gini. The inverse Simpson index increases with diversity, unlike the Simpson index and Gini coefficient, which decrease with diversity. The Simpson index is in the range
, 1
The comma is a punctuation mark that appears in several variants in different languages. Some typefaces render it as a small line, slightly curved or straight, but inclined from the vertical; others give it the appearance of a miniature fille ...
where 0 means maximum and 1 means minimum diversity (or heterogeneity). Since diversity indices typically increase with increasing heterogeneity, the Simpson index is often transformed into inverse Simpson, or using the complement
, known as the Gini-Simpson Index.
The
Lorenz curve
In economics, the Lorenz curve is a graphical representation of the distribution of income or of wealth. It was developed by Max O. Lorenz in 1905 for representing Economic inequality, inequality of the wealth distribution.
The curve is a graph ...
is another method of graphical representation of wealth distribution. It was developed 9 years before the Gini coefficient, which quantifies the extent to which the Lorenz curve deviates from the perfect equality line (with
slope
In mathematics, the slope or gradient of a Line (mathematics), line is a number that describes the direction (geometry), direction of the line on a plane (geometry), plane. Often denoted by the letter ''m'', slope is calculated as the ratio of t ...
of 1). The
Hoover index (also known as Robin Hood index) presents the percentage of total population's income that would have to be redistributed to make the Gini coefficient equal to 0 (perfect equality).
Gini coefficients for pre-modern societies
In recent decades, researchers have attempted to estimate Gini coefficients for pre-20th century societies. In the absence of household income surveys and income taxes, scholars have relied on proxy variables. These include wealth taxes in medieval European city states, patterns of landownership in
Roman Egypt
Roman Egypt was an imperial province of the Roman Empire from 30 BC to AD 642. The province encompassed most of modern-day Egypt except for the Sinai. It was bordered by the provinces of Crete and Cyrenaica to the west and Judaea, ...
, variation of the size of houses in societies from ancient Greece to Aztec Mexico, and inheritance and dowries in Babylonian society. Other data does not directly document variations in wealth or income but are known to reflect inequality, such as the ratio of rents to wages or of labor to capital.
Other uses
Although the Gini coefficient is most popular in economics, it can, in theory, be applied in any field of science that studies a distribution. For example, in ecology, the Gini coefficient has been used as a measure of
biodiversity
Biodiversity is the variability of life, life on Earth. It can be measured on various levels. There is for example genetic variability, species diversity, ecosystem diversity and Phylogenetics, phylogenetic diversity. Diversity is not distribut ...
, where the cumulative proportion of species is plotted against the cumulative proportion of individuals.
In health, it has been used as a measure of the inequality of health-related
quality of life
Quality of life (QOL) is defined by the World Health Organization as "an individual's perception of their position in life in the context of the culture and value systems in which they live and in relation to their goals, expectations, standards ...
in a population.
In education, it has been used as a measure of the inequality of universities.
In chemistry it has been used to express the selectivity of
protein kinase inhibitors against a panel of kinases.
In engineering, it has been used to evaluate the fairness achieved by Internet routers in scheduling packet transmissions from different flows of traffic.
In
machine learning
Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of Computational statistics, statistical algorithms that can learn from data and generalise to unseen data, and thus perform Task ( ...
, it has been used as a unified metric for evaluating many-versus-many (all-to-all) similarity in vector spaces across various data types, including images and text, and to show their effectiveness in guiding machine learning training sample selection, especially in sparse information settings.
The Gini coefficient is sometimes used for the measurement of the discriminatory power of
rating
A rating is an evaluation or assessment of something, in terms of a metric (e.g. quality, quantity, a combination of both,...).
Rating or rating system may also refer to:
Business and economics
* Credit rating, estimating the credit worthiness ...
systems in
credit risk
Credit risk is the chance that a borrower does not repay a loan
In finance, a loan is the tender of money by one party to another with an agreement to pay it back. The recipient, or borrower, incurs a debt and is usually required to pay ...
management.
A 2005 study accessed US census data to measure home computer ownership and used the Gini coefficient to measure inequalities amongst whites and African Americans. Results indicated that although decreasing overall, home computer ownership inequality was substantially smaller among white households.
A 2016 peer-reviewed study titled Employing the Gini coefficient to measure participation inequality in treatment-focused Digital Health Social Networks illustrated that the Gini coefficient was helpful and accurate in measuring shifts in inequality, however as a standalone metric it failed to incorporate overall network size.
Discriminatory power refers to a credit risk model's ability to differentiate between defaulting and non-defaulting clients. The formula
, in the calculation section above, may be used for the final model and at the individual model factor level to quantify the discriminatory power of individual factors. It is related to the accuracy ratio in population assessment models.
The Gini coefficient has also been applied to analyze inequality in
dating apps.
Kaminskiy and Krivtsov
extended the concept of the Gini coefficient from economics to
reliability theory
Reliability engineering is a sub-discipline of systems engineering that emphasizes the ability of equipment to function without failure. Reliability is defined as the probability that a product, system, or service will perform its intended funct ...
and proposed a Gini-type coefficient that helps to assess the degree of aging of non-repairable systems or aging and rejuvenation of repairable systems. The coefficient is defined between −1 and 1 and can be used in both empirical and parametric life distributions. It takes negative values for the class of decreasing failure rate distributions and point processes with decreasing failure intensity rate and is positive for the increasing failure rate distributions and point processes with increasing failure intensity rate. The value of zero corresponds to the
exponential life distribution or the
Homogeneous Poisson Process.
See also
References
Further reading
*
*
*
*
*
*
*
*
*
*
* Reprinted in
*
*
*
*
*
*
*
*
* The Chinese version of this paper was published as
*
External links
* Deutsche Bundesbank
Do banks diversify loan portfolios? 2005 (on using e.g. the Gini coefficient for risk evaluation of loan portfolios)
Measuring Software Project Risk With The Gini Coefficient an application of the Gini coefficient to software
Travis Hale, University of Texas Inequality Project:The Theoretical Basics of Popular Inequality Measures online computation of examples
1A1BArticle from The Guardian analysing inequality in the UK 1974–2006World Income Inequality DatabaseBBC News: What is the Gini coefficient?Income Distribution and Poverty in OECD CountriesU.S. Income Distribution: Just How Unequal?
{{Authority control
1912 introductions
1912 in economic history
Concentration indicators
Demographic economics
Income inequality metrics
Welfare economics
Italian inventions