Lorenz curve
   HOME

TheInfoList



OR:

In
economics Economics () is the social science that studies the Production (economics), production, distribution (economics), distribution, and Consumption (economics), consumption of goods and services. Economics focuses on the behaviour and intera ...
, the Lorenz curve is a graphical representation of the
distribution of income In economics, income distribution covers how a country's total GDP is distributed amongst its population. Economic theory and economic policy have long seen income and its distribution as a central concern. Unequal distribution of income causes eco ...
or of
wealth Wealth is the abundance of Value (economics), valuable financial assets or property, physical possessions which can be converted into a form that can be used for financial transaction, transactions. This includes the core meaning as held in the ...
. It was developed by Max O. Lorenz in 1905 for representing
inequality Inequality may refer to: Economics * Attention inequality, unequal distribution of attention across users, groups of people, issues in etc. in attention economy * Economic inequality, difference in economic well-being between population groups * ...
of the
wealth distribution The distribution of wealth is a comparison of the wealth of various members or groups in a society. It shows one aspect of economic inequality or heterogeneity in economics, economic heterogeneity. The distribution of wealth differs from the i ...
. The curve is a
graph Graph may refer to: Mathematics *Graph (discrete mathematics), a structure made of vertices and edges **Graph theory, the study of such graphs and their properties *Graph (topology), a topological space resembling a graph in the sense of discre ...
showing the proportion of overall income or wealth assumed by the bottom ''x''% of the people, although this is not rigorously true for a finite population (see below). It is often used to represent
income distribution In economics, income distribution covers how a country's total GDP is distributed amongst its population. Economic theory and economic policy have long seen income and its distribution as a central concern. Unequal distribution of income causes eco ...
, where it shows for the bottom ''x''% of households, what percentage (''y''%) of the total income they have. The
percentage In mathematics, a percentage (from la, per centum, "by a hundred") is a number or ratio expressed as a fraction of 100. It is often denoted using the percent sign, "%", although the abbreviations "pct.", "pct" and sometimes "pc" are also us ...
of households is plotted on the ''x''-axis, the percentage of income on the ''y''-axis. It can also be used to show distribution of
asset In financial accountancy, financial accounting, an asset is any resource owned or controlled by a business or an economic entity. It is anything (tangible or intangible) that can be used to produce positive economic value. Assets represent value ...
s. In such use, many economists consider it to be a measure of
social inequality Social inequality occurs when resources in a given society are distributed unevenly, typically through norms of allocation, that engender specific patterns along lines of socially defined categories of persons. It posses and creates gender c ...
. The concept is useful in describing inequality among the size of individuals in
ecology Ecology () is the study of the relationships between living organisms, including humans, and their physical environment. Ecology considers organisms at the individual, population, community, ecosystem, and biosphere level. Ecology overlaps wi ...
and in studies of
biodiversity Biodiversity or biological diversity is the variety and variability of life on Earth. Biodiversity is a measure of variation at the genetic (''genetic variability''), species (''species diversity''), and ecosystem (''ecosystem diversity'') l ...
, where the cumulative proportion of species is plotted against the cumulative proportion of individuals. It is also useful in
business modeling Business process modeling (BPM) in business process management and systems engineering is the activity of representing processes of an enterprise, so that the current business processes may be analyzed, improved, and automated. BPM is typically ...
: e.g., in
consumer finance Personal finance is the financial management which an individual or a family unit performs to budget, save, and spend monetary resources over time, taking into account various financial risks and future life events. When planning personal fi ...
, to measure the actual percentage ''y''% of delinquencies attributable to the ''x''% of people with worst risk scores.


Explanation

Data from 2005. Points on the Lorenz curve represent statements such as, "the bottom 20% of all households have 10% of the total income." A perfectly equal income distribution would be one in which every person has the same income. In this case, the bottom ''N''% of society would always have ''N''% of the income. This can be depicted by the straight line ''y'' = ''x''; called the "line of perfect equality." By contrast, a perfectly unequal distribution would be one in which one person has all the income and everyone else has none. In that case, the curve would be at ''y'' = 0% for all ''x'' < 100%, and ''y'' = 100% when ''x'' = 100%. This curve is called the "line of perfect inequality." The Gini coefficient is the ratio of the area between the line of perfect equality and the observed Lorenz curve to the area between the line of perfect equality and the line of perfect inequality. The higher the coefficient, the more unequal the distribution is. In the diagram on the right, this is given by the ratio ''A''/(''A''+''B''), where ''A'' and ''B'' are the areas of regions as marked in the diagram.


Definition and calculation

The Lorenz curve is a probability plot (a
P–P plot In statistics, a P–P plot (probability–probability plot or percent–percent plot or P value plot) is a probability plot for assessing how closely two data sets agree, or for assessing how closely a dataset fits a particular model. It works b ...
) comparing the distribution of a
variable Variable may refer to: * Variable (computer science), a symbolic name associated with a value and whose associated value may be changed * Variable (mathematics), a symbol that represents a quantity in a mathematical expression, as used in many ...
against a hypothetical uniform distribution of that variable. It can usually be represented by a function ''L''(''F''), where ''F'', the cumulative portion of the population, is represented by the horizontal axis, and ''L'', the cumulative portion of the total wealth or income, is represented by the vertical axis. The curve ''L'' need not be a smoothly increasing function of ''F'', For wealth distributions there may be oligarchies or people with negative wealth for instance. For a discrete distribution of Y given by values ''y''1, ..., ''y''''n'' in non-decreasing order ( ''y''''i'' ≤ ''y''''i''+1) and their probabilities f(y_j) := \Pr(Y=y_j) the Lorenz curve is the
continuous Continuity or continuous may refer to: Mathematics * Continuity (mathematics), the opposing concept to discreteness; common examples include ** Continuous probability distribution or random variable in probability and statistics ** Continuous ...
piecewise linear function connecting the points ( ''F''''i'', ''L''''i'' ), ''i'' = 0 to ''n'', where ''F''0 = 0, ''L''0 = 0, and for ''i'' = 1 to ''n'': \begin F_i &:= \sum_^i f(y_j) \\ S_i &:= \sum_^i f(y_j) \, y_j \\ L_i &:= \frac \end When all ''y''''i'' are equally probable with probabilities 1/''n'' this simplifies to \begin F_i &= \frac i n \\ S_i &= \frac 1 n \sum_^i \; y_j \\ L_i &= \frac \end For a
continuous distribution In probability theory and statistics, a probability distribution is the mathematical Function (mathematics), function that gives the probabilities of occurrence of different possible outcomes for an Experiment (probability theory), experiment. ...
with the
probability density function In probability theory, a probability density function (PDF), or density of a continuous random variable, is a function whose value at any given sample (or point) in the sample space (the set of possible values taken by the random variable) can ...
''f'' and the
cumulative distribution function In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x. Ev ...
''F'', the Lorenz curve ''L'' is given by: L(F(x)) = \frac = \frac where \mu denotes the average. The Lorenz curve ''L''(''F'') may then be plotted as a function parametric in ''x'': ''L''(''x'') vs. ''F''(''x''). In other contexts, the quantity computed here is known as the length biased (or size biased) distribution; it also has an important role in renewal theory. Alternatively, for a
cumulative distribution function In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x. Ev ...
''F''(''x'') with inverse ''x''(''F''), the Lorenz curve ''L''(''F'') is directly given by: L(F) = \frac The inverse ''x''(''F'') may not exist because the cumulative distribution function has intervals of constant values. However, the previous formula can still apply by generalizing the definition of ''x''(''F''): x(F_1) = \inf \ where is the
infimum In mathematics, the infimum (abbreviated inf; plural infima) of a subset S of a partially ordered set P is a greatest element in P that is less than or equal to each element of S, if such an element exists. Consequently, the term ''greatest low ...
. For an example of a Lorenz curve, see
Pareto distribution The Pareto distribution, named after the Italian civil engineer, economist, and sociologist Vilfredo Pareto ( ), is a power-law probability distribution that is used in description of social, quality control, scientific, geophysical, actua ...
.


Properties

A Lorenz curve always starts at (0,0) and ends at (1,1). The Lorenz curve is not defined if the mean of the probability distribution is zero or infinite. The Lorenz curve for a probability distribution is a
continuous function In mathematics, a continuous function is a function such that a continuous variation (that is a change without jump) of the argument induces a continuous variation of the value of the function. This means that there are no abrupt changes in value ...
. However, Lorenz curves representing discontinuous functions can be constructed as the limit of Lorenz curves of probability distributions, the line of perfect inequality being an example. The information in a Lorenz curve may be summarized by the Gini coefficient and the
Lorenz asymmetry coefficient The Lorenz asymmetry coefficient (LAC) is a summary statistic of the Lorenz curve that measures the degree of asymmetry of the curve. The Lorenz curve is used to describe the inequality in the distribution of a quantity (usually income or wealth in ...
. The Lorenz curve cannot rise above the line of perfect equality. A Lorenz curve that never falls beneath a second Lorenz curve and at least once runs above it, has Lorenz dominance over the second one. If the variable being measured cannot take negative values, the Lorenz curve: *cannot sink below the line of perfect inequality, *is
increasing In mathematics, a monotonic function (or monotone function) is a function between ordered sets that preserves or reverses the given order. This concept first arose in calculus, and was later generalized to the more abstract setting of order ...
. Note however that a Lorenz curve for
net worth Net worth is the value of all the non-financial and financial assets owned by an individual or institution minus the value of all its outstanding liabilities. Since financial assets minus outstanding liabilities equal net financial assets, net ...
would start out by going negative due to the fact that some people have a negative net worth because of debt. The Lorenz curve is invariant under positive scaling. If ''X'' is a random variable, for any positive number ''c'' the random variable ''c'' X has the same Lorenz curve as ''X''. The Lorenz curve is flipped twice, once about F = 0.5 and once about ''L'' = 0.5, by negation. If ''X'' is a random variable with Lorenz curve ''L''X(''F''), then −''X'' has the Lorenz curve: : ''L'' − X = 1 − ''L'' X (1 − ''F'') The Lorenz curve is changed by translations so that the equality gap ''F'' − ''L''(''F'') changes in proportion to the ratio of the original and translated means. If ''X'' is a random variable with a Lorenz curve ''L'' X (''F'') and mean ''μ'' X , then for any constant ''c'' ≠ −''μ'' X , ''X'' + ''c'' has a Lorenz curve defined by: F - L_(F) = \frac ( F - L_X(F)) For a cumulative distribution function ''F''(''x'') with mean ''μ'' and (generalized) inverse ''x''(''F''), then for any ''F'' with 0 < ''F'' < 1 : *If the Lorenz curve is differentiable:\frac = \frac *If the Lorenz curve is twice differentiable, then the probability density function ''f''(''x'') exists at that point and: \frac = \frac\, *If ''L''(''F'') is continuously differentiable, then the tangent of ''L''(''F'') is parallel to the line of perfect equality at the point ''F''(''μ''). This is also the point at which the equality gap ''F'' − ''L''(''F''), the vertical distance between the Lorenz curve and the line of perfect equality, is greatest. The size of the gap is equal to half of the relative
mean absolute deviation The average absolute deviation (AAD) of a data set is the average of the absolute deviations from a central point. It is a summary statistic of statistical dispersion or variability. In the general form, the central point can be a mean, median, m ...
: F(\mu) - L(F(\mu)) = \frac


See also

*
Distribution (economics) In economics, distribution is the way total output, income, or wealth is distributed among individuals or among the factors of production (such as labour, land, and capital). In general theory and in for example the U.S. National Income and Prod ...
*
Distribution of wealth The distribution of wealth is a comparison of the wealth of various members or groups in a society. It shows one aspect of economic inequality or economic heterogeneity. The distribution of wealth differs from the income distribution in that ...
*
Welfare economics Welfare economics is a branch of economics that uses microeconomic techniques to evaluate well-being (welfare) at the aggregate (economy-wide) level. Attempting to apply the principles of welfare economics gives rise to the field of public econ ...
*
Income inequality metrics Income inequality metrics or income distribution metrics are used by social scientists to measure the distribution of income and economic inequality among the participants in a particular economy, such as that of a specific country or of the world ...
* Gini coefficient *
Hoover index The Hoover index, also known as the Robin Hood index or the Schutz index, is a measure of income inequality. It is equal to the percentage of the total population's income that would have to be redistributed to make all the incomes equal. i.e. Th ...
(a.k.a. Robin Hood index) *
ROC analysis A receiver operating characteristic curve, or ROC curve, is a graphical plot that illustrates the diagnostic ability of a binary classifier system as its discrimination threshold is varied. The method was originally developed for operators of ...
* Social welfare (political science) *
Economic inequality There are wide varieties of economic inequality, most notably income inequality measured using the distribution of income (the amount of money people are paid) and wealth inequality measured using the distribution of wealth (the amount of we ...
* Zipf's law *
Pareto distribution The Pareto distribution, named after the Italian civil engineer, economist, and sociologist Vilfredo Pareto ( ), is a power-law probability distribution that is used in description of social, quality control, scientific, geophysical, actua ...
* Mean deviation *
The Elephant Curve The Elephant Curve, also known as the ''Lakner-Milanovic'' graph or the ''global growth incidence curve'', is a graph that illustrates the unequal distribution of income growth for individuals belonging to different income groups. The original gra ...


References


Further reading

* * * *


External links


WIID
: World Income Inequality Database, a source of information on inequality, collected by WIDER (World Institute for Development Economics Research, part of United Nations University)
glcurve
Stata Stata (, , alternatively , occasionally stylized as STATA) is a general-purpose statistical software package developed by StataCorp for data manipulation, visualization, statistics, and automated reporting. It is used by researchers in many fie ...
module to plot Lorenz curve (type "findit glcurve" or "ssc install glcurve" in Stata prompt to install)
Free add-on to STATA to compute inequality and poverty measures

Free Online Software (Calculator)
computes the Gini Coefficient, plots the Lorenz curve, and computes many other measures of concentration for any dataset * Free Calculator

an
downloadable scripts
(
Python Python may refer to: Snakes * Pythonidae, a family of nonvenomous snakes found in Africa, Asia, and Australia ** ''Python'' (genus), a genus of Pythonidae found in Africa and Asia * Python (mythology), a mythical serpent Computing * Python (pro ...
and
Lua Lua or LUA may refer to: Science and technology * Lua (programming language) * Latvia University of Agriculture * Last universal ancestor, in evolution Ethnicity and language * Lua people, of Laos * Lawa people, of Thailand sometimes referred t ...
) for Atkinson, Gini, and Hoover inequalities * Users of th
R
data analysis software can install the "ineq" package which allows for computation of a variety of inequality indices including Gini, Atkinson, Theil. *
MATLAB Inequality Package
, including code for computing Gini, Atkinson, Theil indexes and for plotting the Lorenz Curve. Many examples are available. *
complete handout
about the Lorenz curve including various applications, including a
Excel spreadsheet
graphing Lorenz curves and calculating Gini coefficients as well as coefficients of variation.
LORENZ 3.0
is a
Mathematica Wolfram Mathematica is a software system with built-in libraries for several areas of technical computing that allow machine learning, statistics, symbolic computation, data manipulation, network analysis, time series analysis, NLP, optimizat ...
notebook which draw sample Lorenz curves and calculates Gini coefficients and
Lorenz asymmetry coefficient The Lorenz asymmetry coefficient (LAC) is a summary statistic of the Lorenz curve that measures the degree of asymmetry of the curve. The Lorenz curve is used to describe the inequality in the distribution of a quantity (usually income or wealth in ...
s from data in an Excel sheet. {{DEFAULTSORT:Lorenz Curve Economics curves Welfare economics Statistical charts and diagrams Income inequality metrics