Kemeny–Young method
   HOME

TheInfoList



OR:

The Kemeny–Young method is an
electoral system An electoral system or voting system is a set of rules that determine how elections and referendums are conducted and how their results are determined. Electoral systems are used in politics to elect governments, while non-political elections m ...
that uses
preferential ballot The term ranked voting (also known as preferential voting or ranked choice voting) refers to any voting system in which voters rank their candidates (or options) in a sequence of first or second (or third, etc.) on their respective ballots. Ran ...
s and
pairwise comparison Pairwise comparison generally is any process of comparing entities in pairs to judge which of each entity is preferred, or has a greater amount of some quantitative property, or whether or not the two entities are identical. The method of pairwi ...
counts to identify the most popular choices in an election. It is a
Condorcet method A Condorcet method (; ) is an election method that elects the candidate who wins a majority of the vote in every head-to-head election against each of the other candidates, that is, a candidate preferred by more voters than any others, whenever ...
because if there is a Condorcet winner, it will always be ranked as the most popular choice. This method assigns a score for each possible sequence, where each sequence considers which choice might be most popular, which choice might be second-most popular, which choice might be third-most popular, and so on down to which choice might be least-popular. The sequence that has the highest score is the winning sequence, and the first choice in the winning sequence is the most popular choice. (As explained below, ties can occur at any ranking level.) The Kemeny–Young method is also known as the Kemeny rule, VoteFair popularity ranking, the
maximum likelihood In statistics, maximum likelihood estimation (MLE) is a method of estimating the parameters of an assumed probability distribution, given some observed data. This is achieved by maximizing a likelihood function so that, under the assumed stat ...
method, and the median relation.


Description

The Kemeny–Young method uses
preferential ballot The term ranked voting (also known as preferential voting or ranked choice voting) refers to any voting system in which voters rank their candidates (or options) in a sequence of first or second (or third, etc.) on their respective ballots. Ran ...
s on which voters rank choices according to their order of preference. A voter is allowed to rank more than one choice at the same preference level. Unranked choices are usually interpreted as least-preferred. Another way to view the ordering is that it is the one which minimizes the sum of the
Kendall tau distance The Kendall tau rank distance is a metric (distance function) that counts the number of pairwise disagreements between two ranking lists. The larger the distance, the more dissimilar the two lists are. Kendall tau distance is also called bubble-sor ...
s (
bubble sort Bubble sort, sometimes referred to as sinking sort, is a simple sorting algorithm that repeatedly steps through the input list element by element, comparing the current element with the one after it, swapping their values if needed. These passe ...
distance) to the voters' lists. Kemeny–Young calculations are usually done in two steps. The first step is to create a matrix or table that counts pairwise voter preferences. The second step is to test all possible rankings, calculate a score for each such ranking, and compare the scores. Each ranking score equals the sum of the pairwise counts that apply to that ranking. The ranking that has the largest score is identified as the overall ranking. (If more than one ranking has the same largest score, all these possible rankings are tied, and typically the overall ranking involves one or more ties.) In order to demonstrate how an individual preference order is converted into a tally table, it is worth considering the following example. Suppose that a single voter has a choice among four candidates (i.e. Elliot, Meredith, Roland, and Selden) and has the following preference order: These preferences can be expressed in a tally table. A tally table, which arranges all the pairwise counts in three columns, is useful for counting (tallying) ballot preferences and calculating ranking scores. The center column tracks when a voter indicates more than one choice at the same preference level. The above preference order can be expressed as the following tally table: Now suppose that multiple voters had voted on those four candidates. After all ballots have been counted, the same type of tally table can be used to summarize all the preferences of all the voters. Here is an example for a case that has 100 voters:
The sum of the counts in each row must equal the total number of votes. After the tally table has been completed, each possible ranking of choices is examined in turn, and its ranking score is calculated by adding the appropriate number from each row of the tally table. For example, the possible ranking: # Elliot # Roland # Meredith # Selden satisfies the preferences Elliot > Roland, Elliot > Meredith, Elliot > Selden, Roland > Meredith, Roland > Selden, and Meredith > Selden. The respective scores, taken from the table, are * Elliot > Roland: 30 * Elliot > Meredith: 60 * Elliot > Selden: 60 * Roland > Meredith: 70 * Roland > Selden: 60 * Meredith > Selden: 40 giving a total ranking score of 30 + 60 + 60 + 70 + 60 + 40 = 320.


Calculating the overall ranking

After the scores for every possible ranking have been calculated, the ranking that has the largest score can be identified, and becomes the overall ranking. In this case, the overall ranking is: # Roland # Elliot # Selden # Meredith with a ranking score of 370. If there are cycles or ties, more than one possible ranking can have the same largest score. Cycles are resolved by producing a single overall ranking where some of the choices are tied.


Summary matrix

After the overall ranking has been calculated, the pairwise comparison counts can be arranged in a summary matrix, as shown below, in which the choices appear in the winning order from most popular (top and left) to least popular (bottom and right). This matrix layout does not include the equal-preference pairwise counts that appear in the tally table: In this summary matrix, the largest ranking score equals the sum of the counts in the upper-right, triangular half of the matrix (shown here in bold, with a green background). No other possible ranking can have a summary matrix that yields a higher sum of numbers in the upper-right, triangular half. (If it did, that would be the overall ranking.) In this summary matrix, the sum of the numbers in the lower-left, triangular half of the matrix (shown here with a red background) are a minimum. The academic papers by John Kemeny and Peyton Young refer to finding this minimum sum, which is called the Kemeny score, and which is based on how many voters oppose (rather than support) each pairwise order:


Example

This matrix summarizes the corresponding
pairwise comparison Pairwise comparison generally is any process of comparing entities in pairs to judge which of each entity is preferred, or has a greater amount of some quantitative property, or whether or not the two entities are identical. The method of pairwi ...
counts:
The Kemeny–Young method arranges the pairwise comparison counts in the following tally table:
The ranking score for the possible ranking of Memphis first, Nashville second, Chattanooga third, and Knoxville fourth equals (the unit-less number) 345, which is the sum of the following annotated numbers. :42% (of the voters) prefer Memphis over Nashville :42% prefer Memphis over Chattanooga :42% prefer Memphis over Knoxville :68% prefer Nashville over Chattanooga :68% prefer Nashville over Knoxville :83% prefer Chattanooga over Knoxville
This table lists all the ranking scores:
The largest ranking score is 393, and this score is associated with the following possible ranking, so this ranking is also the overall ranking:
If a single winner is needed, the first choice, Nashville, is chosen. (In this example Nashville is the
Condorcet winner An electoral system satisfies the Condorcet winner criterion () if it always chooses the Condorcet winner when one exists. The candidate who wins a majority of the vote in every head-to-head election against each of the other candidatesthat is, a ...
.) The summary matrix below arranges the pairwise counts in order from most popular (top and left) to least popular (bottom and right):
In this arrangement the largest ranking score (393) equals the sum of the counts in bold, which are in the upper-right, triangular half of the matrix (with a green background).


Characteristics

In all cases that do not result in an exact tie, the Kemeny–Young method identifies a most-popular choice, second-most popular choice, and so on. A tie can occur at any preference level. Except in some cases where circular ambiguities are involved, the Kemeny–Young method only produces a tie at a preference level when the number of voters with one preference exactly matches the number of voters with the opposite preference.


Satisfied criteria for all Condorcet methods

All Condorcet methods, including the Kemeny–Young method, satisfy these criteria: :; Non-imposition ::There are voter preferences that can yield every possible overall order-of-preference result, including ties at any combination of preference levels. :;
Condorcet criterion An electoral system satisfies the Condorcet winner criterion () if it always chooses the Condorcet winner when one exists. The candidate who wins a majority of the vote in every head-to-head election against each of the other candidatesthat is, a ...
::If there is a choice that wins all pairwise contests, then this choice wins. :; Majority criterion ::If a majority of voters strictly prefer choice X to every other choice, then choice X is identified as the most popular. :;
Non-dictatorship In social choice theory, a dictatorship mechanism is a rule by which, among all possible alternatives, the results of voting mirror a single pre-determined person's preferences, without consideration of the other voters. Dictatorship by itself is n ...
::A single voter cannot control the outcome in all cases.


Additional satisfied criteria

The Kemeny–Young method also satisfies these criteria: :;
Unrestricted domain In social choice theory, unrestricted domain, or universality, is a property of social welfare functions in which all preferences of all voters (but no other considerations) are allowed. Intuitively, unrestricted domain is a common requirement for s ...
::Identifies the overall order of preference for all the choices. The method does this for all possible sets of voter preferences and always produces the same result for the same set of voter preferences. :;
Pareto efficiency Pareto efficiency or Pareto optimality is a situation where no action or allocation is available that makes one individual better off without making another worse off. The concept is named after Vilfredo Pareto (1848–1923), Italian civil engi ...
::Any pairwise preference expressed by every voter results in the preferred choice being ranked higher than the less-preferred choice. :; Monotonicity ::If voters increase a choice's preference level, the ranking result either does not change or the promoted choice increases in overall popularity. :;
Smith criterion The Smith criterion (sometimes generalized Condorcet criterion, but this can have other meanings) is a voting systems criterion defined such that it's satisfied when a voting system always elects a candidate that is in the Smith set, which is the ...
::The most popular choice is a member of the
Smith set In voting systems, the Smith set, named after John H. Smith, but also known as the top cycle, or as Generalized Top-Choice Assumption (GETCHA), is the smallest non-empty set of candidates in a particular election such that each member defeats ever ...
, which is the smallest nonempty set of choices such that every member of the set is pairwise preferred to every choice not in the Smith set. :; Independence of Smith-dominated alternatives ::If choice X is not in the
Smith set In voting systems, the Smith set, named after John H. Smith, but also known as the top cycle, or as Generalized Top-Choice Assumption (GETCHA), is the smallest non-empty set of candidates in a particular election such that each member defeats ever ...
, adding or withdrawing choice X does not change a result in which choice Y is identified as most popular. :;Reinforcement ::If all the ballots are divided into separate races and the overall ranking for the separate races are the same, then the same ranking occurs when all the ballots are combined. :;
Reversal symmetry Reversal symmetry is a voting system criterion which requires that if candidate A is the unique winner, and each voter's individual preferences are inverted, then A must not be elected. Methods that satisfy reversal symmetry include Borda count, r ...
::If the preferences on every ballot are inverted, then the previously most popular choice must not remain the most popular choice.


Failed criteria for all Condorcet methods

In common with all Condorcet methods, the Kemeny–Young method fails these criteria (which means the described criteria do not apply to the Kemeny–Young method): :;
Independence of irrelevant alternatives The independence of irrelevant alternatives (IIA), also known as binary independence or the independence axiom, is an axiom of decision theory and various social sciences. The term is used in different connotation in several contexts. Although it ...
::Adding or withdrawing choice X does not change a result in which choice Y is identified as most popular. :; Invulnerability to burying ::A voter cannot displace a choice from most popular by giving the choice an insincerely low ranking. :; Invulnerability to compromising ::A voter cannot cause a choice to become the most popular by giving the choice an insincerely high ranking. :; Participation ::Adding ballots that rank choice X over choice Y never cause choice Y, instead of choice X, to become most popular. :;
Later-no-harm The later-no-harm criterion is a voting system criterion formulated by Douglas Woodall. Woodall defined the criterion as " ding a later preference to a ballot should not harm any candidate already listed." For example, a ranked voting method in w ...
::Ranking an additional choice (that was otherwise unranked) cannot displace a choice from being identified as the most popular. :;
Consistency In classical deductive logic, a consistent theory is one that does not lead to a logical contradiction. The lack of contradiction can be defined in either semantic or syntactic terms. The semantic definition states that a theory is consistent ...
::If all the ballots are divided into separate races and choice X is identified as the most popular in every such race, then choice X is the most popular when all the ballots are combined.


Additional failed criteria

The Kemeny–Young method also fails these criteria (which means the described criteria do not apply to the Kemeny–Young method): :;
Independence of clones In voting systems theory, the independence of clones criterion measures an election method's robustness to strategic nomination. Nicolaus Tideman was the first to formulate this criterion, which states that the winner must not change due to the ...
::Offering a larger number of similar choices, instead of offering only a single such choice, does not change the probability that one of these choices is identified as most popular. :; Invulnerability to push-over ::A voter cannot cause choice X to become the most popular by giving choice Y an insincerely high ranking. :;
Schwartz Schwartz may refer to: *Schwartz (surname), a surname (and list of people with the name) *Schwartz (brand), a spice brand *Schwartz's, a delicatessen in Montreal, Quebec, Canada *Schwartz Publishing, an Australian publishing house *"Danny Schwartz" ...
::The choice identified as most popular is a member of the Schwartz set. :; Polynomial runtimeJ. Bartholdi III, C. A. Tovey, and M. A. Trick, "Voting schemes for which it can be difficult to tell who won the election", ''Social Choice and Welfare'', Vol. 6, No. 2 (1989), pp. 157–165. ::An algorithm is known to determine the winner using this method in a runtime that is polynomial in the number of choices.


Calculation methods and computational complexity

An algorithm for computing a Kemeny-Young ranking in time polynomial in the number of candidates is not known, and unlikely to exist since the problem is
NP-hard In computational complexity theory, NP-hardness ( non-deterministic polynomial-time hardness) is the defining property of a class of problems that are informally "at least as hard as the hardest problems in NP". A simple example of an NP-hard pr ...
even if there are just 4 voters (even) or 7 voters (odd). It has been reportedVincent Conitzer, Andrew Davenport, and Jayant Kalagnanam,
Improved bounds for computing Kemeny rankings
(2006).
that calculation methods based on
integer programming An integer programming problem is a mathematical optimization or feasibility program in which some or all of the variables are restricted to be integers. In many settings the term refers to integer linear programming (ILP), in which the objective ...
sometimes allowed the computation of full rankings for votes on as many as 40 candidates in seconds. However, certain 40-candidate 5-voter Kemeny elections generated at random were not solvable on a 3 GHz Pentium computer in a useful time bound in 2006. The Kemeny–Young method can be formulated as an instance of a more abstract problem, of finding weighted
feedback arc set In graph theory and graph algorithms, a feedback arc set or feedback edge set in a directed graph is a subset of the edges of the graph that contains at least one edge out of every cycle in the graph. Removing these edges from the graph breaks al ...
s in tournament graphs. As such, many methods for the computation of feedback arc sets can be applied to this problem, including a variant of the Held–Karp algorithm that can compute the Kemeny–Young ranking of n candidates in time O(n2^n), significantly faster for many candidates than the factorial time of testing all rankings. There exists a
polynomial-time approximation scheme In computer science (particularly algorithmics), a polynomial-time approximation scheme (PTAS) is a type of approximation algorithm for optimization problems (most often, NP-hard optimization problems). A PTAS is an algorithm which takes an insta ...
for computing a Kemeny-Young ranking, and there also exists a parameterized subexponential-time algorithm with running time O*(2O()) for computing such a ranking.Karpinski, M. and Schudy, W.
"Faster Algorithms for Feedback Arc Set Tournament, Kemeny Rank Aggregation and Betweenness Tournament"
in: Cheong, O., Chwa, K.-Y., and Park, K. (Eds.): ISAAC 2010, Part I, LNCS 6506, pp. 3-14.


History

The Kemeny–Young method was developed by John Kemeny in 1959.John Kemeny, "Mathematics without numbers", ''Daedalus'' 88 (1959), pp. 577–591. In 1978,
Peyton Young Hobart Peyton Young (born March 9, 1945) is an American game theorist and economist known for his contributions to evolutionary game theory and its application to the study of institutional and technological change, as well as the theory of learn ...
and Arthur Levenglick axiomatically characterized the method, showing that it is the unique neutral method satisfying
consistency In classical deductive logic, a consistent theory is one that does not lead to a logical contradiction. The lack of contradiction can be defined in either semantic or syntactic terms. The semantic definition states that a theory is consistent ...
and the so-called quasi-Condorcet criterion.H. P. Young and A. Levenglick,
A Consistent Extension of Condorcet's Election Principle
, ''SIAM Journal on Applied Mathematics'' 35, no. 2 (1978), pp. 285–300.
It can also be characterized using consistency and a monotonicity property. In other papers, Young adopted an
epistemic Epistemology (; ), or the theory of knowledge, is the branch of philosophy concerned with knowledge. Epistemology is considered a major subfield of philosophy, along with other major subfields such as ethics, logic, and metaphysics. Episte ...
approach to preference aggregation: he supposed that there was an objectively 'correct', but unknown preference order over the alternatives, and voters receive noisy signals of this true preference order (cf.
Condorcet's jury theorem Condorcet's jury theorem is a political science theorem about the relative probability of a given group of individuals arriving at a correct decision. The theorem was first expressed by the Marquis de Condorcet in his 1785 work ''Essay on the App ...
.) Using a simple probabilistic model for these noisy signals, Young showed that the Kemeny–Young method was the
maximum likelihood estimator In statistics, maximum likelihood estimation (MLE) is a method of estimating the parameters of an assumed probability distribution, given some observed data. This is achieved by maximizing a likelihood function so that, under the assumed statis ...
of the true preference order. Young further argues that
Condorcet Marie Jean Antoine Nicolas de Caritat, Marquis of Condorcet (; 17 September 1743 – 29 March 1794), known as Nicolas de Condorcet, was a French philosopher and mathematician. His ideas, including support for a liberal economy, free and equal p ...
himself was aware of the Kemeny-Young rule and its maximum-likelihood interpretation, but was unable to clearly express his ideas. In the papers by John Kemeny and Peyton Young, the Kemeny scores use counts of how many voters oppose, rather than support, each pairwise preference, but the smallest such score identifies the same overall ranking. Since 1991 the method has been promoted under the name "VoteFair popularity ranking" by Richard Fobes.Richard Fobes, "The Creative Problem Solver's Toolbox", (), 1993, pp. 223–225.


Comparison table

The following table compares the Kemeny-Young method with other preferential single-winner election methods:


Notes


External links


VoteFair.org
nbsp;— A website that calculates Kemeny–Young results. For comparison, it also calculates the winner according to plurality, Condorcet, Borda count, and other voting methods.
VoteFair_Ranking.cpp
nbsp;— C++ program, available on GitHub under the MIT license, that calculates VoteFair ranking results, which include Condorcet-Kemeny calculations.
Condorcet Class
PHP
library A library is a collection of materials, books or media that are accessible for use and not just for display purposes. A library provides physical (hard copies) or digital access (soft copies) materials, and may be a physical location or a vi ...
supporting multiple Condorcet methods, including Kemeny–Young method.
C++ Program for Kemeny-Young Preference Aggregation
nbsp;— Command-line program for fast calculation of Kemeny-Young results, as source code and compiled binaries for Windows and Linux. Open source, except uses
Numerical Recipes ''Numerical Recipes'' is the generic title of a series of books on algorithms and numerical analysis by William H. Press, Saul A. Teukolsky, William T. Vetterling and Brian P. Flannery. In various editions, the books have been in print since 1 ...
.
C Program for Kemeny-Young Preference Aggregation
nbsp;— Implements Davenport's algorithm with no other library dependencies. Open source, LGPL licensed
A Ruby binding to the library
is also open source, LPGL licensed.

 — Tutorial that uses a simple formulation as integer program and is adaptable to other languages with bindings to lpsolve.
QuickVote
nbsp;— A website that calculates Kemeny–Young results, and gives further explanation and examples of the concept. It also calculates the winner according to plurality, Borda count, instant-runoff and other voting methods. {{DEFAULTSORT:Kemeny-Young method Electoral systems Monotonic Condorcet methods