The prisoner's dilemma is a
game theory
Game theory is the study of mathematical models of strategic interactions. It has applications in many fields of social science, and is used extensively in economics, logic, systems science and computer science. Initially, game theory addressed ...
thought experiment involving two
rational agent
A rational agent or rational being is a person or entity that always aims to perform optimal actions based on given premises and information. A rational agent can be anything that makes decisions, typically a person, firm, machine, or software.
...
s, each of whom can either cooperate for mutual benefit or betray their partner ("defect") for individual gain. The dilemma arises from the fact that while defecting is rational for each agent, cooperation yields a higher payoff for each. The puzzle was designed by
Merrill Flood and
Melvin Dresher
Melvin Dresher (born Dreszer; March 13, 1911 – June 4, 1992) was a Poland, Polish-born United States, American mathematician, notable for developing, alongside Merrill M. Flood, Merrill Flood, the game theory, game theoretical model of cooperat ...
in 1950 during their work at the
RAND Corporation
The RAND Corporation, doing business as RAND, is an American nonprofit global policy think tank, research institute, and public sector consulting firm. RAND engages in research and development (R&D) in several fields and industries. Since the ...
. They invited economist
Armen Alchian
Armen Albert Alchian (; April 12, 1914February 19, 2013) was an American economist who made major contributions to microeconomic theory and the theory of the firm. He spent almost his entire career at the University of California, Los Angeles (UCL ...
and mathematician John Williams to play a hundred rounds of the game, observing that Alchian and Williams often chose to cooperate. When asked about the results,
John Nash remarked that rational behavior in the
iterated version of the game can differ from that in a single-round version. This insight anticipated a
key result in game theory: cooperation can emerge in repeated interactions, even in situations where it is not rational in a one-off interaction.
Albert W. Tucker later named the game the "prisoner's dilemma" by framing the rewards in terms of prison sentences. The prisoner's dilemma models many
real-world situations involving strategic behavior. In casual usage, the label "prisoner's dilemma" is applied to any situation in which two entities can gain important benefits by cooperating or suffer by failing to do so, but find it difficult or expensive to coordinate their choices.
Premise
This "typical contemporary version" of the game is described in
William Poundstone's 1993 book ''Prisoner's Dilemma'':
Two members of a criminal gang are arrested and imprisoned. Each prisoner is in solitary confinement with no means of speaking to or exchanging messages with the other. The police admit they don't have enough evidence to convict the pair on the principal charge. They plan to sentence both to a year in prison on a lesser charge. Simultaneously, the police offer each prisoner a Faustian bargain. If he testifies against his partner, he will go free while the partner will get three years in prison on the main charge. Oh, yes, there is a catch ... If ''both'' prisoners testify against each other, both will be sentenced to two years in jail. The prisoners are given a little time to think this over, but in no case may either learn what the other has decided until he has irrevocably made his decision. Each is informed that the other prisoner is being offered the very same deal. Each prisoner is concerned only with his own welfare—with minimizing his own prison sentence.
This leads to three different possible outcomes for prisoners A and B:
# If A and B both remain silent, they will each serve one year in prison.
# If one testifies against the other but the other doesn’t, the one testifying will be set free while the other serves three years in prison.
# If A and B testify against each other, they will each serve two years.
Strategy for the prisoner's dilemma
Two prisoners are separated into individual rooms and cannot communicate with each other. It is assumed that both prisoners understand the nature of the game, have no loyalty to each other, and will have no opportunity for retribution or reward outside of the game. The normal game is shown below:
Regardless of what the other decides, each prisoner gets a higher reward by betraying the other ("defecting"). The reasoning involves analyzing both players'
best response
In game theory, the best response is the strategy (or strategies) which produces the most favorable outcome for a player, taking other players' strategies as given. The concept of a best response is central to John Nash's best-known contribution ...
s: B will either cooperate or defect. If B cooperates, A should defect, because going free is better than serving 1 year. If B defects, A should also defect, because serving 2 years is better than serving 3. So, either way, A should defect since defecting is A's best response regardless of B's strategy. Parallel reasoning will show that B should defect.
Defection always results in a better payoff than cooperation, so it is a strictly dominant strategy for both players. Mutual defection is the only strong
Nash equilibrium
In game theory, the Nash equilibrium is the most commonly used solution concept for non-cooperative games. A Nash equilibrium is a situation where no player could gain by changing their own strategy (holding all other players' strategies fixed) ...
in the game. Since the collectively ideal result of mutual cooperation is irrational from a self-interested standpoint, this Nash equilibrium is not
Pareto efficient
In welfare economics, a Pareto improvement formalizes the idea of an outcome being "better in every possible way". A change is called a Pareto improvement if it leaves at least one person in society better off without leaving anyone else worse ...
.
Generalized form
The structure of the traditional prisoner's dilemma can be generalized from its original prisoner setting. Suppose that the two players are represented by the colors red and blue and that each player chooses to either "cooperate" or "defect".
If both players cooperate, they both receive the reward
for cooperating. If both players defect, they both receive the punishment payoff
. If Blue defects while Red cooperates, then Blue receives the temptation payoff
, while Red receives the "sucker's" payoff,
. Similarly, if Blue cooperates while Red defects, then Blue receives the sucker's payoff
, while Red receives the temptation payoff
.
This can be expressed in
normal form:
and to be a prisoner's dilemma game in the strong sense, the following condition must hold for the payoffs:
:
The payoff relationship implies that mutual cooperation is superior to mutual defection, while the payoff relationships and imply that defection is the dominant strategy for both agents.
The iterated prisoner's dilemma
If two players play the prisoner's dilemma more than once in succession, remember their opponent's previous actions, and are allowed to change their strategy accordingly, the game is called the iterated prisoner's dilemma.
In addition to the general form above, the iterative version also requires that , to prevent alternating cooperation and defection giving a greater reward than mutual cooperation.
The iterated prisoner's dilemma is fundamental to some theories of human cooperation and trust. Assuming that the game effectively models transactions between two people that require trust, cooperative behavior in populations can be modeled by a multi-player iterated version of the game. In 1975,
Grofman and
Pool
Pool may refer to:
Bodies of water
* Swimming pool, usually an artificial structure containing a large body of water intended for swimming
* Reflecting pool, a shallow pool designed to reflect a structure and its surroundings
* Tide pool, a roc ...
estimated the count of scholarly articles devoted to it at over 2,000. The iterated prisoner's dilemma is also called the "
peace-war game".
General strategy
If the iterated prisoner's dilemma is played a finite number of times and both players know this, then the dominant strategy and Nash equilibrium is to defect in all rounds. The proof is
inductive: one might as well defect on the last turn, since the opponent will not have a chance to later retaliate. Therefore, both will defect on the last turn. Thus, the player might as well defect on the second-to-last turn, since the opponent will defect on the last no matter what is done, and so on. The same applies if the game length is unknown but has a known upper limit.
For
cooperation
Cooperation (written as co-operation in British English and, with a varied usage along time, coöperation) takes place when a group of organisms works or acts together for a collective benefit to the group as opposed to working in competition ...
to emerge between rational players, the number of rounds must be unknown or infinite. In that case, "always defect" may no longer be a dominant strategy. As shown by
Robert Aumann
Robert John Aumann (Yisrael Aumann, ; born June 8, 1930) is an Israeli-American mathematician, and a member of the United States National Academy of Sciences. He is a professor at the Center for the Study of Rationality in the Hebrew University ...
in a 1959 paper, rational players repeatedly interacting for indefinitely long games can sustain cooperation. Specifically, a player may be less willing to cooperate if their counterpart did not cooperate many times, which causes disappointment. Conversely, as time elapses, the likelihood of cooperation tends to rise, owing to the establishment of a "tacit agreement" among participating players. In experimental situations, cooperation can occur even when both participants know how many iterations will be played.
According to a 2019 experimental study in the ''American Economic Review'' that tested what strategies real-life subjects used in iterated prisoner's dilemma situations with perfect monitoring, the majority of chosen strategies were always to defect,
tit-for-tat
Tit for tat is an English saying meaning "equivalent retaliation". It is an alternation (linguistics), alternation of ''wikt:tip#Noun 3, tip for wikt:tap#Verb 2, tap'' "blow for blow", first recorded in 1558.
It is also a highly effective strat ...
, and
grim trigger
In game theory, grim trigger (also called the grim strategy or just grim) is a trigger strategy for a repeated game.
Initially, a player using grim trigger will cooperate, but as soon as the opponent defects (thus satisfying the trigger condition ...
. Which strategy the subjects chose depended on the parameters of the game.
Axelrod's tournament and successful strategy conditions
Interest in the iterated prisoner's dilemma was kindled by
Robert Axelrod in his 1984 book ''
The Evolution of Cooperation'', in which he reports on a tournament that he organized of the ''N''-step prisoner's dilemma (with ''N'' fixed) in which participants have to choose their strategy repeatedly and remember their previous encounters. Axelrod invited academic colleagues from around the world to devise computer strategies to compete in an iterated prisoner's dilemma tournament. The programs that were entered varied widely in algorithmic complexity, initial hostility, capacity for forgiveness, and so forth.
Axelrod discovered that when these encounters were repeated over a long period of time with many players, each with different strategies, greedy strategies tended to do very poorly in the long run while more
altruistic
Altruism is the concern for the well-being of others, independently of personal benefit or reciprocity.
The word ''altruism'' was popularised (and possibly coined) by the French philosopher Auguste Comte in French, as , for an antonym of egoi ...
strategies did better, as judged purely by self-interest. He used this to show a possible mechanism for the evolution of altruistic behavior from mechanisms that are initially purely selfish, by
natural selection
Natural selection is the differential survival and reproduction of individuals due to differences in phenotype. It is a key mechanism of evolution, the change in the Heredity, heritable traits characteristic of a population over generation ...
.
The winning
deterministic
Determinism is the metaphysical view that all events within the universe (or multiverse) can occur only in one possible way. Deterministic theories throughout the history of philosophy have developed from diverse and sometimes overlapping mo ...
strategy was
tit for tat
Tit for tat is an English saying meaning "equivalent retaliation". It is an alternation of '' tip for tap'' "blow for blow", first recorded in 1558.
It is also a highly effective strategy in game theory. An agent using this strategy will fi ...
, developed and entered into the tournament by
Anatol Rapoport
Anatol Borisovich Rapoport (; ; May 22, 1911January 20, 2007) was an American mathematical psychologist. He contributed to general systems theory, to mathematical biology and to the mathematical modeling of social interaction and stochastic ...
. It was the simplest of any program entered, containing only four lines of
BASIC
Basic or BASIC may refer to:
Science and technology
* BASIC, a computer programming language
* Basic (chemistry), having the properties of a base
* Basic access authentication, in HTTP
Entertainment
* Basic (film), ''Basic'' (film), a 2003 film
...
, and won the contest. The strategy is simply to cooperate on the first iteration of the game; after that, the player does what his or her opponent did on the previous move. Depending on the situation, a slightly better strategy can be "tit for tat with forgiveness": when the opponent defects, on the next move, the player sometimes cooperates anyway, with a small probability (around 1–5%, depending on the lineup of opponents). This allows for occasional recovery from getting trapped in a cycle of defections.
After analyzing the top-scoring strategies, Axelrod stated several conditions necessary for a strategy to succeed:
* Nice: The strategy will not be the first to defect (this is sometimes referred to as an "optimistic" algorithm), i.e., it will not "cheat" on its opponent for purely self-interested reasons first. Almost all the top-scoring strategies were nice.
* Retaliating: The strategy must sometimes retaliate. An example of a non-retaliating strategy is Always Cooperate, a very bad choice that will frequently be exploited by "nasty" strategies.
* Forgiving: Successful strategies must be forgiving. Though players will retaliate, they will cooperate again if the opponent does not continue to defect. This can stop long runs of revenge and counter-revenge, maximizing points.
* Non-envious: The strategy must not strive to score more than the opponent.
In contrast to the one-time prisoner's dilemma game, the optimal strategy in the iterated prisoner's dilemma depends upon the strategies of likely opponents, and how they will react to defections and cooperation. For example, if a population consists entirely of players who always defect, except for one who follows the tit-for-tat strategy, that person is at a slight disadvantage because of the loss on the first turn. In such a population, the optimal strategy is to defect every time. More generally, given a population with a certain percentage of always-defectors with the rest being tit-for-tat players, the optimal strategy depends on the percentage and number of iterations played.
Other strategies
Deriving the optimal strategy is generally done in two ways:
*
Bayesian Nash equilibrium
In game theory, a Bayesian game is a strategic decision-making model which assumes players have incomplete information. Players may hold private information relevant to the game, meaning that the payoffs are not common knowledge. Bayesian games mo ...
: If the statistical distribution of opposing strategies can be determined an optimal counter-strategy can be derived analytically.
*
Monte Carlo
Monte Carlo ( ; ; or colloquially ; , ; ) is an official administrative area of Monaco, specifically the Ward (country subdivision), ward of Monte Carlo/Spélugues, where the Monte Carlo Casino is located. Informally, the name also refers to ...
simulations of populations have been made, where individuals with low scores die off, and those with high scores reproduce (a
genetic algorithm
In computer science and operations research, a genetic algorithm (GA) is a metaheuristic inspired by the process of natural selection that belongs to the larger class of evolutionary algorithms (EA). Genetic algorithms are commonly used to g ...
for finding an optimal strategy). The mix of algorithms in the final population generally depends on the mix in the initial population. The introduction of mutation (random variation during reproduction) lessens the dependency on the initial population; empirical experiments with such systems tend to produce tit-for-tat players, but no analytic proof exists that this will always occur.
In the strategy called
win-stay, lose-switch, faced with a failure to cooperate, the player switches strategy the next turn. In certain circumstances, Pavlov beats all other strategies by giving preferential treatment to co-players using a similar strategy.
Although tit-for-tat is considered the most
robust
Robustness is the property of being strong and healthy in constitution. When it is transposed into a system, it refers to the ability of tolerating perturbations that might affect the system's functional body. In the same line ''robustness'' can ...
basic strategy, a team from
Southampton University
The University of Southampton (abbreviated as ''Soton'' in post-nominal letters) is a public research university in Southampton, England. Southampton is a founding member of the Russell Group of research-intensive universities in the United K ...
in England introduced a more successful strategy at the 20th-anniversary iterated prisoner's dilemma competition. It relied on collusion between programs to achieve the highest number of points for a single program. The university submitted 60 programs to the competition, which were designed to recognize each other through a series of five to ten moves at the start. Once this recognition was made, one program would always cooperate and the other would always defect, assuring the maximum number of points for the defector. If the program realized that it was playing a non-Southampton player, it would continuously defect in an attempt to minimize the competing program's score. As a result, the 2004 Prisoners' Dilemma Tournament results show
University of Southampton
The University of Southampton (abbreviated as ''Soton'' in post-nominal letters) is a public university, public research university in Southampton, England. Southampton is a founding member of the Russell Group of research-intensive universit ...
's strategies in the first three places (and a number of positions towards the bottom), despite having fewer wins and many more losses than the GRIM strategy. The Southampton strategy takes advantage of the fact that multiple entries were allowed in this particular competition and that a team's performance was measured by that of the highest-scoring player (meaning that the use of self-sacrificing players was a form of
minmaxing).
Because of this new rule, this competition also has little theoretical significance when analyzing single-agent strategies as compared to Axelrod's seminal tournament. But it provided a basis for analyzing how to achieve cooperative strategies in multi-agent frameworks, especially in the presence of noise.
Long before this new-rules tournament was played,
Richard Dawkins
Richard Dawkins (born 26 March 1941) is a British evolutionary biology, evolutionary biologist, zoologist, science communicator and author. He is an Oxford fellow, emeritus fellow of New College, Oxford, and was Simonyi Professor for the Publ ...
, in his book ''
The Selfish Gene
''The Selfish Gene'' is a 1976 book on evolution by ethologist Richard Dawkins that promotes the gene-centred view of evolution, as opposed to views focused on the organism and the group. The book builds upon the thesis of George C. Willia ...
'', pointed out the possibility of such strategies winning if multiple entries were allowed, but wrote that Axelrod would most likely not have allowed them if they had been submitted. Such strategies also circumvent the rule against communication between players: the Southampton programs' "ten-move dance" allowed them to recognize one another, reinforcing how valuable communication can be in shifting the balance of the game.
Even without implicit collusion between
software strategies, tit-for-tat is not always the absolute winner of any given tournament; more precisely, its long-run results over a series of tournaments outperform its rivals, but this does not mean it is the most successful in the short term. The same applies to tit-for-tat with forgiveness and other optimal strategies.
This can also be illustrated using the Darwinian
ESS simulation. In such a simulation, tit-for-tat will almost always come to dominate, though nasty strategies will drift in and out of the population because a tit-for-tat population is penetrable by non-retaliating nice strategies, which in turn are easy prey for the nasty strategies. Dawkins showed that here, no static mix of strategies forms a stable equilibrium, and the system will always oscillate between bounds.
Stochastic iterated prisoner's dilemma
In a stochastic iterated prisoner's dilemma game, strategies are specified in terms of "cooperation probabilities".
In an encounter between player ''X'' and player ''Y'', ''X''s strategy is specified by a set of probabilities ''P'' of cooperating with ''Y''. ''P'' is a function of the outcomes of their previous encounters or some subset thereof. If ''P'' is a function of only their most recent ''n'' encounters, it is called a "memory-n" strategy. A memory-1 strategy is then specified by four cooperation probabilities:
, where ''P
cd'' is the probability that ''X'' will cooperate in the present encounter given that the previous encounter was characterized by ''X'' cooperating and ''Y'' defecting. If each of the probabilities are either 1 or 0, the strategy is called deterministic. An example of a deterministic strategy is the tit-for-tat strategy written as
, in which ''X'' responds as ''Y'' did in the previous encounter. Another is the win-stay, lose switch strategy written as
. It has been shown that for any memory-n strategy there is a corresponding memory-1 strategy that gives the same statistical results, so that only memory-1 strategies need be considered.
If
is defined as the above 4-element strategy vector of ''X'' and
as the 4-element strategy vector of ''Y'' (where the indices are from ''Y''
's point of view), a transition matrix ''M'' may be defined for ''X'' whose ''ij''-th entry is the probability that the outcome of a particular encounter between ''X'' and ''Y'' will be ''j'' given that the previous encounter was ''i'', where ''i'' and ''j'' are one of the four outcome indices: ''cc'', ''cd'', ''dc'', or ''dd''. For example, from ''X''s point of view, the probability that the outcome of the present encounter is ''cd'' given that the previous encounter was ''cd'' is equal to
. Under these definitions, the iterated prisoner's dilemma qualifies as a
stochastic process
In probability theory and related fields, a stochastic () or random process is a mathematical object usually defined as a family of random variables in a probability space, where the index of the family often has the interpretation of time. Sto ...
and ''M'' is a
stochastic matrix
In mathematics, a stochastic matrix is a square matrix used to describe the transitions of a Markov chain. Each of its entries is a nonnegative real number representing a probability. It is also called a probability matrix, transition matrix, ''s ...
, allowing all of the theory of stochastic processes to be applied.
One result of stochastic theory is that there exists a stationary vector ''v'' for the matrix ''v'' such that
. Without loss of generality, it may be specified that ''v'' is normalized so that the sum of its four components is unity. The ''ij''-th entry in
will give the probability that the outcome of an encounter between ''X'' and ''Y'' will be ''j'' given that the encounter ''n'' steps previous is ''i''. In the limit as ''n'' approaches infinity, ''M'' will converge to a matrix with fixed values, giving the long-term probabilities of an encounter producing ''j'' independent of ''i''. In other words, the rows of
will be identical, giving the long-term equilibrium result probabilities of the iterated prisoner's dilemma without the need to explicitly evaluate a large number of interactions. It can be seen that ''v'' is a stationary vector for
and particularly
, so that each row of
will be equal to ''v''. Thus, the stationary vector specifies the equilibrium outcome probabilities for ''X''. Defining
and
as the short-term payoff vectors for the outcomes (from ''X''s point of view), the equilibrium payoffs for ''X'' and ''Y'' can now be specified as
and
, allowing the two strategies ''P'' and ''Q'' to be compared for their long-term payoffs.
Zero-determinant strategies

In 2012,
William H. Press and
Freeman Dyson
Freeman John Dyson (15 December 1923 – 28 February 2020) was a British-American theoretical physics, theoretical physicist and mathematician known for his works in quantum field theory, astrophysics, random matrix, random matrices, math ...
published a new class of strategies for the stochastic iterated prisoner's dilemma called "zero-determinant" (ZD) strategies.
The long term payoffs for encounters between ''X'' and ''Y'' can be expressed as the determinant of a matrix which is a function of the two strategies and the short term payoff vectors:
and
, which do not involve the stationary vector ''v''. Since the determinant function
is linear in
, it follows that
(where
). Any strategies for which
are by definition a ZD strategy, and the long-term payoffs obey the relation
.
Tit-for-tat is a ZD strategy which is "fair", in the sense of not gaining advantage over the other player. But the ZD space also contains strategies that, in the case of two players, can allow one player to unilaterally set the other player's score or alternatively force an evolutionary player to achieve a payoff some percentage lower than his own. The extorted player could defect, but would thereby hurt himself by getting a lower payoff. Thus, extortion solutions turn the iterated prisoner's dilemma into a sort of
ultimatum game
The ultimatum game is a popular experimental economics game in which two players interact to decide how to divide a sum of money, first described by Nobel laureate John Harsanyi in 1961. The first player, the proposer, proposes a division of the ...
. Specifically, ''X'' is able to choose a strategy for which
, unilaterally setting ''s
y'' to a specific value within a particular range of values, independent of ''Y''s strategy, offering an opportunity for ''X'' to "extort" player ''Y'' (and vice versa). But if ''X'' tries to set ''s
x'' to a particular value, the range of possibilities is much smaller, consisting only of complete cooperation or complete defection.
An extension of the iterated prisoner's dilemma is an evolutionary stochastic iterated prisoner's dilemma, in which the relative abundance of particular strategies is allowed to change, with more successful strategies relatively increasing. This process may be accomplished by having less successful players imitate the more successful strategies, or by eliminating less successful players from the game, while multiplying the more successful ones. It has been shown that unfair ZD strategies are not
evolutionarily stable. The key intuition is that an evolutionarily stable strategy must not only be able to invade another population (which extortionary ZD strategies can do) but must also perform well against other players of the same type (which extortionary ZD players do poorly because they reduce each other's surplus).
Theory and simulations confirm that beyond a critical population size, ZD extortion loses out in evolutionary competition against more cooperative strategies, and as a result, the average payoff in the population increases when the population is larger. In addition, there are some cases in which extortioners may even catalyze cooperation by helping to break out of a face-off between uniform defectors and
win–stay, lose–switch In psychology, game theory, statistics, and machine learning, win–stay, lose–switch (also win–stay, lose–shift) is a heuristic learning strategy used to model learning in decision situations. It was first invented as an improvement over ran ...
agents.
While extortionary ZD strategies are not stable in large populations, another ZD class called "generous" strategies is both stable and robust. When the population is not too small, these strategies can supplant any other ZD strategy and even perform well against a broad array of generic strategies for iterated prisoner's dilemma, including win–stay, lose–switch. This was proven specifically for the
donation game by Alexander Stewart and Joshua Plotkin in 2013.
Generous strategies will cooperate with other cooperative players, and in the face of defection, the generous player loses more utility than its rival. Generous strategies are the intersection of ZD strategies and so-called "good" strategies, which were defined by Ethan Akin to be those for which the player responds to past mutual cooperation with future cooperation and splits expected payoffs equally if he receives at least the cooperative expected payoff.
[ ] Among good strategies, the generous (ZD) subset performs well when the population is not too small. If the population is very small, defection strategies tend to dominate.
Continuous iterated prisoner's dilemma
Most work on the iterated prisoner's dilemma has focused on the discrete case, in which players either cooperate or defect, because this model is relatively simple to analyze. However, some researchers have looked at models of the continuous iterated prisoner's dilemma, in which players are able to make a variable contribution to the other player. Le and Boyd found that in such situations, cooperation is much harder to evolve than in the discrete iterated prisoner's dilemma. In a continuous prisoner's dilemma, if a population starts off in a non-cooperative equilibrium, players who are only marginally more cooperative than non-cooperators get little benefit from
assorting with one another. By contrast, in a discrete prisoner's dilemma, tit-for-tat cooperators get a big payoff boost from assorting with one another in a non-cooperative equilibrium, relative to non-cooperators. Since nature arguably offers more opportunities for variable cooperation rather than a strict dichotomy of cooperation or defection, the continuous prisoner's dilemma may help explain why real-life examples of tit-for-tat-like cooperation are extremely rare even though tit-for-tat seems robust in theoretical models.
Real-life examples
Many instances of human interaction and natural processes have payoff matrices like the prisoner's dilemma's. It is therefore of interest to the
social science
Social science (often rendered in the plural as the social sciences) is one of the branches of science, devoted to the study of societies and the relationships among members within those societies. The term was formerly used to refer to the ...
s, such as
economics
Economics () is a behavioral science that studies the Production (economics), production, distribution (economics), distribution, and Consumption (economics), consumption of goods and services.
Economics focuses on the behaviour and interac ...
,
politics
Politics () is the set of activities that are associated with decision-making, making decisions in social group, groups, or other forms of power (social and political), power relations among individuals, such as the distribution of Social sta ...
, and
sociology
Sociology is the scientific study of human society that focuses on society, human social behavior, patterns of Interpersonal ties, social relationships, social interaction, and aspects of culture associated with everyday life. The term sociol ...
, as well as to the biological sciences, such as
ethology
Ethology is a branch of zoology that studies the behavior, behaviour of non-human animals. It has its scientific roots in the work of Charles Darwin and of American and German ornithology, ornithologists of the late 19th and early 20th cen ...
and
evolutionary biology
Evolutionary biology is the subfield of biology that studies the evolutionary processes such as natural selection, common descent, and speciation that produced the diversity of life on Earth. In the 1930s, the discipline of evolutionary biolo ...
. Many natural processes have been abstracted into models in which living beings are engaged in endless games of prisoner's dilemma.
Environmental studies
In
environmental studies
Environmental studies (EVS or EVST) is a multidisciplinary academic field which systematically studies human behavior, human interaction with the Natural environment, environment. Environmental studies connects principles from the physical sci ...
, the dilemma is evident in crises such as global
climate change
Present-day climate change includes both global warming—the ongoing increase in Global surface temperature, global average temperature—and its wider effects on Earth's climate system. Climate variability and change, Climate change in ...
. It is argued all countries will benefit from a stable climate, but any single country is often hesitant to curb
emissions. The immediate benefit to any one country from maintaining current behavior is perceived to be greater than the purported eventual benefit to that country if all countries' behavior was changed, therefore explaining the impasse concerning climate-change in 2007.
An important difference between climate-change politics and the prisoner's dilemma is uncertainty; the extent and pace at which pollution can change climate is not known. The dilemma faced by governments is therefore different from the prisoner's dilemma in that the payoffs of cooperation are unknown. This difference suggests that states will cooperate much less than in a real iterated prisoner's dilemma, so that the probability of avoiding a possible climate catastrophe is much smaller than that suggested by a game-theoretical analysis of the situation using a real iterated prisoner's dilemma.
Thomas Osang and Arundhati Nandy provide a theoretical explanation with proofs for a regulation-driven win-win situation along the lines of
Michael Porter
Michael Eugene Porter (born May 23, 1947) is an American businessman and professor at Harvard Business School. He was one of the founders of the consulting firm The Monitor Group (now part of Deloitte) and FSG, a social impact consultancy. ...
's hypothesis, in which government regulation of competing firms is substantial.
Animals
Cooperative behavior of many animals can be understood as an example of the iterated prisoner's dilemma. Often animals engage in long-term partnerships; for example,
guppies inspect predators cooperatively in groups, and they are thought to punish non-cooperative inspectors.
Vampire bat
Vampire bats, members of the subfamily Desmodontinae, are Phyllostomidae, leaf-nosed bats currently found in Central and South America. Their food source is the blood of other animals, a dietary trait called hematophagy. Three extant bat species ...
s are social animals that engage in reciprocal food exchange. Applying the payoffs from the prisoner's dilemma can help explain this behavior.
Psychology
In
addiction
Addiction is a neuropsychological disorder characterized by a persistent and intense urge to use a drug or engage in a behavior that produces natural reward, despite substantial harm and other negative consequences. Repetitive drug use can ...
research and
behavioral economics
Behavioral economics is the study of the psychological (e.g. cognitive, behavioral, affective, social) factors involved in the decisions of individuals or institutions, and how these decisions deviate from those implied by traditional economi ...
,
George Ainslie points out that addiction can be cast as an intertemporal prisoner's dilemma problem between the present and future selves of the addict. In this case, "defecting" means relapsing, where not relapsing both today and in the future is by far the best outcome. The case where one abstains today but relapses in the future is the worst outcome: in some sense, the discipline and self-sacrifice involved in abstaining today have been "wasted" because the future relapse means that the addict is right back where they started and will have to start over. Relapsing today and tomorrow is a slightly "better" outcome, because while the addict is still addicted, they haven't put the effort in to trying to stop. The final case, where one engages in the addictive behavior today while abstaining tomorrow, has the problem that (as in other prisoner's dilemmas) there is an obvious benefit to defecting "today", but tomorrow one will face the same prisoner's dilemma, and the same obvious benefit will be present then, ultimately leading to an endless string of defections.
In ''The Science of Trust'',
John Gottman
John Mordechai Gottman (born April 26, 1942) is an American psychologist and professor emeritus of psychology at the University of Washington. His research focuses on divorce prediction and marital stability through relationship analyses. Gottman ...
defines good relationships as those where partners know not to enter into mutual defection behavior, or at least not to get dynamically stuck there in a loop. In
cognitive neuroscience
Cognitive neuroscience is the scientific field that is concerned with the study of the Biology, biological processes and aspects that underlie cognition, with a specific focus on the neural connections in the brain which are involved in mental ...
, fast brain signaling associated with processing different rounds may indicate choices at the next round. Mutual cooperation outcomes entail brain activity changes predictive of how quickly a person will cooperate in kind at the next opportunity; this activity may be linked to basic homeostatic and motivational processes, possibly increasing the likelihood of short-cutting into mutual cooperation.
Economics
The prisoner's dilemma has been called the ''
E. coli
''Escherichia coli'' ( )Wells, J. C. (2000) Longman Pronunciation Dictionary. Harlow ngland Pearson Education Ltd. is a gram-negative, facultative anaerobic, rod-shaped, coliform bacterium of the genus ''Escherichia'' that is commonly foun ...
'' of social psychology, and it has been used widely to research various topics such as
oligopolistic
An oligopoly () is a market in which pricing control lies in the hands of a few sellers.
As a result of their significant market power, firms in oligopolistic markets can influence prices through manipulating the supply function. Firms in a ...
competition and collective action to produce a collective good.
Advertising is sometimes cited as a real example of the prisoner's dilemma. When
cigarette advertising was legal in the United States, competing cigarette manufacturers had to decide how much money to spend on advertising. The effectiveness of Firm A's advertising was partially determined by the advertising conducted by Firm B. Likewise, the profit derived from advertising for Firm B is affected by the advertising conducted by Firm A. If both Firm A and Firm B chose to advertise during a given period, then the advertisement from each firm negates the other's, receipts remain constant, and expenses increase due to the cost of advertising. Both firms would benefit from a reduction in advertising. However, should Firm B choose not to advertise, Firm A could benefit greatly by advertising. Nevertheless, the optimal amount of advertising by one firm depends on how much advertising the other undertakes. As the best strategy is dependent on what the other firm chooses there is no dominant strategy, which makes it slightly different from a prisoner's dilemma. The outcome is similar, though, in that both firms would be better off were they to advertise less than in the equilibrium.
Sometimes cooperative behaviors do emerge in business situations. For instance, cigarette manufacturers endorsed the making of laws banning cigarette advertising, understanding that this would reduce costs and increase profits across the industry.
Without enforceable agreements, members of a
cartel
A cartel is a group of independent market participants who collaborate with each other as well as agreeing not to compete with each other in order to improve their profits and dominate the market. A cartel is an organization formed by producers ...
are also involved in a (multi-player) prisoner's dilemma. "Cooperating" typically means agreeing to a
price floor
A price floor is a government- or group-imposed price control or limit on how low a price can be charged for a product, good, commodity, or service. It is one type of price support; other types include supply regulation and guarantee government pu ...
, while "defecting" means selling under this minimum level, instantly taking business from other cartel members.
Anti-trust
Competition law is the field of law that promotes or seeks to maintain market competition by regulating anti-competitive conduct by companies. Competition law is implemented through public and private enforcement. It is also known as antitrust ...
authorities want potential cartel members to mutually defect, ensuring the lowest possible prices for consumers.
Sport
Doping in sport
In competitive sports, doping is the use of banned athletic performance-enhancing drugs (PEDs) by athletes, as a way of cheating. As stated in the World Anti-Doping Code by WADA, doping is defined as the occurrence of one or more of the anti-d ...
has been cited as an example of a prisoner's dilemma. Two competing athletes have the option to use an illegal and/or dangerous drug to boost their performance. If neither athlete takes the drug, then neither gains an advantage. If only one does, then that athlete gains a significant advantage over the competitor, reduced by the legal and/or medical dangers of having taken the drug. But if both athletes take the drug, the benefits cancel out and only the dangers remain, putting them both in a worse position than if neither had doped.
International politics
In
international relations theory
International relations theory is the study of international relations (IR) from a theoretical perspective. It seeks to explain behaviors and outcomes in international politics. The three most prominent School of thought, schools of thought are ...
, the prisoner's dilemma is often used to demonstrate why cooperation fails in situations when cooperation between states is collectively optimal but individually suboptimal. A classic example is the
security dilemma
In international relations, the security dilemma (also referred to as the spiral model) is when the increase in one state's security (such as increasing its military strength) leads other states to fear for their own security (because they do not k ...
, whereby an increase in one state's security (such as increasing its military strength) leads other states to fear for their own security out of fear of offensive action.
Consequently, security-increasing measures can lead to tensions, escalation or conflict with one or more other parties, producing an outcome which no party truly desires.
The security dilemma is particularly intense in situations when it is hard to distinguish offensive weapons from defensive weapons, and offense has the advantage in any conflict over defense.
The prisoner's dilemma has frequently been used by
realist international relations theorists to demonstrate the why all states (regardless of their internal policies or professed ideology) under
international anarchy will struggle to cooperate with one another even when all benefit from such cooperation.
Critics of realism argue that iteration and extending the shadow of the future are solutions to the prisoner's dilemma. When actors play the prisoner's dilemma once, they have incentives to defect, but when they expect to play it repeatedly, they have greater incentives to cooperate.
Multiplayer dilemmas
Many real-life dilemmas involve multiple players. Although metaphorical,
Garrett Hardin
Garrett James Hardin (April 21, 1915 – September 14, 2003) was an American ecologist and microbiologist. He focused his career on the issue of human overpopulation, and is best known for his exposition of the tragedy of the commons in a 1968 p ...
's
tragedy of the commons
The tragedy of the commons is the concept that, if many people enjoy unfettered access to a finite, valuable resource, such as a pasture, they will tend to overuse it and may end up destroying its value altogether. Even if some users exercised vo ...
may be viewed as an example of a multi-player generalization of the prisoner's dilemma: each villager makes a choice for personal gain or restraint. The collective reward for unanimous or frequent defection is very low payoffs and the destruction of the commons.
The commons are not always exploited:
William Poundstone, in a book about the prisoner's dilemma, describes a situation in New Zealand where newspaper boxes are left unlocked. It is possible for people to
take a paper without paying (defecting), but very few do, feeling that if they do not pay then neither will others, destroying the system. Subsequent research by
Elinor Ostrom
Elinor Claire "Lin" Ostrom (née Awan; August 7, 1933 – June 12, 2012) was an American Political science, political scientist and Political economy, political economist whose work was associated with New institutional economics, New Institution ...
, winner of the 2009
Nobel Memorial Prize in Economic Sciences
The Nobel Memorial Prize in Economic Sciences, officially the Sveriges Riksbank Prize in Economic Sciences in Memory of Alfred Nobel (), commonly referred to as the Nobel Prize in Economics(), is an award in the field of economic sciences adminis ...
, hypothesized that the tragedy of the commons is oversimplified, with the negative outcome influenced by outside influences. Without complicating pressures, groups communicate and manage the commons among themselves for their mutual benefit, enforcing social norms to preserve the resource and achieve the maximum good for the group, an example of effecting the best-case outcome for prisoner's dilemma.
Academic settings
The prisoner's dilemma has been used in various academic settings to illustrate the complexities of cooperation and competition. One notable example is the classroom experiment conducted by sociology professor Dan Chambliss at
Hamilton College
Hamilton College is a Private college, private Liberal arts colleges in the United States, liberal arts college in Clinton, Oneida County, New York, Clinton, New York. It was established as the Hamilton-Oneida Academy in 1793 and received its c ...
in the 1980s. Starting in 1981, Chambliss proposed that if no student took the final exam, everyone would receive an A, but if even one student took it, those who didn't would receive a zero. In 1988,
John Werner, a first-year student, successfully organized his classmates to boycott the exam, demonstrating a practical application of game theory and the prisoner's dilemma concept.
Nearly 25 years later, a similar incident occurred at
Johns Hopkins University
The Johns Hopkins University (often abbreviated as Johns Hopkins, Hopkins, or JHU) is a private university, private research university in Baltimore, Maryland, United States. Founded in 1876 based on the European research institution model, J ...
in 2013. Professor Peter Fröhlich's grading policy scaled final exams according to the highest score, meaning that if everyone received the same score, they would all get an A. Students in Fröhlich's classes organized a boycott of the final exam, ensuring that no one took it. As a result, every student received an A, successfully solving the prisoner's dilemma in a mutually optimal way without iteration.
These examples highlight how the prisoner's dilemma can be used to explore cooperative behavior and strategic decision-making in educational contexts.
Related games
Closed-bag exchange
Douglas Hofstadter
Douglas Richard Hofstadter (born 15 February 1945) is an American cognitive and computer scientist whose research includes concepts such as the sense of self in relation to the external world, consciousness, analogy-making, Strange loop, strange ...
suggested that people often find problems such as the prisoner's dilemma problem easier to understand when it is illustrated in the form of a simple game, or trade-off. One of several examples he used was "closed bag exchange":
''Friend or Foe?''
''
Friend or Foe?'' is a game show that aired from 2002 to 2003 on the
Game Show Network
Game Show Network (GSN) is an American basic cable channel owned by the television network division of Sony Pictures Television. The channel's programming is primarily dedicated to game shows, including reruns of acquired game shows, along wit ...
in the US. On the game show, three pairs of people compete. When a pair is eliminated, they play a game similar to the prisoner's dilemma to determine how the winnings are split. If they both cooperate (Friend), they share the winnings 50–50. If one cooperates and the other defects (Foe), the defector gets all the winnings, and the cooperator gets nothing. If both defect, both leave with nothing. Notice that the reward matrix is slightly different from the standard one given above, as the rewards for the "both defect" and the "cooperate while the opponent defects" cases are identical. This makes the "both defect" case a weak equilibrium, compared with being a strict equilibrium in the standard prisoner's dilemma. If a contestant knows that their opponent is going to vote "Foe", then their own choice does not affect their own winnings. In a specific sense, ''Friend or Foe'' has a rewards model between prisoner's dilemma and the
game of Chicken
The game of chicken, also known as the hawk-dove game or snowdrift game, is a model of conflict for two players in game theory. The principle of the game is that while the ideal outcome is for one player to yield (to avoid the worst outcome if n ...
.
This is the rewards matrix:
This payoff matrix has also been used on the
British
British may refer to:
Peoples, culture, and language
* British people, nationals or natives of the United Kingdom, British Overseas Territories and Crown Dependencies.
* British national identity, the characteristics of British people and culture ...
television
Television (TV) is a telecommunication medium for transmitting moving images and sound. Additionally, the term can refer to a physical television set rather than the medium of transmission. Television is a mass medium for advertising, ...
programs ''Trust Me'', ''
Shafted
''Shafted'' was a British game show that aired on ITV from 5 to 26 November 2001 and was hosted by Robert Kilroy-Silk.
Format
The game begins with six players and is played in five rounds. In the first round, each player must secretly declar ...
'', ''
The Bank Job
''The Bank Job'' is a 2008 heist thriller film directed by Roger Donaldson and written by Dick Clement and Ian La Frenais. It is based on the 1971 burglary of Lloyds Bank safety deposit boxes in Baker Street. It stars Jason Statham.
The ...
'' and ''
Golden Balls
''Golden Balls'' is a British daytime game show that was presented by Jasper Carrott. It was broadcast on the ITV network from 18 June 2007 to 18 December 2009.
Gameplay Round 1
At the back of the studio is the "Golden Bank," a giant contrap ...
'', and on the
American game show ''
Take It All'', as well as for the winning couple on the reality shows ''
Bachelor Pad
''Bachelor Pad'' is an elimination-style two-hour American reality television game show that debuted on August 9, 2010 on ABC. The show features contestants from '' The Bachelor'' and ''The Bachelorette'', who compete for a final cash prize of $ ...
'' and ''
Love Island''. Game data from the ''
Golden Balls
''Golden Balls'' is a British daytime game show that was presented by Jasper Carrott. It was broadcast on the ITV network from 18 June 2007 to 18 December 2009.
Gameplay Round 1
At the back of the studio is the "Golden Bank," a giant contrap ...
'' series has been analyzed by a team of economists, who found that cooperation was "surprisingly high" for amounts of money that would seem consequential in the real world but were comparatively low in the context of the game.
Iterated snowdrift
Researchers from the
University of Lausanne
The University of Lausanne (UNIL; ) in Lausanne, Switzerland, was founded in 1537 as a school of Protestant theology, before being made a university in 1890. The university is the second-oldest in Switzerland, and one of the oldest universities ...
and the
University of Edinburgh
The University of Edinburgh (, ; abbreviated as ''Edin.'' in Post-nominal letters, post-nominals) is a Public university, public research university based in Edinburgh, Scotland. Founded by the City of Edinburgh Council, town council under th ...
have suggested that the "Iterated Snowdrift Game" may more closely reflect real-world social situations, although this model is actually a
chicken game. In this model, the risk of being exploited through defection is lower, and individuals always gain from taking the cooperative choice. The snowdrift game imagines two drivers who are stuck on opposite sides of a
snowdrift
A snowdrift is a deposit of snow sculpted by wind into a mound during a snowstorm. Snowdrifts resemble sand dunes and are formed in a similar manner, namely, by wind moving light snow and depositing it when the wind has virtually stopped, usu ...
, each of whom is given the option of shoveling snow to clear a path or remaining in their car. A player's highest payoff comes from leaving the opponent to clear all the snow by themselves, but the opponent is still nominally rewarded for their work.
This may better reflect real-world scenarios, the researchers giving the example of two scientists collaborating on a report, both of whom would benefit if the other worked harder. "But when your collaborator doesn't do any work, it's probably better for you to do all the work yourself. You'll still end up with a completed project."
Coordination games
In coordination games, players must coordinate their strategies for a good outcome. An example is two cars that abruptly meet in a blizzard; each must choose whether to swerve left or right. If both swerve left, or both right, the cars do not collide. The local
left- and right-hand traffic
Left-hand traffic (LHT) and right-hand traffic (RHT) are the practices, in bidirectional traffic, of keeping to the left side or to the right side of the road, respectively. They are fundamental to traffic flow, and are sometimes called the ' ...
convention helps to co-ordinate their actions.
Symmetrical co-ordination games include
Stag hunt
In game theory, the stag hunt, sometimes referred to as the assurance game, trust dilemma or common interest game, describes a conflict between safety and social cooperation. The stag hunt problem originated with philosopher Jean-Jacques Roussea ...
and
Bach or Stravinsky.
Asymmetric prisoner's dilemmas
A more general set of games is asymmetric. As in the prisoner's dilemma, the best outcome is cooperation, and there are motives for defection. Unlike the symmetric prisoner's dilemma, though, one player has more to lose and/or more to gain than the other. Some such games have been described as a prisoner's dilemma in which one prisoner has an
alibi
An alibi (, from the Latin, '' alibī'', meaning "somewhere else") is a statement by a person under suspicion in a crime that they were in a different place when the offence was committed. During a police investigation, all suspects are usually a ...
, hence the term "alibi game".
In experiments, players getting unequal payoffs in repeated games may seek to maximize profits, but only under the condition that both players receive equal payoffs; this may lead to a stable equilibrium strategy in which the disadvantaged player defects every X game, while the other always co-operates. Such behavior may depend on the experiment's social norms around fairness.
Software
Several software packages have been created to run simulations and tournaments of the prisoner's dilemma, some of which have their source code available:
* The source code for the
second tournament run by Robert Axelrod (written by Axelrod and many contributors in
Fortran)
* Prison, a library written in
Java
Java is one of the Greater Sunda Islands in Indonesia. It is bordered by the Indian Ocean to the south and the Java Sea (a part of Pacific Ocean) to the north. With a population of 156.9 million people (including Madura) in mid 2024, proje ...
, last updated in 1998
* Axelrod-Python, written in
Python
Python may refer to:
Snakes
* Pythonidae, a family of nonvenomous snakes found in Africa, Asia, and Australia
** ''Python'' (genus), a genus of Pythonidae found in Africa and Asia
* Python (mythology), a mythical serpent
Computing
* Python (prog ...
* Evoplex, a fast agent-based modeling program released in 2018 by Marcos Cardinot
In fiction
Hannu Rajaniemi set the opening scene of his ''
The Quantum Thief'' trilogy in a "dilemma prison". The main theme of the series has been described as the "inadequacy of a binary universe" and the ultimate antagonist is a character called the All-Defector. The first book in the series was published in 2010, with the two sequels, ''
The Fractal Prince'' and ''
The Causal Angel'', published in 2012 and 2014, respectively.
A game modeled after the iterated prisoner's dilemma is a central focus of the 2012 video game ''
Zero Escape: Virtue's Last Reward'' and a minor part in its 2016 sequel ''
Zero Escape: Zero Time Dilemma''.
In ''
The Mysterious Benedict Society and the Prisoner's Dilemma
''The Mysterious Benedict Society and the Prisoner's Dilemma'' is a 2009 children's novel written by Trenton Lee Stewart and illustrated by Diana Sudyka. For a decade it remained the third and final book in the ''Mysterious Benedict Society'' ...
'' by
Trenton Lee Stewart, the main characters start by playing a version of the game and escaping from the "prison" altogether. Later, they become actual prisoners and escape once again.
In ''
The Adventure Zone
''The Adventure Zone'' is a weekly comedy and adventure actual play podcast, in which the McElroy family play ''Dungeons & Dragons'' along with other role-playing games. The show is distributed by the Maximum Fun network and hosted by brother ...
: Balance'' during ''The Suffering Game'' subarc, the player characters are twice presented with the prisoner's dilemma during their time in two liches' domain, once cooperating and once defecting.
In the eighth novel from the author James S. A. Corey, ''
Tiamat's Wrath'', Winston Duarte explains the prisoner's dilemma to his 14-year-old daughter, Teresa, to train her in strategic thinking.
The 2008 film ''
The Dark Knight
''The Dark Knight'' is a 2008 superhero film directed by Christopher Nolan, from a screenplay co-written with his brother Jonathan. Based on the DC Comics superhero Batman, it is the sequel to ''Batman Begins'' (2005), and the second inst ...
'' includes a scene loosely based on the problem in which the
Joker rigs two ferries, one containing prisoners and the other containing civilians, arming both groups with the means to detonate the bomb on each other's ferries, threatening to detonate them both if they hesitate.
In reality TV
In episode 9 of the second series of the Australian reality TV show ''
The Traitors
''The Traitors'' is a reality game show franchise created by the All3Media
All3Media Limited is a British worldwide independent television, film, and digital production and distribution company based in London. The All3Media group cons ...
'', three "traitors" reached the End Game. They participated in the "Traitor's Dilemma": they were given a choice to Share or Steal the prize pot. If all three traitors chose to share, then each would receive a third of the pot. If one or two traitors chose to steal and one or two chose to share, then those who shared would win nothing and the pot would be divided among those who stole. If all three chose to steal, then no one would win anything.
In moral philosophy
The prisoner's dilemma is commonly used as a thinking tool in
moral philosophy
Ethics is the philosophical study of moral phenomena. Also called moral philosophy, it investigates normative questions about what people ought to do or which behavior is morally right. Its main branches include normative ethics, applied et ...
as an illustration of the potential tension between the benefit of the individual and the benefit of the community.
Both the one-shot and the iterated prisoner's dilemma have applications in moral philosophy. Indeed, many of the moral situations, such as
genocide
Genocide is violence that targets individuals because of their membership of a group and aims at the destruction of a people. Raphael Lemkin, who first coined the term, defined genocide as "the destruction of a nation or of an ethnic group" by ...
, are not easily repeated more than once. Moreover, in many situations, the previous rounds' outcomes are unknown to the players, since they are not necessarily the same (e.g. interaction with a panhandler on the street).
The philosopher
David Gauthier
David Gauthier (; 10 September 1932 – 9 November 2023) was a Canadian philosopher best known for his neo- Hobbesian or contractarian theory of morality, as developed in his 1986 book ''Morals by Agreement''.
Life and career
David Gauthier w ...
uses the prisoner's dilemma to show how morality and rationality can conflict.
Some game theorists have criticized the use of the prisoner's dilemma as a thinking tool in moral philosophy.
[ ]Kenneth Binmore
Kenneth George "Ken" Binmore, (born 27 September 1940) is an English mathematician, economist, and game theorist, a Professor Emeritus of Economics at University College London (UCL) and a Visiting Emeritus Professor of Economics at the Unive ...
argued that the prisoner's dilemma does not accurately describe the game played by humanity, which he argues is closer to a coordination game
A coordination game is a type of simultaneous game found in game theory. It describes the situation where a player will earn a higher payoff when they select the same course of action as another player. The game is not one of pure conflict, which ...
. Brian Skyrms
Brian Skyrms (born 1938) is an American philosopher, Distinguished Professor of Logic and Philosophy of Science and Economics at the University of California, Irvine, and a professor of philosophy at Stanford University. He has worked on problem ...
shares this perspective.
Steven Kuhn suggests that these views may be reconciled by considering that moral behavior can modify the payoff matrix of a game, transforming it from a prisoner's dilemma into other games.[
]
Pure and impure prisoner's dilemma
A prisoner's dilemma is considered "impure" if a mixed strategy may give better expected payoffs than a pure strategy. This creates the interesting possibility that the moral action from a utilitarian perspective (i.e., aiming at maximizing the good of an action) may require randomization of one's strategy, such as cooperating with 80% chance and defecting with 20% chance.
See also
* Abilene paradox
The Abilene paradox is a collective fallacy, in which a group of people collectively decide on a course of action that is counter to the preferences of most or all individuals in the group, while each individual believes it to be aligned with the ...
* Centipede game
In game theory, the centipede game, first introduced by Robert W. Rosenthal, Robert Rosenthal in 1981, is an extensive form game in which two players take turns choosing either to take a slightly larger share of an increasing pot, or to pass the p ...
* Collective action problem
A collective action problem or social dilemma is a situation in which all individuals would be better off cooperating but fail to do so because of conflicting interests between individuals that discourage joint action. The collective action proble ...
* Externality
In economics, an externality is an Indirect costs, indirect cost (external cost) or indirect benefit (external benefit) to an uninvolved third party that arises as an effect of another party's (or parties') activity. Externalities can be conside ...
* Folk theorem (game theory)
In game theory, folk theorems are a class of theorems describing an abundance of Nash equilibrium payoff profiles in repeated games . The original Folk Theorem concerned the payoffs of all the Nash equilibria of an infinitely repeated game. This ...
* Free-rider problem
In economics, the free-rider problem is a type of market failure that occurs when those who benefit from resources, public goods and common pool resources do not pay for them or under-pay. Free riders may overuse common pool resources by not ...
* Gift-exchange game
* Hobbesian trap
The Hobbesian trap (or Schelling's dilemma) is a theory that explains why preemptive strikes occur between two groups, out of bilateral fear of an imminent attack. Without outside influences, this situation will lead to a fear spiral ( catch-22 ...
* Innocent prisoner's dilemma
* ''Liar Game
''Liar Game'' (stylized in all caps) is a Japanese manga series written and illustrated by Shinobu Kaitani. It was serialized in Shueisha's manga magazine ''Weekly Young Jump'' from February 2005 to January 2015. It was adapted into a Ja ...
''
* Metagame
A metagame, broadly defined as "a game beyond the game", typically refers to either of two concepts: a game which revolves around a core game; or the strategies and approaches to playing a game. A metagame can serve a broad range of purposes, a ...
* Optional prisoner's dilemma
* Prisoner's dilemma and cooperation
* Public goods game
The public goods game is a standard of experimental economics. In the basic game, subjects Information asymmetry, secretly choose how many of their Private good, private tokens to put into a public pot. The payoff of each player is her "private co ...
* Reciprocal altruism
In evolutionary biology, reciprocal altruism is a behaviour whereby an organism acts in a manner that temporarily reduces its fitness while increasing another organism's fitness, with the expectation that the other organism will act in a similar m ...
* Rent-seeking
Rent-seeking is the act of growing one's existing wealth by manipulating the social or political environment without creating new wealth.
Rent-seeking activities have negative effects on the rest of society. They result in reduced economic effi ...
* Social preferences
Social preferences describe the human tendency to not only care about one's own material payoff, but also the reference group's payoff or/and the intention that leads to the payoff. Social preferences are studied extensively in behavioral and expe ...
* Subjective expected relative similarity
* Superrationality
* Swift trust theory
* Tragedy of the commons
The tragedy of the commons is the concept that, if many people enjoy unfettered access to a finite, valuable resource, such as a pasture, they will tend to overuse it and may end up destroying its value altogether. Even if some users exercised vo ...
* Traveler's dilemma
In game theory, the traveler's dilemma (sometimes abbreviated TD) is a non-zero-sum game in which each player proposes a payoff. The lower of the two proposals wins; the lowball player receives the lowball payoff plus a small bonus, and the highbal ...
* Unscrupulous diner's dilemma
In game theory, the unscrupulous diner's dilemma (or just diner's dilemma) is an ''n''-player prisoner's dilemma. The situation imagined is that several people go out to eat, and before ordering, they agree to split the cost equally between them. ...
Notes
References
Bibliography
*
Further reading
* Amadae, S. (2016)
"Prisoner's Dilemma"
''Prisoners of Reason''. Cambridge University Press
Cambridge University Press was the university press of the University of Cambridge. Granted a letters patent by King Henry VIII in 1534, it was the oldest university press in the world. Cambridge University Press merged with Cambridge Assessme ...
, NY, pp. 24–61.
*
*
* Bicchieri, Cristina (1993). Rationality and Coordination. Cambridge University Press
Cambridge University Press was the university press of the University of Cambridge. Granted a letters patent by King Henry VIII in 1534, it was the oldest university press in the world. Cambridge University Press merged with Cambridge Assessme ...
.
*
*
* Dresher, M. (1961). ''The Mathematics of Games of Strategy: Theory and Applications'' Prentice-Hall
Prentice Hall was a major American educational publisher. It published print and digital content for the 6–12 and higher-education market. It was an independent company throughout the bulk of the twentieth century. In its last few years it ...
, Englewood Cliffs, NJ.
* Greif, A. (2006). ''Institutions and the Path to the Modern Economy: Lessons from Medieval Trade.'' Cambridge University Press, Cambridge
Cambridge ( ) is a List of cities in the United Kingdom, city and non-metropolitan district in the county of Cambridgeshire, England. It is the county town of Cambridgeshire and is located on the River Cam, north of London. As of the 2021 Unit ...
, UK.
*
* Rapoport, Anatol and Albert M. Chammah (1965). ''Prisoner's Dilemma''. University of Michigan Press
The University of Michigan Press is a university press that is a part of Michigan Publishing at the University of Michigan Library. It publishes 170 new titles each year in the humanities and social sciences. Titles from the press have earn ...
.
*
External links
*
The Bowerbird's Dilemma
The Prisoner's Dilemma in ornithology – mathematical cartoon by Larry Gonick.
*
Dawkins: Nice Guys Finish First
Axelrod
Iterated Prisoner's Dilemma Python
Python may refer to:
Snakes
* Pythonidae, a family of nonvenomous snakes found in Africa, Asia, and Australia
** ''Python'' (genus), a genus of Pythonidae found in Africa and Asia
* Python (mythology), a mythical serpent
Computing
* Python (prog ...
library
Play Prisoner's Dilemma on ''oTree''
(N/A 11-5-17)
* Nicky Case'
Evolution of Trust
an example of the donation game
Iterated Prisoner's Dilemma online game
by Wayne Davis
What The Prisoner's Dilemma Reveals About Life, The Universe, and Everything
by Veritasium
Derek Alexander Muller (born 9 November 1982) is a Science communication, science communicator and media personality, best known for his YouTube channel Veritasium, which has over 17.8 million subscribers and 3.3 billion views as of April 2025. ...
{{DEFAULTSORT:Prisoner's dilemma
Dilemmas
Environmental studies
Inefficiency in game theory
Moral psychology
Non-cooperative games
Social psychology
Social science experiments
Thought experiments