game theory Game theory is the study of mathematical models of strategic interactions among rational agents. Myerson, Roger B. (1991). ''Game Theory: Analysis of Conflict,'' Harvard University Press, p.&nbs1 Chapter-preview links, ppvii–xi It has appli ...

, folk theorems are a class of theorems describing an abundance of

Nash equilibrium In game theory, the Nash equilibrium, named after the mathematician John Nash, is the most common way to define the solution of a non-cooperative game involving two or more players. In a Nash equilibrium, each player is assumed to know the equili ...

payoff profiles in

repeated games In game theory, a repeated game is an extensive form game that consists of a number of repetitions of some base game (called a stage game). The stage game is usually one of the well-studied list of games in game theory, 2-person games. Repeated ga ...

. The original Folk Theorem concerned the payoffs of all the Nash equilibria of an infinitely repeated game. This result was called the Folk Theorem because it was widely known among game theorists in the 1950s, even though no one had published it. Friedman's (1971) Theorem concerns the payoffs of certain subgame-perfect Nash equilibria (SPE) of an infinitely repeated game, and so strengthens the original Folk Theorem by using a stronger equilibrium concept: subgame-perfect Nash equilibria rather than Nash equilibria. The Folk Theorem suggests that if the players are patient enough and far-sighted (i.e. if the discount factor

\delta \to 1

), then repeated interaction can result in virtually any average payoff in an SPE equilibrium. "Virtually any" is here technically defined as "feasible" and "individually rational". For example, in the one-shot

Prisoner's Dilemma The Prisoner's Dilemma is an example of a game analyzed in game theory. It is also a thought experiment that challenges two completely rational agents to a dilemma: cooperate with their partner for mutual reward, or betray their partner ("def ...

, both players cooperating is not a Nash equilibrium. The only Nash equilibrium is that both players defect, which is also a mutual minmax profile. One folk theorem says that, in the infinitely repeated version of the game, provided players are sufficiently patient, there is a Nash equilibrium such that both players cooperate on the equilibrium path. But if the game is repeated a known finite number of times,

backward induction Backward induction is the process of reasoning backwards in time, from the end of a problem or situation, to determine a sequence of optimal actions. It proceeds by examining the last point at which a decision is to be made and then identifying wha ...

shows that both players will play the one-shot Nash equilibrium in each period, i.e. they will defect each time.

Setup and definitions

We start with a ''basic game'', also known as the ''stage game'', which is a ''n-''player game. In this game, each player has finitely many actions to choose from, and they make their choices simultaneously and without knowledge of the other player's choices. The collective choices of the players leads to a ''payoff profile,'' i.e. to a payoff for each of the players. The mapping from collective choices to payoff profiles is known to the players, and each player aims to maximize their payoff. If the collective choice is denoted by ''x,'' the payoff that player ''i'' receives, also known as player ''i'''s ''utility'', will be denoted by

u_i(x)

. We then consider a repetition of this stage game, finitely or infinitely many times. In each repetition, each player chooses one of their stage game options, and when making that choice, they may take into account the choices of the other players in the prior iterations. In this repeated game, a ''strategy'' for one of the players is a deterministic rule that specifies the player's choice in each iteration of the stage game, based on all other player's choices in the prior iterations. A choice of strategy for each of the players is a ''strategy profile,'' and it leads to a payout profile for the repeated game. There are a number of different ways such a strategy profile can be translated into a payout profile, outlined below. Any

payoff profile of a repeated game must satisfy two properties: #Individual rationality: the payoff must weakly dominate the minmax payoff profile of the constituent stage game. That is, the equilibrium payoff of each player must be at least as large as the minmax payoff of that player. This is because a player achieving less than his minmax payoff always has incentive to deviate by simply playing his minmax strategy at every history. #Feasibility: the payoff must be a

convex combination In convex geometry and vector algebra, a convex combination is a linear combination of points (which can be vectors, scalars, or more generally points in an affine space) where all coefficients are non-negative and sum to 1. In other w ...

of possible payoff profiles of the stage game. This is because the payoff in a repeated game is just a weighted average of payoffs in the basic games. Folk theorems are partially converse claims: they say that, under certain conditions (which are different in each folk theorem), ''every'' payoff profile that is both individually rational and feasible can be realized as a Nash equilibrium payoff profile of the repeated game. There are various folk theorems; some relate to finitely-repeated games while others relate to infinitely-repeated games.

Infinitely-repeated games without discounting

In the undiscounted model, the players are patient. They don't differentiate between utilities in different time periods. Hence, their utility in the repeated game is represented by the sum of utilities in the basic games. When the game is infinite, a common model for the utility in the infinitely-repeated game is the limit inferior of mean utility: If the game results in a path of outcomes

x_t

, where

x_t

denotes the collective choices of the players at iteration ''t'' (''t=0,1,2,...),'' player ''is utility is defined as ::

U_i = \liminf_ \frac \sum_^T u_i(x_t),

where

u_i

is the basic-game utility function of player ''i''. An infinitely-repeated game without discounting is often called a "supergame". The folk theorem in this case is very simple and contains no pre-conditions: every individually rational and feasible payoff profile in the basic game is a Nash equilibrium payoff profile in the repeated game. The proof employs what is called a ''grim'' or ''

grim trigger In game theory, grim trigger (also called the grim strategy or just grim) is a trigger strategy for a repeated game. Initially, a player using grim trigger will cooperate, but as soon as the opponent defects (thus satisfying the trigger condition) ...

'' strategy. All players start by playing the prescribed action and continue to do so until someone deviates. If player ''i'' deviates, all other players switch to picking the action which minmaxes player ''i'' forever after. The one-stage gain from deviation contributes 0 to the total utility of player ''i''. The utility of a deviating player cannot be higher than his minmax payoff. Hence all players stay on the intended path and this is indeed a Nash equilibrium.

Subgame perfection

The above Nash equilibrium is not always

subgame perfect In game theory, a subgame perfect equilibrium (or subgame perfect Nash equilibrium) is a refinement of a Nash equilibrium used in dynamic games. A strategy profile is a subgame perfect equilibrium if it represents a Nash equilibrium of every sub ...

. If punishment is costly for the punishers, the threat of punishment is not credible. A subgame perfect equilibrium requires a slightly more complicated strategy. The punishment should not last forever; it should last only a finite time which is sufficient to wipe out the gains from deviation. After that, the other players should return to the equilibrium path. The limit-of-means criterion ensures that any finite-time punishment has no effect on the final outcome. Hence, limited-time punishment is a subgame-perfect equilibrium. * Coalition subgame-perfect equilibria: An equilibrium is called a ''coalition Nash equilibrium'' if no coalition can gain from deviating. It is called a ''coalition subgame-perfect equilibrium'' if no coalition can gain from deviating after any history. With the limit-of-means criterion, a payoff profile is attainable in coalition-Nash-equilibrium or in coalition-subgame-perfect-equilibrium, if-and-only-if it is Pareto efficient and weakly-coalition-individually-rational.For every nonempty coalition

B

, there is a strategy of the other players (

N\setminus B

) such that for any strategy played by

B

, the payoff when

B

plays

c^B

is not trictly better for ''all'' members of

B

Overtaking

Some authors claim that the limit-of-means criterion is unrealistic, because it implies that utilities in any finite time-span contribute 0 to the total utility. However, if the utilities in any finite time-span contribute a positive value, and the value is undiscounted, then it is impossible to attribute a finite numeric utility to an infinite outcome sequence. A possible solution to this problem is that, instead of defining a numeric utility for each infinite outcome sequence, we just define the preference relation between two infinite sequences. We say that agent

i

(strictly) prefers the sequence of outcomes

y_t

over the sequence

x_t

, if: ::

\liminf_ \sum_^T ( u_i(y_t) - u_i(x_t)) > 0

For example, consider the sequences

u_i(x)=(0,0,0,0,\ldots)

and

u_i(y)=(-1,2,0,0,\ldots)

. According to the limit-of-means criterion, they provide the same utility to player ''i,'' but according to the overtaking criterion,

y

is better than

x

for player ''i''. See

overtaking criterion In economics, the overtaking criterion is used to compare infinite streams of outcomes. Mathematically, it is used to properly define a notion of optimality for a problem of optimal control on an unbounded time interval. Often, the decisions of a ...

for more information. The folk theorems with the overtaking criterion are slightly weaker than with the limit-of-means criterion. Only outcomes that are ''strictly'' individually rational, can be attained in Nash equilibrium. This is because, if an agent deviates, he gains in the short run, and this gain can be wiped out only if the punishment gives the deviator strictly less utility than the agreement path. The following folk theorems are known for the overtaking criterion: * Strict stationary equilibria: A Nash equilibrium is called ''strict'' if each player strictly prefers the infinite sequence of outcomes attained in equilibrium, over any other sequence he can deviate to. A Nash equilibrium is called ''stationary'' if the outcome is the same in each time-period. An outcome is attainable in strict-stationary-equilibrium if-and-only-if for every player the outcome is strictly better than the player's minimax outcome. * Strict stationary subgame-perfect equilibria: An outcome is attainable in strict-stationary-subgame-perfect-equilibrium, if for every player the outcome is strictly better than the player's minimax outcome (note that this is not an "if-and-only-if" result). To achieve subgame-perfect equilibrium with the overtaking criterion, it is required to punish not only the player that deviates from the agreement path, but also every player that does not cooperate in punishing the deviant. ** The "stationary equilibrium" concept can be generalized to a "periodic equilibrium", in which a finite number of outcomes is repeated periodically, and the payoff in a period is the arithmetic mean of the payoffs in the outcomes. That mean payoff should be strictly above the minimax payoff. * Strict stationary coalition equilibria: With the overtaking criterion, if an outcome is attainable in coalition-Nash-equilibrium, then it is Pareto efficient and weakly-coalition-individually-rational. On the other hand, if it is Pareto efficient and strongly-coalition-individually-rational for every nonempty coalition

B

, there is a strategy of the other players (

N\setminus B

) such that for any strategy played by

B

, the payoff is strictly worse for ''at least one'' member of

B

. it can be attained in strict-stationary-coalition-equilibrium.

Infinitely-repeated games with discounting

Assume that the payoff of a player in an infinitely repeated game is given by the ''average discounted criterion'' with discount factor 0 < ''δ'' < 1: :

U_i = (1-\delta) \sum_ \delta^t u_i(x_t),

The discount factor indicates how patient the players are. The factor

(1-\delta)

is introduced so that the payoff remain bounded when

\delta\rightarrow 1

. The folk theorem in this case requires that the payoff profile in the repeated game strictly dominates the minmax payoff profile (i.e., each player receives strictly more than the minmax payoff). Let ''a'' be a strategy profile of the stage game with payoff profile ''u'' which strictly dominates the minmax payoff profile. One can define a Nash equilibrium of the game with ''u'' as resulting payoff profile as follows: :1. All players start by playing ''a'' and continue to play ''a'' if no deviation occurs. :2. If any one player, say player ''i'', deviated, play the strategy profile ''m'' which minmaxes ''i'' forever after. :3. Ignore multilateral deviations. If player ''i'' gets ''ε'' more than his minmax payoff each stage by following 1, then the potential loss from punishment is :

\frac \varepsilon.

If ''δ'' is close to 1, this outweighs any finite one-stage gain, making the strategy a Nash equilibrium. An alternative statement of this folk theorem allows the equilibrium payoff profile ''u'' to be any individually rational feasible payoff profile; it only requires there exist an individually rational feasible payoff profile that strictly dominates the minmax payoff profile. Then, the folk theorem guarantees that it is possible to approach ''u'' in equilibrium to any desired precision (for every ''ε'' there exists a Nash equilibrium where the payoff profile is a distance ''ε'' away from ''u'').

Subgame perfection

Attaining a

equilibrium in discounted games is more difficult than in undiscounted games. The cost of punishment does not vanish (as with the limit-of-means criterion). It is not always possible to punish the non-punishers endlessly (as with the overtaking criterion) since the discount factor makes punishments far away in the future irrelevant for the present. Hence, a different approach is needed: the punishers should be rewarded. This requires an additional assumption, that the set of feasible payoff profiles is full dimensional and the min-max profile lies in its interior. The strategy is as follows. :1. All players start by playing ''a'' and continue to play ''a'' if no deviation occurs. :2. If any one player, say player ''i'', deviated, play the strategy profile ''m'' which minmaxes ''i'' for ''N'' periods. (Choose ''N'' and ''δ'' large enough so that no player has incentive to deviate from phase 1.) :3. If no players deviated from phase 2, all player ''j'' ≠ ''i'' gets rewarded ''ε'' above ''js min-max forever after, while player ''i'' continues receiving his min-max. (Full-dimensionality and the interior assumption is needed here.) :4. If player ''j'' deviated from phase 2, all players restart phase 2 with ''j'' as target. :5. Ignore multilateral deviations. Player ''j'' ≠ ''i'' now has no incentive to deviate from the punishment phase 2. This proves the subgame perfect folk theorem.

Finitely-repeated games without discount

Assume that the payoff of player ''i'' in a game that is repeated ''T'' times is given by a simple arithmetic mean: :

U_i = \frac \sum_^T u_i(x_t)

A folk theorem for this case has the following additional requirement: ::In the basic game, for every player ''i'', there is a Nash-equilibrium

E_i

that is strictly better, for ''i'', than his minmax payoff. This requirement is stronger than the requirement for discounted infinite games, which is in turn stronger than the requirement for undiscounted infinite games. This requirement is needed because of the last step. In the last step, the only stable outcome is a Nash-equilibrium in the basic game. Suppose a player ''i'' gains nothing from the Nash equilibrium (since it gives him only his minmax payoff). Then, there is no way to punish that player. On the other hand, if for every player there is a basic equilibrium which is strictly better than minmax, a repeated-game equilibrium can be constructed in two phases: # In the first phase, the players alternate strategies in the required frequencies to approximate the desired payoff profile. # In the last phase, the players play the preferred equilibrium of each of the players in turn. In the last phase, no player deviates since the actions are already a basic-game equilibrium. If an agent deviates in the first phase, he can be punished by minmaxing him in the last phase. If the game is sufficiently long, the effect of the last phase is negligible, so the equilibrium payoff approaches the desired profile.

Applications

Folk theorems can be applied to a diverse number of fields. For example: *

Anthropology Anthropology is the scientific study of humanity, concerned with human behavior, human biology, cultures, societies, and linguistics, in both the present and past, including past human species. Social anthropology studies patterns of be ...

: in a community where all behavior is well known, and where members of the community know that they will continue to have to deal with each other, then any pattern of behavior (

tradition A tradition is a belief or behavior (folk custom) passed down within a group or society with symbolic meaning or special significance with origins in the past. A component of cultural expressions and folklore, common examples include holidays or ...

taboo A taboo or tabu is a social group's ban, prohibition, or avoidance of something (usually an utterance or behavior) based on the group's sense that it is excessively repulsive, sacred, or allowed only for certain persons.''Encyclopædia Britannica ...

s, etc.) may be sustained by

social norm Social norms are shared standards of acceptable behavior by groups. Social norms can both be informal understandings that govern the behavior of members of a society, as well as be codified into rules and laws. Social normative influences or soci ...

s so long as the individuals of the community are better off remaining in the community than they would be leaving the community (the minimax condition). * International politics: agreements between countries cannot be effectively enforced. They are kept, however, because relations between countries are long-term and countries can use "minimax strategies" against each other. This possibility often depends on the discount factor of the relevant countries. If a country is very impatient (pays little attention to future outcomes), then it may be difficult to punish it (or punish it in a credible way). On the other hand, MIT economist

Franklin Fisher Franklin Marvin Fisher (December 13, 1934 – April 29, 2019) was an American economist. He taught economics at the Massachusetts Institute of Technology from 1960 to 2004. Biography Fisher attended Harvard University, where he was inducted into ...

has noted that the folk theorem is not a positive theory.Fisher, Franklin M. ''Games Economists Play: A Noncooperative View'' The RAND Journal of Economics, Vol. 20, No. 1. (Spring, 1989), pp. 113–124, this particular discussion is on page 118 In considering, for instance,

oligopoly An oligopoly (from Greek ὀλίγος, ''oligos'' "few" and πωλεῖν, ''polein'' "to sell") is a market structure in which a market or industry is dominated by a small number of large sellers or producers. Oligopolies often result f ...

behavior, the folk theorem does not tell the economist what firms will do, but rather that cost and demand functions are not sufficient for a general theory of oligopoly, and the economists must include the context within which oligopolies operate in their theory. In 2007, Borgs et al. proved that, despite the folk theorem, in the general case computing the Nash equilibria for repeated games is not easier than computing the Nash equilibria for one-shot finite games, a problem which lies in the

PPAD In computer science, PPAD ("Polynomial Parity Arguments on Directed graphs") is a complexity class introduced by Christos Papadimitriou in 1994. PPAD is a subclass of TFNP based on functions that can be shown to be total by a parity argument. The ...

complexity class. The practical consequence of this is that no efficient (polynomial-time) algorithm is known that computes the strategies required by folk theorems in the general case.

Summary of folk theorems

The following table compares various folk theorems in several aspects: * Horizon – whether the stage game is repeated finitely or infinitely many times. * Utilities – how the utility of a player in the repeated game is determined from the player's utilities in the stage game iterations. * Conditions on ''G'' (the stage game) – whether there are any technical conditions that should hold in the one-shot game in order for the theorem to work. * Conditions on ''x'' (the target payoff vector of the repeated game) – whether the theorem works for any individually rational and feasible payoff vector, or only on a subset of these vectors. * Equilibrium type – if all conditions are met, what kind of equilibrium is guaranteed by the theorem – Nash or Subgame-perfect? * Punishment type – what kind of punishment strategy is used to deter players from deviating?

Notes

References

* * * * A set of introductory notes to the Folk Theorem. {{Game theory Game theory equilibrium concepts Theorems