game theory Game theory is the study of mathematical models of strategic interactions. It has applications in many fields of social science, and is used extensively in economics, logic, systems science and computer science. Initially, game theory addressed ...

, a game is said to be a potential game if the incentive of all players to change their

strategy Strategy (from Greek στρατηγία ''stratēgia'', "troop leadership; office of general, command, generalship") is a general plan to achieve one or more long-term or overall goals under conditions of uncertainty. In the sense of the " a ...

can be expressed using a single global function called the potential function. The concept originated in a 1996 paper by Dov Monderer and

Lloyd Shapley Lloyd Stowell Shapley (; June 2, 1923 – March 12, 2016) was an American mathematician and Nobel Memorial Prize-winning economist. He contributed to the fields of mathematical economics and especially game theory. Shapley is generally conside ...

. The properties of several types of potential games have since been studied. Games can be either ''ordinal'' or ''cardinal'' potential games. In cardinal games, the difference in individual payoffs for each player from individually changing one's strategy, other things equal, has to have the same value as the difference in values for the potential function. In ordinal games, only the signs of the differences have to be the same. The potential function is a useful tool to analyze equilibrium properties of games, since the incentives of all players are mapped into one function, and the set of pure

Nash equilibria In game theory, the Nash equilibrium is the most commonly used solution concept for non-cooperative games. A Nash equilibrium is a situation where no player could gain by changing their own strategy (holding all other players' strategies fixed) ...

can be found by locating the local optima of the potential function. Convergence and finite-time convergence of an iterated game towards a Nash equilibrium can also be understood by studying the potential function. Potential games can be studied as

repeated game In game theory, a repeated game (or iterated game) is an extensive form game that consists of a number of repetitions of some base game (called a stage game). The stage game is usually one of the well-studied 2-person games. Repeated games capt ...

s with state so that every round played has a direct consequence on game's state in the next round. This approach has applications in distributed control such as distributed resource allocation, where players without a central correlation mechanism can cooperate to achieve a globally optimal resource distribution.

Definition

Let

N

be the number of players,

A

the set of action profiles over the action sets

A_

of each player and

u_i:A \to \mathbb

be the payoff function for player

1\le i\le N

. Given a game

G=(N,A=A_\times\ldots\times A_, u: A \rightarrow \reals^N)

, we say that

G

is a potential game with an exact (weighted, ordinal, generalized ordinal, best response) potential function if

\Phi: A \rightarrow \reals

is an exact (weighted, ordinal, generalized ordinal, best response, respectively) potential function for

G

. Here,

\Phi

is called * an exact potential function if

\forall i, \forall ,\ \forall

, ::

\Phi(a'_,a_)-\Phi(a''_,a_) = u_(a'_,a_)-u_(a''_,a_)

::That is: when player

i

switches from action

a'

to action

a''

, the change in the potential

\Phi

equals the change in the utility of that player. * a weighted potential function if there is a vector

w \in \reals_^N

such that

\forall i,\forall ,\ \forall

, ::

\Phi(a'_,a_)-\Phi(a''_,a_) = w_(u_(a'_,a_)-u_(a''_,a_))

That is: when a player switches action, the change in

\Phi

equals the change in the player's utility, times a positive player-specific weight. Every exact PF is a weighted PF with ''w_i''=1 for all ''i''. * an ordinal potential function if

\forall i,\forall ,\ \forall

, ::

u_(a'_,a_)-u_(a''_,a_)>0 \Leftrightarrow
 \Phi(a'_,a_)-\Phi(a''_,a_)>0

That is: when a player switches action, the ''sign'' of the change in

\Phi

equals the ''sign'' of the change in the player's utility, whereas the magnitude of change may differ. Every weighted PF is an ordinal PF. * a generalized ordinal potential function if

\forall i, \forall ,\  \forall

, ::

u_(a'_,a_)-u_(a''_,a_)>0 \Rightarrow
 \Phi(a'_,a_)-\Phi(a''_,a_) >0

That is: when a player switches action, if the player's utility increases, then the potential increases (but the opposite is not necessarily true). Every ordinal PF is a generalized-ordinal PF. *a best-response potential function if

\forall i\in N,\ \forall

, ::

b_i(a_)=\arg\max_ \Phi(a_i,a_)

where

b_i(a_)

is the best action for player

i

given

a_

. Note that while there are

N

utility functions, one for each player, there is only one potential function. Thus, through the lens of potential functions, the players become interchangeable (in the sense of one of the definitions above). Because of this ''symmetry'' of the game, decentralized algorithms based on the shared potential function often lead to convergence (in some of sense) to a Nash equilibria.

A simple example

In ''a'' 2-player, 2-action game with externalities, individual players' payoffs are given by the function , where is players i's action, is the opponent's action, and ''w'' is ''a'' positive

externality In economics, an externality is an Indirect costs, indirect cost (external cost) or indirect benefit (external benefit) to an uninvolved third party that arises as an effect of another party's (or parties') activity. Externalities can be conside ...

from choosing the same action. The action choices are +1 and −1, as seen in the

payoff matrix In game theory, normal form is a description of a ''game''. Unlike extensive-form game, extensive form, normal-form representations are not Graph (discrete mathematics), graphical ''per se'', but rather represent the game by way of a matrix (mathe ...

in Figure 1. This game has ''a'' potential function . If player 1 moves from −1 to +1, the payoff difference is . The change in potential is . The solution for player 2 is equivalent. Using numerical values , , , this example transforms into ''a'' simple battle of the sexes, as shown in Figure 2. The game has two pure Nash equilibria, and . These are also the local maxima of the potential function (Figure 3). The only stochastically stable equilibrium is , the global maximum of the potential function. A 2-player, 2-action game cannot be ''a'' potential game unless :

_(+1,-1)+u_1(-1,+1)_1(+1,+1)+u_1(-1,-1) =_(+1,-1)+u_2(-1,+1)_2(+1,+1)+u_2(-1,-1)

Potential games and congestion games

Exact potential games are equivalent to congestion games: Rosenthal. proved that every congestion game has an exact potential; Monderer and Shapley proved the opposite direction: every game with an exact potential function is a congestion game.

Potential games and improvement paths

An improvement path (also called Nash dynamics) is a sequence of strategy-vectors, in which each vector is attained from the previous vector by a single player switching his strategy to a strategy that strictly increases his utility. If a game has a generalized-ordinal-potential function

\Phi

, then

\Phi

is strictly increasing in every improvement path, so every improvement path is acyclic. If, in addition, the game has finitely many strategies, then every improvement path must be finite. This property is called the finite improvement property (FIP). We have just proved that every finite generalized-ordinal-potential game has the FIP. The opposite is also true: every finite game has the FIP has a generalized-ordinal-potential function. The terminal state in every finite improvement path is a Nash equilibrium, so FIP implies the existence of a pure-strategy Nash equilibrium. Moreover, it implies that a Nash equilibrium can be computed by a distributed process, in which each agent only has to improve his own strategy. A best-response path is a special case of an improvement path, in which each vector is attained from the previous vector by a single player switching his strategy to a best-response strategy. The property that every best-response path is finite is called the finite best-response property (FBRP). FBRP is weaker than FIP, and it still implies the existence of a pure-strategy Nash equilibrium. It also implies that a Nash equilibrium can be computed by a distributed process, but the computational burden on the agents is higher than with FIP, since they have to compute a best-response. An even weaker property is weak-acyclicity (WA). It means that, for any initial strategy-vector, ''there exists'' a finite best-response path starting at that vector. Weak-acyclicity is not sufficient for existence of a potential function (since some improvement-paths may be cyclic), but it is sufficient for the existence of pure-strategy Nash equilibrium. It implies that a Nash equilibrium can be computed almost-surely by a stochastic distributed process, in which at each point, a player is chosen at random, and this player chooses a best-strategy at random.

References

External links

* Lecture notes of Yishay Mansour abou
Potential and congestion games
* Section 19 in: * Non technical exposition by Huw Dixon of the inevitability of collusio
Chapter 8, Donut world and the duopoly archipelago
{{microeconomics Game theory game classes

Definition

A simple example

Potential games and congestion games

Potential games and improvement paths

See also

References

External links