In this chapter, we present the first simple connection between game theory and graph theory: we use graphs to analyze games and best-response dynamics. Using this graph-theoretic approach, we will be able to give a simple characterization of the class of games for which best-response dynamics converge. The characterization, as well as the method used to prove it, will be useful in the sequel.
3.1 A Graph Representation of Games
We can use graphs to analyze (normal-form) games as follows. Given a game , consider a directed graph where the set of vertices is the set of action profiles, and where we put a (directed) edge from a node to a node if and only if there exists some player such that (1) and differ only in component (i.e., ’s action) and (2) but ; that is, draw an edge between action profiles if we can go from the first action profile to the second in one step of BRD (i.e., one player improving its utility by switching to its best response). See Figures 3.1 and 3.2 for examples of such graphs for Bach–Stravinsky and the Traveler’s Dilemma.
We can now reinterpret BRD and PNE using standard graph-theoretic notions:
Note that it is not necessary that all nodes reach the same sink for BRD to converge; as in the Bach–Stravinsky game, some games can have multiple equilibria, or even multiple equilibria reachable from the same starting profile.
3.2 Characterizing Convergence of BRD
We can now use this graph representation to characterize the set of finite games for which BRD converges. Given a game , we define a potential (or “energy”) function , which maps outcome profiles to integers (denoting the “energy level”). A particularly natural potential function (which we will use later) is the utilitarian social welfare, or simply social welfare (SW), defined as the sum of all players’ utilities:
The notion of an ordinal potential function will be useful. Roughly speaking, a potential function is said to be ordinal if any profitable single-player deviation increases the potential:
Definition 3.1. is an ordinal potential function for a game if, for every action profile , every player , and every action , if
then
We may also consider a weaker form of a potential function, which we refer to as weakly ordinal, which simply requires the potential to increase for any profitable single-player deviation to a best response.
Definition 3.2. is a weakly ordinal potential function for a game if, for every action profile , every player , and every action such that but , we have that
Note that if is ordinal, then it is clearly also weakly ordinal, since if but then .
We now have the following characterization of the class of finite games for which BRD converges.
Theorem 3.1. BRD converges in a finite game if and only if has a weakly ordinal potential function.
Proof. We will prove each direction separately.
Claim 3.1. If a finite game has a weakly ordinal potential function, then BRD converges in .
Proof. Consider an arbitrary starting action profile (node) , and consider running BRD starting from . Each time some player best responds, the potential increases. Since the game has a finite number of outcomes, there are only a finite number of possible potentials; thus, after a finite number of best response steps, we must have reached the highest possible potential, which guarantees convergence as no more “deviations” are possible from that point.
■
Claim 3.2. If BRD converges in a finite game , then has a weakly ordinal potential function.
Proof. Consider some game where BRD converges, and consider the induced graph representation of . As noted above, since BRD always converges, is a acyclic, and thus every path from any starting point must eventually lead to a sink. We construct a potential function for as follows: for each action profile , let where is the length of the longest path from to any sink in . (In other words, is the largest number of steps that BRD can take to converge, starting from .)
We now show that each time we traverse an edge in the BRD graph (i.e., take one step in the BRD process) the potential increases—that is, decreases. Assume we traversed the edge and that . Let us then argue that . This directly follows since there exists some path of length from to some sink , and thus there exists a path of length from to by simply first traversing and then following .
■
The theorem directly follows from the above two claims.
■
In the sequel, we will use Theorem 3.1 as a way to show that BRD converges in the games we will be considering. In particular, for all those games, we will exhibit an ordinal potential function (which, as noted above, must also be weakly ordinal), and thus Theorem 3.1 can be used to deduce that BRD converges.
3.3 Better-response Dynamics
The fact that a game has an ordinal (as opposed to just weakly ordinal) potential function is interesting in its own right. In such games, a more general type of best-response dynamics—referred to as better-response dynamics—will always converge: better-response dynamics proceeds exactly as best-response dynamics except that players can switch to any profitable deviation (as opposed to just the best one). More precisely, at every step in the dynamics:
In particular, may not necessarily be the optimal response (i.e., a best response) to . For instance, consider the outcome in the Traveler’s Dilemma (which we defined in Section 1.6): player is currently getting in utility, and could “better-respond” by choosing ; in contrast, the only best response is choosing (which would yield utility ). See Figure 3.3 for an illustration of the (very complicated) “better-response dynamics graph” for the Traveler’s Dilemma.
Better-response dynamics captures situations where it may be hard for players to always find some best response (for instance, if the action space is large) but the players clearly want to change whenever they find some action that improves their utility—i.e., the players simply “myopically” try to improve their utility. Note that, since any valid execution of BRD is also a valid execution of the better-response dynamics, whenever better-response dynamics converges, so does BRD. More generally, we now have the following characterization of the class of games for which better-response dynamics converges.
Theorem 3.2. Better-response dynamics converges in a finite game if and only if has an ordinal potential function.
Proof. The proof is essentially identical to the proof of Theorem 3.1, except that in the “only-if direction” we replace the best-response graph by a “better-response graph.”
■
As mentioned, in the sequel, we will use ordinal potential functions to demonstrate convergence of BRD; those proofs thus also show convergence of better-response dynamics.
3.4 Games without PNE
As mentioned in chapter 1, there are finite games—such as the Rock–Paper–Scissors game—without any PNE. For such games, by Claim 1.4, BRD cannot converge. We end this section by illustrating in Figure 3.4 the BRD graph for the Rock–Paper–Scissors game. Notice that there is no sink in this graph; as such, the graph provides a visual proof of the inexistence of PNE in the Rock–Paper–Scissors game.
Notes
The notion of an ordinal potential function and the characterization of games for which better-response dynamics converge is due to Monderer and Shapley [MS96]. The use of potential functions to prove the existence of PNE dates back to the work by Rosenthal [Ros73].