Simple Games, Utility Functions, and Saddle Points

By goodmath on April 25, 2008.

Last time I wrote about Game Theory, I explained the basic idea of
zero-sum games. In its simplest form, a game can be described by a payoff matrix, where each dimension of the matrix is the set of strategies which can be selected by one player, and each entry in the matrix describes the payoffs that result when all players select their strategies. Informally, a game is zero-sum when the total payoff is zero - that is, when any positive payout to one player is balanced by payouts from other players.

We can formalize some of the basic ideas more precisely. Each move results in a state, and for each player p there is a function, u_p, called a utility function, which maps from a state to a utility value. Taking the player utility functions as components, we can define the game's utility function in terms of tuples: for players 1..n, the utility function maps from states to tuples of utility values: u(state) = (u₁(state), ..., u_n(state)).

With the idea of the utility function in place, we can formally describe what it means to be a zero-sum game. A game is zero-sum if and only if for all game-ending states s, Σ_{p ∈ Players} u_p(s) = 0.
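As a rough sketch of that definition, the check below takes the utility tuples for every game-ending state and tests whether each one sums to zero. The function name `is_zero_sum` and the two sample lists are made up for illustration, and the utilities are assumed to be integers so that an exact comparison with 0 is safe.

```python
from typing import Iterable, Sequence

def is_zero_sum(terminal_utilities: Iterable[Sequence[int]]) -> bool:
    """True if the players' utilities sum to zero in every game-ending state."""
    return all(sum(utilities) == 0 for utilities in terminal_utilities)

# Each tuple is (u_1(s), u_2(s)) for one game-ending state s (hypothetical data).
balanced = [(1, -1), (0, 0), (-1, 1)]
unbalanced = [(2, 1), (0, 3)]

print(is_zero_sum(balanced))    # True
print(is_zero_sum(unbalanced))  # False
```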

One of the tricky things to understand is that for what we, informally, think of as a game, there are multiple utility functions that can be used to study the game, and using a different utility function produces a different game according to game theory.

For an example that's come up in comments to the last post, think
of a soccer game. One possible utility function for looking at a game
of soccer is simple win/lose. That is, for team t, the utility function
assigns -1 to a loss, 0 to a tie, and 1 to a win. That's a perfectly reasonable utility function - and with it, soccer is a zero-sum game. But we can also measure goals: the utility of a game for a team is the number of goals that that team scored. We could also create an economic measure, where the utility of the game is the price-per-goal - the total cost of all of the players, divided by the number of goals. Or we could analyze the game in terms of players: for a goalie, the utility could be the ratio of scored goals
to total shots on goal.
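To make the point about utility functions concrete, here is a small sketch that scores the same soccer results under two of the utility functions described above. The state encoding (a pair of goal counts), the function names, and the sample scores are all assumptions made for illustration.

```python
from typing import Tuple

# Hypothetical state encoding: (goals by team 0, goals by team 1).
Score = Tuple[int, int]

def win_lose_utility(score: Score) -> Tuple[int, int]:
    """-1 for a loss, 0 for a tie, 1 for a win, for each team."""
    a, b = score
    if a > b:
        return (1, -1)
    if a < b:
        return (-1, 1)
    return (0, 0)

def goals_utility(score: Score) -> Tuple[int, int]:
    """Each team's utility is the number of goals it scored."""
    return score

final_scores = [(2, 1), (0, 0), (1, 3)]  # made-up results

win_lose_zero_sum = all(sum(win_lose_utility(s)) == 0 for s in final_scores)
goals_zero_sum = all(sum(goals_utility(s)) == 0 for s in final_scores)

print(win_lose_zero_sum)  # True
print(goals_zero_sum)     # False
```

Same matches, different utility function, different game: under win/lose the utilities always cancel, while under goals scored they generally do not.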

To "solve" a game, you need to find the strategy that maximizes your utility. In a single-move game, there often isn't any clear-cut solution, but there are some common approaches. One
of them is the maximin solution. Maximin is good at producing a reasonable result when you have absolutely no knowledge of what your opponent is going to do. You assume that no matter what strategy you choose, your opponent will choose the strategy with the worst possible result for you. So you want to select the strategy with the best possible worst outcome - the strategy with the maximum minimum utility.

For example, consider the following game:

	A	B	C
1	10	-5	2
2	2	-3	4
3	-10	4	-2

In this, player 1 selects from strategies 1, 2, and 3; player 2 selects
from A, B, and C. Positive utilities are payoffs to player 1, and negative utilities are payoffs to player 2.

Looking at this, what's the best strategy for player 1? In strategy
1, his worst outcome is losing 5. In strategy 2, his worst outcome is losing 3. And in strategy 3, his worst outcome is losing 10. So following the maximin solution, his best strategy is number 2, and the value of the
maximin is -3.
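The same reasoning can be written as a few lines of code. This is only a sketch: the dictionary layout and the name `maximin` are choices made for this example, with entries read as payoffs to player 1.

```python
# Payoff matrix from the example: rows are player 1's strategies 1-3,
# columns are player 2's strategies A-C, entries are payoffs to player 1.
payoffs = {
    1: {"A": 10, "B": -5, "C": 2},
    2: {"A": 2, "B": -3, "C": 4},
    3: {"A": -10, "B": 4, "C": -2},
}

def maximin(matrix):
    """Return (strategy, value) for the row whose worst payoff is largest."""
    worst_case = {row: min(cols.values()) for row, cols in matrix.items()}
    best_row = max(worst_case, key=worst_case.get)
    return best_row, worst_case[best_row]

strategy, value = maximin(payoffs)
print(strategy, value)  # 2 -3
```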

What about player 2? For her, the best outcome is the worst outcome for player 1. So she plays the minimax solution: she wants to find the
strategy that produces the minimum maximum payoff for player 1. The minimax is the same basic idea as the maximin; in fact, if you just reverse the sign of all of the utility values, then the maximin of the sign-reversed utility is the minimax of the original.

So, player two looks for the maximin of her own (sign-reversed) utilities. For strategy A, her worst outcome is losing 10. For strategy B, her worst outcome is losing 4. And for strategy C, her worst outcome is losing 4. So a first look with maximin says that B and C are tied - so she falls back to the second-worst outcome. In B, her second-worst outcome is winning 3; in C, her second-worst outcome is losing 2. So her best minimax strategy is B, and the minimax is 4.
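Player 2's search, including the tie-break, might be sketched as follows. One assumption beyond the text: the tie-break is generalized by sorting each column from player 2's worst outcome to her best and comparing those lists in order, so a tie on the worst outcome is settled by the second-worst, then the third, and so on. The function name is hypothetical.

```python
# Same example matrix; entries are payoffs to player 1,
# so large values are bad for player 2.
payoffs = [
    [10, -5, 2],
    [2, -3, 4],
    [-10, 4, -2],
]
columns = ["A", "B", "C"]

def minimax_with_tiebreak(matrix, labels):
    """Pick the column with the smallest maximum, breaking ties on later outcomes."""
    ranked = {}
    for j, label in enumerate(labels):
        column = [row[j] for row in matrix]
        # Worst outcome for player 2 first, then second-worst, and so on.
        ranked[label] = sorted(column, reverse=True)
    # Lists compare element by element, which implements the tie-break.
    best = min(ranked, key=ranked.get)
    return best, ranked[best][0]

print(minimax_with_tiebreak(payoffs, columns))  # ('B', 4)
```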

Very rarely, there's a kind of optimal solution to a game. It's called a
saddle point. In a game with a saddle point, the minimax and the
maximin are equal. This is optimal, because it's a solution where both players
agree that the specific outcome is the best outcome that they could count on.

For example, look at the game below:

	A	B	C
1	10	5	2
2	-2	0	1
3	8	3	-4

For player 1, strategy 1 has a minimum of 2; strategy 2 has a minimum of -2; and
strategy 3 has a minimum of -4. So the maximin is strategy 1, with a value of 2.

For player 2, strategy A has a maximum of 10; strategy B has a maximum of 5; and strategy C has a maximum of 2. So the minimax strategy is strategy C,
with a value of 2.

So (1,C) is a saddle point, and players one and two are in agreement about
what the best possible outcome of the game is.
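A saddle point can also be found mechanically: it is an entry that is the smallest in its row and the largest in its column. The sketch below checks every entry of the matrix above; the function and variable names are assumptions for this example.

```python
# Second example matrix; entries are payoffs to player 1.
payoffs = [
    [10, 5, 2],
    [-2, 0, 1],
    [8, 3, -4],
]
row_labels = [1, 2, 3]
col_labels = ["A", "B", "C"]

def saddle_points(matrix):
    """Return (row, col) index pairs that are a row minimum and a column maximum."""
    points = []
    for i, row in enumerate(matrix):
        for j, value in enumerate(row):
            column = [r[j] for r in matrix]
            if value == min(row) and value == max(column):
                points.append((i, j))
    return points

maximin = max(min(row) for row in payoffs)
minimax = min(max(col) for col in zip(*payoffs))
found = [(row_labels[i], col_labels[j]) for i, j in saddle_points(payoffs)]

print(maximin, minimax)  # 2 2
print(found)             # [(1, 'C')]
```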


Hi Mark,

I was surprised to learn that the Pareto distribution, used by John Nash to describe noncooperative games and equilibria, is related to the Dirac delta of engineering and physics.
Wikipedia on Pareto distribution has a graph and text demonstrating this relation.
The graph plots k=1,2,3,oo.
Infinity corresponds to the Dirac delta, continuous time, which in turn corresponds to the Kronecker delta, discrete time.
Numerous links are provided.
