6.8L transcribed by Satyavarta
Let the traffic (cars/sec) on road , and the delay on road . Hence, for the situation in the figure,
Let the total traffic between the cities be 6 cars/sec.
The expression for the delay on the two possible paths may be expressed as
These conditions in equilibrium condition dictate the following distribution of traffic
Suppose a wide road is built from to with constant delay .
From ,
the cost of is
the cost of is
This makes the path cheaper than for all possible values of .
Then, all cars go over . But this raises the average delay to 31+31=62. Thus if every individual acts greedy, the average delay for every individual may increase. The objective function that the society seeks to minimize is the total delay, so that the scheme is beneficial to everyone. Formally, this objective function is
minimize (total delay)
Every individual can be charged equivalent to damage caused to society. Then the system has a desirable equilibrium. In the example here, a toll could be levied on the car to increase the efective cost of , so that the equilibrium exists at minimal average delay.
The cost is dependent on the individual. We shall soon study more theories on this behaviour, the Utility Theory.
A two-player, zero-sum game occurs as follows,
Playing a mixed strategy means that the moves made by the players (in the second step above) are not made deterministically. Rather, each strategy is a probability distribution over rows (columns)
for probability of choosing row (column).
Since exactly one row (column) may be selected (``played''),
.
This is in contrast to a pure strategy, where for exactly one and for all others.
Hence, the following symbols may be defined for further analysis of this game.
A : Payoff matrix
p : mixed strategy for row player
q : mixed strategy for column player
The payoff with such a scheme is calculated as for row player ( for column player), when is selected by row player, and by the column player. The expected payoff for a player is thus the sum of payoffs multiplied by the probability of that payoff. Thus,
Expected payoff to row player =
Then, since we are considering a zero-sum game, the expected payoffs for the players are
row player
column player
Each player wants to maximize his profit, and since he does not know what strategy the other one is using, he wants to maximize his expected profit irrespective of what the opponent plays. Hence row player wants to maximize , and column player wants to minimize .
For any pair of strategies
Now if a saddle point strategy exists (so that saddle points exist in , where and are vectors of 's and 's), then by definition of saddle point,
This is also a Nash Equilibrium, since no player can increase his own profit beyond .
Assuming rational behaviour from the players so that they play the best best strategy available to them, the guaranteed payoffs to the players are
row player
column player
Property:
Proof: Using the fact that is a saddle point, we may write
Hence,
We will use this property to prove the following property.
Property: Iff the guaranteed payoffs of the two players are equal, a saddle point (and hence, Nash Equilibrium) exists.
is Saddle Point
Proof:
From the above property,
Hence, the proof.
A relevant question to ask in such a situation is ``Does a Saddle Point always exist?'' The Min-Max Theorem says, yes, it does.
Proof: The proof for this theorem is on the lines of the concept of duality in Linear Programming, wherein these two behave as duals of each other. (The proof I illustrate here is taken from the scribe notes by Tugkan Batu, ORIE630 Mathematical Programming Fall 1998, of lectures by Jon Kleinberg.)
We write two linear programs, one for each player with expected payoff being the objective function, and show that this pair of linear programs are actually duals of each other which will prove the theorem.
The row player's objective function is . Given some , the problem min( subject to ) looks infinite, because of the possibilities of . However, for a given , there is an optimal pure strategy. The reason for this is if column player knows row player's mixed strategy, the column player can choose the column to maximize the expected payoff (instead of getting a weighted average over all columns). So, now we can write row columns' objective function as
Now, we can write the row player's problem as a linear program as follows:
Similarly, the column player's problem is formulated as a linear program as follows:
Now, if we take to be all the 1's vector then we can write this pair of linear programs as follows:
max | min |
The generic primal/dual pair are described as
Primal | Dual |
min | max |
Thus, the linear programs for the row and column player strategies are instantiations of this generic form with the substitution {, , , , , , , }.
Modelling Chess
= set of moves black can make.
The moves of opponents are all a path along a tree of moves that can be constructed, with the board position at each node and all the possible next moves as children. The discussion on extensive form games spills over to the next lecture.