Lecture 3

Hence,

$a_{ij}=a_{mj}=a_{mn}=a_{in}.$

Also, $a_{in}$ is a saddle point since $a_{in}$

$a_{ij}$ is a minimum in its row i, and $a_{in}$

$a_{mn}$ is a maximum in its column

. Similarly, $a_{mj}$ is a saddle point. Definition: Let $a_{i}$

$\min_{j}\hspace{0.1cm}{a_{ij}}$ be the guaranteed payoff to row player if he chooses row

. Let

$u_{r}$

$\max_{i}$ $a_{i}$

$\max_{i}$ $\min_{j}\hspace{0.1cm} \bf {{a_{ij}}}$
$u_{c}$

$\min_{j}$ ${\max_{i}\hspace{0.1cm} \bf {{a_{ij}}}}$

Lemma: For any matrix A,

$u_{c}$ $\geq$ $u_{r}$ .

Proof: We have, $a_{ij} \geq \min_{k}$ $a_{ik}$ $\forall i,j$ . Hence $\max_{i}\hspace{0.2cm}a_{ij}$ $\geq$ $\max_{i}$ $\min_{k}$ $a_{ik}$ , $\forall j$ .
This implies $\max_{i}\hspace{0.2cm}a_{ij}$ $\geq$ $u_{r}$ , $\forall j$ . Therefore $\min_{j}$ $\max_{i}\hspace{0.2cm}a_{ij}$ $\geq$ $u_{r}$

$\Rightarrow$

$\geq$

The above lemma also holds for the case of mixed strategies. The proof is given later

Theorem: Matrix A has a saddle point $\iff$ .

Proof: ( $\Rightarrow$ ) Let $a_{ij}$ be a saddle point of A. By definition, $a_{ij}$

$\min_{l}$ $a_{il}$ .
Also, $u_r \geq$ $a_{ij}$ and $a_{ij}$

$max_{k}$ $a_{kj}$ . Hence, $u_c \leq a_{ij}.$
Combining these two, $u_c \leq a_{ij} \leq u_r$ . But from the previous lemma $u_r \leq u_c$ .
Hence

.
( $\Leftarrow$ ) Choose

s.t., $\min_{k}$ $a_{ik}$

.
Now choose

s.t. $a_{il}$

$\min_{p}$ $a_{ip}$

.
Since $a_{il}$ is the minimum in row

and $\exists$ column

such that $\max_{q}$ $a_{qj}$

Thus $a_{il}$

$\max_{q}$ $a_{qj}$ $\geq$ $a_{ij}$ . Since $a_{il}$ is a minimum in its row, $a_{il}$

$a_{ij}$ .
Thus, $a_{ij}$ is also a minimum its row.
$\Rightarrow$ $a_{ij} = a_{il} = \max_{q}a_{qj}$
which proves that $a_{ij}$ is a saddle point of A.

Minimax Theorem

Definition : Row value for a mixed strategy is defined as

$\max_{p}$ $\min_{q}$ $p^{T}Aq$

Similarly column value is defined as,

$\min_{q}$ $\max_{p}$ $p^{T}Aq$

In other words,

is the amount of payoff that the row player is guaranteed to win on the average, assuming that he plays rationally.

Lemma: For any matrix A, $v_{c}$ $\geq$ $v_{r}$ .

Proof : We observe that $\mathbf{p}^{T}\mathbf{A}\mathbf{q} \geq \min_{\mathbf{q}} \mathbf{p}^{T}\mathbf{A}\mathbf{q} \forall \mathbf{p,q}$ . Taking the maximum over all $\mathbf{p}$ on both sides, $\max_{\mathbf{p}}\: \mathbf{p}^{T}\mathbf{A}\mathbf{q} \geq \max_{\mathbf{p}} \min_{\mathbf{q}}\: \mathbf{p}^{T}\mathbf{A}\mathbf{q} \forall \mathbf{q}$ . The RHS is $v_{r}$ , thus the previous equation can be re-written as $\max_{\mathbf{p}} \mathbf{p}^{T}\mathbf{A}\mathbf{q} \geq v_{r}, \forall \mathbf{q}$ . Therefore $\min_{\mathbf{q}} \max_{\mathbf{p}} \mathbf{p}^{T}\mathbf{A}\mathbf{q} \geq v_{r}, \forall \mathbf{q}$ . This proves $v_{c} \geq v_{c}$ .

$\mathbf{Theorem}$ : $\mathbf{p*,q*}$ is a Nash Equilibrium iff $v_{c} = v_{r} = \mathbf{p*}^{T}\mathbf{A}\mathbf{q*}$ .

Proof : ( $\Rightarrow$ ) As $\mathbf{p*,q*}$ is a Nash Equilibrium of A, $\mathbf{p*}^{T}\mathbf{A}\mathbf{q*} = \min_{\mathbf{q}}\mathbf{p*}^{T}\mathbf{A}\mathbf{q}$ . Also, $v_{r} \geq \mathbf{p*}^{T}\mathbf{A}\mathbf{q*}$ and $\mathbf{p*}^{T}\mathbf{A}\mathbf{q*} = \max_{\mathbf{p}} \mathbf{p}^{T}\mathbf{A}\mathbf{q*}$ . Hence, $v_{c} \leq \mathbf{p*}^{T}\mathbf{A}\mathbf{q*}$ . Combining these two we get, $v_{r} \geq v_{c}$ . But from previous lemma $v_{c} \geq v_{r}$ . This proves $v_{c} = v_{r}$ .
( $\Leftarrow$ ) Choose $\mathbf{p*}$ s.t., $\min_{\mathbf{q}}\mathbf{p*}^{T}\mathbf{A}\mathbf{q} = v_{r}$ . Now choose $\mathbf{q'}$ s.t. $\mathbf{p*}^{T}\mathbf{A}\mathbf{q'} = \min_{\mathbf{q}} \mathbf{p'}^{T}\mathbf{A}\mathbf{q} = v_{c} = v_{r}$ . Since $\mathbf{p'}^{T}\mathbf{A}\mathbf{q'}$ is the minimum over all $\mathbf{p}$ and $\exists$ strategy $\mathbf{q*}$ s.t.
$\max_{\mathbf{p}} \mathbf{p}^{T}\mathbf{A}\mathbf{q*} = v_{c}$ .
Thus $\mathbf{p'}^{T}\mathbf{A}\mathbf{q'} = v_{c} = \max_{\mathbf{p}} \mathbf{p}^{T}\mathbf{A}\mathbf{q*} \geq \mathbf{p*}^{T}\mathbf{A}\mathbf{q*}$ . Since $\mathbf{p*}^{T}\mathbf{A}\mathbf{q'}$ is the minimum over all $\mathbf{p}$ , $\mathbf{p*}^{T}\mathbf{A}\mathbf{q'} = \mathbf{p*}^{T}\mathbf{A}\mathbf{q*}$ .
Thus $\mathbf{p*}^{T}\mathbf{A}\mathbf{q*}$ is also minimum over all $\mathbf{p}$ for the same $\mathbf{q}$ . $\Rightarrow \mathbf{p*}^{T}\mathbf{A}\mathbf{q*} = \mathbf{p*}^{T}\mathbf{A}\mathbf{q'} = \max_{\mathbf{p}} \mathbf{p}^{T}\mathbf{A}\mathbf{q*}$ . Thus $\mathbf{p*,q*}$ is a Nash Equilibrium.

$\mathbf{ [Von Neumanns' Minimax Theorem]}$ For any two person zero-sum game specified by matrix $\mathbf{A}$ , optimal mixed strategies exist for both players. Moreover the row and column values are equal. In other words,

$\displaystyle \max_{\mathbf{p}} \min_{\mathbf{q}}\: \mathbf{p}^{T}\mathbf{A}\... ... = \min_{\mathbf{q}} \max_{\mathbf{p}}\: \mathbf{p}^{T}\mathbf{A}\mathbf{q}$

(1)

. Also if

and

denote the optimal strategies for the row and the column player respectively, then

$v_r = v_c = p^{*T}Aq^*$
(,) is a Nash Equilibrium for this two player game.

The optimal strategy for row player will yield the same payoff as the optimal strategy for Column player! If either the row or the column player plays her optimal strategy, the opponent cannot improve the expected payoff. Thus once a player has publicly committed to play the optimal strategy, it is possible for the other player to play the game with a pure strategy and still receive the optimal expected payoff.

Proof of Minimax Theorem

This proof requires the duality theorem, a well known result in linear programming. A linear programming problem can be defined in terms of constraints $\mathbf{Ax} \geq \mathbf{b}$ and $\mathbf{x} \geq 0$ , and a cost vector $\mathbf{C}$ . The goal is to minimize the cost $\mathbf{C^{T}x}$ subject to the constraints and given a cost vector. This is called the primal problem. Associated with every primal problem is a dual problem stated as follows. The constraints now become $\mathbf{A^{T}y} \leq\mathbf{C}$ and $\mathbf{y} \geq 0$ , the new cost vector is $\mathbf{b}$ and the goal is to maximize $\mathbf{b^{T}y}$ .
$\mathbf{ [Duality Theorem]}$ If either problem (primal or dual) has a best vector (called $\mathbf{x^{*}}$ or $\mathbf{y^{*}}$ ), then so does the other. The minimum $\mathbf{C^{T}x^{*}}$ equals the maximum $\mathbf{y^{*T}b}$

$\displaystyle \mathbf{C^{T}x^{*}} = \mathbf{b^{T}y^{*}}$

(2)

In terms of

(the row player) and

(column player), we want to minimize (primal) $\mathbf{c^{T}p}$ (called $\overline{p^{*}}$ ) subject to the constraints $\mathbf{p^{T}A} \geq \mathbf{b}$ and $\mathbf{p} \geq0$ . We also want to maximize (dual) $\mathbf{b^{T}q}$ (called $\overline{q^{*}}$ ) subject to the constraints $\mathbf{Aq} \leq\mathbf{c}$ and $\mathbf{q} \geq0$ , where $\mathbf{c}$ and $\mathbf{b}$ are unit vectors. In light of this formulation and the duality theorem, we can state

$\displaystyle \mathbf{c^{T}\overline{p^{*}}} = \mathbf{b^{T}\overline{q^{*}}} = \theta$

(3)

Therefore, our probability distributions that correspond to our optimal vectors $\mathbf{p^{*}}$ and $\mathbf{q^{*}}$ are obtained by setting $\mathbf{p^{*}} = \mathbf{\overline{p^{*}}} / \theta$ and $\mathbf{q^{*}} = \mathbf{\overline{q^{*}}} / \theta$ .

[Proof of Von Neumanns' Minimax Theorem] Since $\mathbf{\overline{p^{*}}^{T}A} \geq\mathbf{b}$ , $\forall \mathbf{q}, \; \mathbf{\overline{p^{*}}^{T}Aq} \geq\mathbf{b^{T}q}$ . And since $\mathbf{b^{T}q} = 1$ , this implies that $\mathbf{p^{*T}Aq} \geq1 / \theta$ . This gives a lower bound on how much

is winning ( $1 / \theta$ ). Similarly, $\mathbf{\overline{p^{*}}^{T}Aq} \geq\mathbf{b^{T}q} = 1$ implies that $\mathbf{p^{T}Aq^{*}} \leq1 / \theta$ and that $1 / \theta$ is an upper bound on

's loss. Therefore,

$\displaystyle \mathbf{p^{*T}Mq^{*}}$ $\displaystyle =$ $\displaystyle \frac{1}{\theta}$
$\displaystyle \mathbf{p^{T}Mq^{*}}$ $\displaystyle \leq$ $\displaystyle \mathbf{p^{*T}Mq^{*}} = \frac{1}{\theta}$
$\displaystyle \mathbf{p^{*T}Mq^{*}}$ $\displaystyle \leq$ $\displaystyle \mathbf{p^{*T}Mq}$

	C	D
C	5,5	0,10
D	10,0	1,1

	C	M
C	10,5	0,0
M	0,0	5,10

	H	T
H	1,-1	-1,1
T	1,-1	1,-1

I $\backslash$ II	A	B	C	D
A	5,2	2,6	1,4	0,4
B	0,0	3,2	2,1	1,1
C	7,0	2,2	1,5	5,1
D	9,5	1,3	0,2	4,8

Lecture 3

Strategic Games

Choice of Strategies

Examples of Nash equilibrium in some games

Prisoner's Dilemma

Battle of the Sexes: Cricket or Movie

Matching Pennies

Iterated Deletion to compute Nash Equilibrium.

A general two player constant sum game

Saddle Points

Theorem: Matrix A has a saddle point $\iff$ .

Minimax Theorem

Proof of Minimax Theorem

C	5	0	$\leftarrow$ Dominates
D	10	1




		$a_{ij},b_{ij}$

Lecture 3

Theorem: Matrix A has a saddle point .

Theorem: Matrix A has a saddle point $\iff$ .