carlos hurtado - university of illinoishrtdmrt2/teaching/gt_2016_19/l3.pdf · carlos hurtado...
TRANSCRIPT
Rationalizable Strategies and Nash Equilibrium
Carlos Hurtado
Department of EconomicsUniversity of Illinois at Urbana-Champaign
Junel 9th, 2016
C. Hurtado (UIUC - Economics) Game Theory
On the Agenda
1 Formalizing the Game
2 Rationalizability
3 Exercises
4 Nash Equilibrium
5 Exercises
C. Hurtado (UIUC - Economics) Game Theory
Formalizing the Game
On the Agenda
1 Formalizing the Game
2 Rationalizability
3 Exercises
4 Nash Equilibrium
5 Exercises
C. Hurtado (UIUC - Economics) Game Theory
Formalizing the Game
Formalizing the Game
I Let me fix some Notation:
- set of players: I = {1, 2, · · · ,N}- set of actions: ∀i ∈ I, ai ∈ Ai , where each player i has a set of actions Ai .- strategies for each player: ∀i ∈ I, si ∈ Si , where each player i has a set of
pure strategies Si available to him. A strategy is a complete contingent planfor playing the game, which specifies a feasible action of a player’sinformation sets in the game.
- profile of pure strategies: s = (s1, s2, · · · , sN) ∈∏N
i=1 Si = S.Note: let s−i = (s1, s2, · · · , si−1, si+1, · · · , sN) ∈ S−i , we will denotes = (si , s−i ) ∈ (Si , S−i ) = S.
- Payoff function: ui :∏N
i=1 Si → R, denoted by ui (si , s−i )- A mixed strategy for player i is a function σi : Si → [0, 1], which assigns a
probability σi (si ) ≥ 0 to each pure strategy si ∈ Si , satisfying∑si∈Si
σi (si ) = 1.
C. Hurtado (UIUC - Economics) Game Theory 1 / 16
Formalizing the Game
Formalizing the Game
I Notice now that even if there is no role for nature in a game, when players use(nondegenerate) mixed strategies, this induces a probability distribution overterminal nodes of the game.
I But we can easily extend payoffs again to define payoffs over a profile of mixedstrategies as follows:
ui (σ1, · · · , σN) =∑s∈S
[σ1(s1) · · ·σN(sN)] ui (s1, · · · , sN)
ui (σi , σ−i ) =∑si∈Si
∑s−i∈S−i
[∏j 6=i
σj (sj )
]σi (si )ui (si , s−i )
I For the above formula to make sense, it is critical that each player is randomizingindependently. That is, each player is independently tossing her own die to decideon which pure strategy to play.
C. Hurtado (UIUC - Economics) Game Theory 2 / 16
Formalizing the Game
Formalizing the Game
I If si is a strictly dominant strategy for player i , then for all σi ∈ ∆(Si ), σi 6= si ,and all σ−i ∈ ∆(S−i ), ui (si , σ−i ) > ui (σi , σ−i ).
I Let σi ∈ ∆(Si ), with σi 6= si , and let σ−i ∈ ∆(S−i ). Then,
ui (si , σ−i ) =∑
s−i∈S−i
[∏j 6=i
σj (sj )
]ui (si , s−i )
and
ui (σi , σ−i ) =∑s̃i∈Si
∑s−i∈S−i
[∏j 6=i
σj (sj )
]σi (s̃i )ui (s̃i , s−i )
Then, ui (si , σ−i )− ui (σi , σ−i ) is
∑s−i∈S−i
(∏j 6=i
σj (sj )
)[ui (si , s−i )−
∑s̃i∈Si
σi (s̃i )ui (s̃i , s−i )
]
C. Hurtado (UIUC - Economics) Game Theory 3 / 16
Formalizing the Game
Formalizing the Game
I If si is a strictly dominant strategy for player i , then for all σi ∈ ∆(Si ), σi 6= si ,and all σ−i ∈ ∆(S−i ), ui (si , σ−i ) > ui (σi , σ−i ).
I Let σi ∈ ∆(Si ), with σi 6= si , and let σ−i ∈ ∆(S−i ). Then,
ui (si , σ−i ) =∑
s−i∈S−i
[∏j 6=i
σj (sj )
]ui (si , s−i )
and
ui (σi , σ−i ) =∑s̃i∈Si
∑s−i∈S−i
[∏j 6=i
σj (sj )
]σi (s̃i )ui (s̃i , s−i )
Then, ui (si , σ−i )− ui (σi , σ−i ) is
∑s−i∈S−i
(∏j 6=i
σj (sj )
)[ui (si , s−i )−
∑s̃i∈Si
σi (s̃i )ui (s̃i , s−i )
]
C. Hurtado (UIUC - Economics) Game Theory 3 / 16
Formalizing the Game
Formalizing the Game
I ui (si , σ−i )− ui (σi , σ−i ) is
∑s−i∈S−i
(∏j 6=i
σj (sj )
)[ui (si , s−i )−
∑s̃i∈Si
σi (s̃i )ui (s̃i , s−i )
]
I Since si is strictly dominant, ui (si , s−i ) > ui (s̃i , s−i ) for all s̃i 6= si and all s−i .
I Hence, ui (si , s−i ) >∑s̃i∈Si
σi (s̃i )ui (s̃i , s−i ) for any σi ∈ ∆(Si ) such that σi 6= si
(why?).
I This implies the desired inequality: ui (si , σ−i )− ui (σi , σ−i ) > 0
C. Hurtado (UIUC - Economics) Game Theory 4 / 16
Formalizing the Game
Formalizing the Game
I We learned that: If si is a strictly dominant strategy for player i , then for allσi ∈ ∆(Si ), σi 6= si , and all σ−i ∈ ∆(S−i ), ui (si , σ−i ) > ui (σi , σ−i ).
I Exercise 1. Show that there can be no strategy σi ∈ ∆(Si ) such that for all si ∈ Siand s−i ∈ S−i , ui (σi , s−i ) > ui (si , s−i ).
I The preceding Theorem and Exercise show that there is absolutely no loss inrestricting attention to pure strategies for all players when looking for strictlydominant strategies.
C. Hurtado (UIUC - Economics) Game Theory 5 / 16
Rationalizability
On the Agenda
1 Formalizing the Game
2 Rationalizability
3 Exercises
4 Nash Equilibrium
5 Exercises
C. Hurtado (UIUC - Economics) Game Theory
Rationalizability
Rationalizability
I
l rL 4,-4 9,-9M 6,-6 6,-6R 9,-9 4,-4
I Penalty Kick Game is one of the most important games in the world.
I This game has no dominant strategies
I We need refinements to solve more games.
C. Hurtado (UIUC - Economics) Game Theory 6 / 16
Rationalizability
Rationalizability
I
l rL 4,-4 9,-9M 6,-6 6,-6R 9,-9 4,-4
I Penalty Kick Game is one of the most important games in the world.
I This game has no dominant strategies
I We need refinements to solve more games.
C. Hurtado (UIUC - Economics) Game Theory 6 / 16
Rationalizability
Rationalizability
I
I Do not shoot to the middleI Do not use a strategy that is never a best response
C. Hurtado (UIUC - Economics) Game Theory 7 / 16
Rationalizability
Rationalizability
DefinitionA strategy σi ∈ ∆(Si ) is a best response to the strategy profile σ−i ∈ ∆(S−i ) ifu(σi , σ−i ) ≥ u(σ̃i , σ−i ) for all σ̃i ∈ ∆(Si ). A strategy σi ∈ ∆(Si ) is never a bestresponse if there is no σ−i ∈ ∆(S−i ) for which σi is a best response.
I The idea is that a strategy, σi , is a best response if there is some strategy profile ofthe opponents for which σi does at least as well as any other strategy.
I Conversely, σi is never a best response if for every strategy profile of theopponents, there is some strategy that does strictly better than σi .
I Clearly, in any game, a strategy that is strictly dominated is never a best response.
I Exercise 2. Prove that in 2-player games, a pure strategy is never a best responseif and only if it is strictly dominated.
C. Hurtado (UIUC - Economics) Game Theory 8 / 16
Rationalizability
Rationalizability
I In games with more than 2 players, there may be strategies that are not strictlydominated that are nonetheless never best responses.
I As before, it is a consequence of ”rationality” that a player should not play astrategy that is never a best response. That is, we can delete strategies that arenever best responses.
I By iterating on the knowledge of rationality, we iteratively delete strategies thatare never best responses.
I The set of strategies for a player that survives this iterated deletion of never bestresponses is called her set of rationalizable strategies.
I The rationalizable actions can be computed as follows:1 Start with the full action set for each player.2 Remove actions which are never a best responses to any belief about the
opponents’ actions.3 Repeat process with the opponents’ remaining actions until no further
actions are eliminated.4 In this process leaves a non-empty set of actions for each player those are the
rationalizable actions.
C. Hurtado (UIUC - Economics) Game Theory 9 / 16
Rationalizability
Rationalizability
I In games with more than 2 players, there may be strategies that are not strictlydominated that are nonetheless never best responses.
I As before, it is a consequence of ”rationality” that a player should not play astrategy that is never a best response. That is, we can delete strategies that arenever best responses.
I By iterating on the knowledge of rationality, we iteratively delete strategies thatare never best responses.
I The set of strategies for a player that survives this iterated deletion of never bestresponses is called her set of rationalizable strategies.
I The rationalizable actions can be computed as follows:1 Start with the full action set for each player.2 Remove actions which are never a best responses to any belief about the
opponents’ actions.3 Repeat process with the opponents’ remaining actions until no further
actions are eliminated.4 In this process leaves a non-empty set of actions for each player those are the
rationalizable actions.
C. Hurtado (UIUC - Economics) Game Theory 9 / 16
Rationalizability
Rationalizability
DefinitionI σi ∈ ∆(Si ) is a 1-rationalizable strategy for player i if it is a best response to some
strategy profile σ−i ∈ ∆(S−i ).I σi ∈ ∆(Si ) is a k-rationalizable strategy (k ≥ 2) for player i if it is a best response
to some strategy profile σ−i ∈ ∆(S−i ) such that each σj is (k − 1)-rationalizablefor player j 6= i .
I σi ∈ ∆(Si ) is a rationalizable for player i if it is k-rationalizable for all k ≥ 1.
C. Hurtado (UIUC - Economics) Game Theory 10 / 16
Rationalizability
Rationalizability
I Note that the set of rationalizable strategies can no be larger that the set ofstrategies surviving iterative removal of strictly dominated strategies.
I This follows from the earlier comment that a strictly dominated strategy is never abest response.
I In this sense, rationalizability is (weakly) more restrictive than iterated deletion ofstrictly dominated strategies.
I It turns out that in 2-player games, the two concepts coincide. In n-player games(n > 2), they don’t have to.
I Strategies that remain after iterative elimination of strategies that are never bestresponses: those that a rational player can justify, or rationalize, with somereasonable conjecture concerning the behavior of his rivals (reasonable in the sensethat his opponents are not presumed to play strategies that are never bestresponses, etc.).
I ”Rationalizable” intuitively means that there is a plausible explanation that wouldjustify the use of the strategy.
C. Hurtado (UIUC - Economics) Game Theory 11 / 16
Exercises
On the Agenda
1 Formalizing the Game
2 Rationalizability
3 Exercises
4 Nash Equilibrium
5 Exercises
C. Hurtado (UIUC - Economics) Game Theory
Exercises
Exercises
I Exercise 1. Show that there can be no strategy σi ∈ ∆(Si ) such that for all si ∈ Siand s−i ∈ S−i , ui (σi , s−i ) > ui (si , s−i ).
I Exercise 2. Prove that in 2-player games, a pure strategy is never a best responseif and only if it is strictly dominated.
I Determine the set of rationalizable pure strategies for the following game:
I
1/2 b1 b2 b3 b4
a1 0, 7 2, 5 7, 0 0, 1a2 5, 2 3, 3 5, 2 0, 1a3 7, 0 2, 5 0, 7 0, 1a4 0, 0 0,-2 0, 0 10,-1
C. Hurtado (UIUC - Economics) Game Theory 12 / 16
Nash Equilibrium
On the Agenda
1 Formalizing the Game
2 Rationalizability
3 Exercises
4 Nash Equilibrium
5 Exercises
C. Hurtado (UIUC - Economics) Game Theory
Nash Equilibrium
Nash Equilibrium
I Now we turn to the most well-known solution concept in game theory. We’ll firstdiscuss pure strategy Nash equilibrium (PSNE), and then later extend to mixedstrategies.
DefinitionA strategy profile s = (s1, ..., sN) ∈ S is a Pure Strategy Nash Equilibrium (PSNE) if forall i and s̃i ∈ Si , u(si , s−i ) ≥ u(s̃i , s−i ).
I In a Nash equilibrium, each player’s strategy must be a best response to thosestrategies of his opponents that are components of the equilibrium.
I Remark: Every finite game of perfect information has a pure strategy Nashequilibrium.
C. Hurtado (UIUC - Economics) Game Theory 13 / 16
Nash Equilibrium
Nash Equilibrium
I Unlike with our earlier solution concepts (dominance and rationalizability), Nashequilibrium applies to a profile of strategies rather than any individual’s strategy.When people say ”Nash equilibrium strategy”, what they mean is ”a strategy thatis part of a Nash equilibrium profile”.
I The term equilibrium is used because it connotes that if a player knew that hisopponents were playing the prescribed strategies, then she is playing optimally byfollowing her prescribed strategy. In a sense, this is like a ”rational expectations”equilibrium, in that in a Nash equilibrium, a player’s beliefs about what hisopponents will do get confirmed (where the beliefs are precisely the opponents’prescribed strategies).
I Rationalizability only requires a player play optimally with respect to some”reasonable” conjecture about the opponents’ play, where ”reasonable” means thatthe conjectured play of the rivals can also be justified in this way. On the otherhand, Nash requires that a player play optimally with respect to what hisopponents are actually playing. That is to say, the conjecture she holds about heropponents’ play is correct.
C. Hurtado (UIUC - Economics) Game Theory 14 / 16
Nash Equilibrium
Nash Equilibrium
I The above point makes clear that Nash equilibrium is not simply a consequence of(common knowledge of) rationality and the structure of the game. Clearly, eachplayer’s strategy in a Nash equilibrium profile is rationalizable, but lots ofrationalizable profiles are not Nash equilibria.
C. Hurtado (UIUC - Economics) Game Theory 15 / 16
Exercises
On the Agenda
1 Formalizing the Game
2 Rationalizability
3 Exercises
4 Nash Equilibrium
5 Exercises
C. Hurtado (UIUC - Economics) Game Theory