1 chapter 4: minimax equilibrium in zero sum game scit1003 chapter 4: minimax equilibrium in zero...
TRANSCRIPT
1
SCIT1003Chapter 4: Minimax Equilibrium Chapter 4: Minimax Equilibrium
in Zero Sum Gamein Zero Sum Game
Prof. Tsang
Maximin & Minimax Equilibrium in a zero-sum game
• Minimax - minimizing the maximum loss (loss-ceiling, defensive)
• Maximin - maximizing the minimum gain (gain-floor, offensive)
• Minimax = Maximin
2
The Minimax Theorem
3
“Every finite, two-person, zero-sum game has a rational solution in the form of a pure or mixed strategy.”
John Von Neumann, 1926
For every two-person, zero-sum game with finite strategies, there exists a value V and a mixed strategy for each player, such that (a) Given player 2's strategy, the best payoff possible for player 1 is V, and (b) Given player 1's strategy, the best payoff possible for player 2 is −V.
Pure strategy game: Saddle pointSaddle point
4
A zero-sum game with a saddle point.
1
3
4 3
Is this a Nash Equilibrium?
MaxiMin
MiniMax
Pure & mixed strategiesPure & mixed strategies
5
A pure strategy provides a complete definition of how a player will play a game. It determines the move a player will make for any situation they could face.
A mixed strategy is an assignment of a probability to each pure strategy. This allows for a player to randomly select a pure strategy.In a pure strategy a player chooses an action for sure, whereas in a mixed strategy, he chooses a probability distribution over the set of actions available to him.
All you need to know about All you need to know about Probability Probability
6
If E is an outcome of action, then P(E) denotes the probability that E will occur, with the following properties:
1. 0 P(E) 1 such that:If E can never occur, then P(E) = 0If E is certain to occur, then P(E) = 1
2. The probabilities of all the possible outcomes must sum to 1
Mixed strategy
• In some zero-sum game, there is no pure strategy solution (no Saddle point)
• Play’s best way to win is mixing all possible moves together in a random (unpredictable) fashion.
• E.g. Rock-Paper-Scissors
7
Mixed strategiesMixed strategies
8
Some games, such as Rock-Paper-Scissors, do not have a pure strategy equilibrium. In this game, if Player 1 chooses R, Player 2 should choose p, but if Player 2 chooses p, Player 1 should choose S. This continues with Player 2 choosing r in response to the choice S by Player 1, and so forth.
In games like Rock-Paper-Scissors, a player will want to randomize over several actions, e.g. he/she can choose R, P & S in equal probabilities.
A soccer penalty shot at 12-yardleft or right?
9
Left Right
Left 42 558 95
Right 7 3093 70
Goalie
Kicker
p.145 payoffs are winning probability
10
A penalty shot at 12-yardleft or right?
If you are the kicker, which side you use?The best chance you have is 95%. So you kick left.But the goalie anticipates that because he knows that’s your best chance. So his anticipation reduces your chance to 58%.What if you anticipate that he anticipates … so you kick right & that increase your chance to 93%.What if he anticipates that you anticipate that he anticipates …If you use a pure strategy, he always has a way to reduce you chance to win.
• To end this circular reasoning, you do something that the goalie cannot anticipate.
• What if you mix the 2 choices randomly with 50-50 chance?
• Your chance of winning is(58+93)/2 if the goalie moves to left(93+70)/2 if the goalie moves to right
Is this better?
11
A penalty shot at 12-yardleft or right?
12p.166 graphical solution
Kicker’s mixture
13p.168 graphical solution
Goalie’s mixture
14
If the goalie improves his skill at saving kicks to the Right side
A Parking meter game (p.164)
15
If you pay for the parking, it cause you $1.
If you don’t pay for the parking and you are caught by the enforcer, the penalty is $50.
Should you take the risk of not paying for the parking?How often the enforcer should patrol to keep the car drivers honest (to pay the parking fee)?
Parking meter game
16
Pay Not pay
Enforce -1 -501 50
Not enforce
-1 01 0
Car driver
Enforcer
p.164
17
nono
No Nash equilibrium for pure strategy
x y 1-x-y
x=probability to take action R
y=probability to take action S
1-x-y=probability to take action P
Mixed strategiesMixed strategies
18
They have to be equal if expected payoff independent of action of player 2
19
Janken step game (Japanese RSP)
p.171
Two-Person, Zero-Sum Games: Summary
• Represent outcomes as payoffs to row player• Find any dominating equilibrium• Evaluate row minima and column maxima• If maximin=minimax, players adopt pure strategy
corresponding to saddle point; choices are in stable equilibrium -- secrecy not required
• If maximin minimax, find optimal mixed strategy; secrecy essential
20
Summary: Ch. 4
• Look for any equilibrium• Dominating Equilibrium• Minimax Equilibrium• Nash Equilibrium
21
Assignment 4.1
22
23
Assignment 4.1