1 chapter 4: minimax equilibrium in zero sum game scit1003 chapter 4: minimax equilibrium in zero...

23
1 SCIT1003 Chapter 4: Minimax Chapter 4: Minimax Equilibrium Equilibrium in Zero Sum Game in Zero Sum Game Prof. Tsang

Upload: leslie-holdridge

Post on 15-Dec-2015

229 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

1

SCIT1003Chapter 4: Minimax Equilibrium Chapter 4: Minimax Equilibrium

in Zero Sum Gamein Zero Sum Game

Prof. Tsang

Page 2: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

Maximin & Minimax Equilibrium in a zero-sum game

• Minimax - minimizing the maximum loss (loss-ceiling, defensive)

• Maximin - maximizing the minimum gain (gain-floor, offensive)

• Minimax = Maximin

2

Page 3: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

The Minimax Theorem

3

“Every finite, two-person, zero-sum game has a rational solution in the form of a pure or mixed strategy.”

John Von Neumann, 1926

For every two-person, zero-sum game with finite strategies, there exists a value V and a mixed strategy for each player, such that (a) Given player 2's strategy, the best payoff possible for player 1 is V, and (b) Given player 1's strategy, the best payoff possible for player 2 is −V.

Page 4: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

Pure strategy game: Saddle pointSaddle point

4

A zero-sum game with a saddle point.

1

3

4 3

Is this a Nash Equilibrium?

MaxiMin

MiniMax

Page 5: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

Pure & mixed strategiesPure & mixed strategies

5

A pure strategy provides a complete definition of how a player will play a game. It determines the move a player will make for any situation they could face.

A mixed strategy is an assignment of a probability to each pure strategy. This allows for a player to randomly select a pure strategy.In a pure strategy a player chooses an action for sure, whereas in a mixed strategy, he chooses a probability distribution over the set of actions available to him.

Page 6: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

All you need to know about All you need to know about Probability Probability

6

If E is an outcome of action, then P(E) denotes the probability that E will occur, with the following properties:

1. 0 P(E) 1 such that:If E can never occur, then P(E) = 0If E is certain to occur, then P(E) = 1

2. The probabilities of all the possible outcomes must sum to 1

Page 7: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

Mixed strategy

• In some zero-sum game, there is no pure strategy solution (no Saddle point)

• Play’s best way to win is mixing all possible moves together in a random (unpredictable) fashion.

• E.g. Rock-Paper-Scissors

7

Page 8: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

Mixed strategiesMixed strategies

8

Some games, such as Rock-Paper-Scissors, do not have a pure strategy equilibrium. In this game, if Player 1 chooses R, Player 2 should choose p, but if Player 2 chooses p, Player 1 should choose S. This continues with Player 2 choosing r in response to the choice S by Player 1, and so forth.

In games like Rock-Paper-Scissors, a player will want to randomize over several actions, e.g. he/she can choose R, P & S in equal probabilities.

Page 9: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

A soccer penalty shot at 12-yardleft or right?

9

Left Right

Left 42 558 95

Right 7 3093 70

Goalie

Kicker

p.145 payoffs are winning probability

Page 10: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

10

A penalty shot at 12-yardleft or right?

If you are the kicker, which side you use?The best chance you have is 95%. So you kick left.But the goalie anticipates that because he knows that’s your best chance. So his anticipation reduces your chance to 58%.What if you anticipate that he anticipates … so you kick right & that increase your chance to 93%.What if he anticipates that you anticipate that he anticipates …If you use a pure strategy, he always has a way to reduce you chance to win.

Page 11: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

• To end this circular reasoning, you do something that the goalie cannot anticipate.

• What if you mix the 2 choices randomly with 50-50 chance?

• Your chance of winning is(58+93)/2 if the goalie moves to left(93+70)/2 if the goalie moves to right

Is this better?

11

A penalty shot at 12-yardleft or right?

Page 12: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

12p.166 graphical solution

Kicker’s mixture

Page 13: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

13p.168 graphical solution

Goalie’s mixture

Page 14: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

14

If the goalie improves his skill at saving kicks to the Right side

Page 15: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

A Parking meter game (p.164)

15

If you pay for the parking, it cause you $1.

If you don’t pay for the parking and you are caught by the enforcer, the penalty is $50.

Should you take the risk of not paying for the parking?How often the enforcer should patrol to keep the car drivers honest (to pay the parking fee)?

Page 16: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

Parking meter game

16

Pay Not pay

Enforce -1 -501 50

Not enforce

-1 01 0

Car driver

Enforcer

p.164

Page 17: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

17

nono

No Nash equilibrium for pure strategy

x y 1-x-y

x=probability to take action R

y=probability to take action S

1-x-y=probability to take action P

Mixed strategiesMixed strategies

Page 18: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

18

They have to be equal if expected payoff independent of action of player 2

Page 19: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

19

Janken step game (Japanese RSP)

p.171

Page 20: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

Two-Person, Zero-Sum Games: Summary

• Represent outcomes as payoffs to row player• Find any dominating equilibrium• Evaluate row minima and column maxima• If maximin=minimax, players adopt pure strategy

corresponding to saddle point; choices are in stable equilibrium -- secrecy not required

• If maximin minimax, find optimal mixed strategy; secrecy essential

20

Page 21: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

Summary: Ch. 4

• Look for any equilibrium• Dominating Equilibrium• Minimax Equilibrium• Nash Equilibrium

21

Page 22: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

Assignment 4.1

22

Page 23: 1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang

23

Assignment 4.1