finite iterated prisoner’s dilemma revisited: belief change and end game effect jiawei li...

30
Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

Upload: moses-hodge

Post on 03-Jan-2016

218 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

Finite Iterated Prisoner’s Dilemma Revisited: Belief

Change and End Game Effect

Jiawei Li (Michael) & Graham KendallUniversity of Nottingham

Page 2: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

2

Outline

• Iterated prisoner’s dilemma (IPD)• Probability vs. uncertainty• A new model of IPD• End game effect• Conclusions

Jubilee Campus, University of Nottingham

Page 3: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

3

Iterated Prisoner’s Dilemma (IPD)

• Prisoner’s DilemmaTwo suspects are arrested by the police. They are separated and offered the same deal. If one testifies (defects from the other) for the prosecution against the other and the other remains silent (cooperates with the other), the betrayer goes free and the silent accomplice receives the full 10-year sentence. If both remain silent, both prisoners are sentenced to only six months in jail for a minor charge. If each betrays the other, each receives a five-year sentence. Each prisoner must choose to betray the other or to remain silent.

Page 4: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

4

Iterated Prisoner’s Dilemma (IPD)

Nash equilibrium: (Defect, Defect)

Page 5: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

5

Iterated Prisoner’s Dilemma (IPD)

• Finite IPD– n-stages;– n is known;– End game effect;– Backward induction;– Nash equilibrium;

Page 6: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

6

Iterated Prisoner’s Dilemma (IPD)

• Finite IPD experiments

Each of 15 pairs of subjects plays 22-round IPD. Average cooperation rates are 44.2% in known and 55.0% in unknown. (from Andreoni and Miller (1993))

Page 7: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

7

Iterated Prisoner’s Dilemma (IPD)

• Incomplete information (Kreps et al (1982))– ‘Tit for Tat (TFT)’ type or ‘Always defect (AllD)’ type;– Assign probability to

TFT type and 1- to AllD type;– Mutual cooperation can be

sequential equilibrium;– End game effect;

Page 8: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

8

Iterated Prisoner’s Dilemma (IPD)

• A new model

– Assumption that both players may be either AllD or TFT type;

– Repeated game with uncertainty;

– Changeable beliefs;

Page 9: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

9

Probability vs. uncertainty

• Probability: Tossing a coin;

50% -- 50%

• Uncertainty: Tossing a what?50% -- 50%?

Page 10: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

10

Probability vs. uncertainty

• Probability– Tossing a coin repeatedly;– 50% -- 50%

• Uncertainty– Tossing the dice repeatedly;– Expected probability may

change according to the outcome of past playing.

Page 11: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

11

Probability vs. uncertainty

• Repeated games with uncertainty– Bayes theorem:

P(Head) = 0.5;p(Head|1Head) = 0.683;p(Head|1Tail) = 0.317;p(Head|2Head) = 0.888;p(Head|2Head+1Tail) = 0.565;......

Page 12: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

12

A new model of IPD

• Let ROW and COL denote the players in a N-stage IPD game.

where R, S, T, and P denote, Reward for mutual cooperation, Sucker’s payoff, Temptation to defect, and Punishment for mutual defection, and

Page 13: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

13

A new model of IPD

• We refer to COL’s belief (Row’s type) by

that denote the probabilities that, if ROW is a TFT player, ROW

chooses to cooperate at the 1,…,n stage. Similarly, we define

ROW’s belief about COL’s type by that

denotes the probabilities that COL chooses to cooperate at

each stage if COL is a TFT player. {δi} and {θi} represent the

beliefs of player COL and ROW respectively.

Page 14: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

14

A new model of IPD

Two Assumptions:

• A1:

• A2: If either player chooses to defect at stage i, there will be

Page 15: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

15

A new model of IPD

Page 16: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

16

A new model of IPD

Page 17: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

17

A new model of IPD

Page 18: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

18

A new model of IPD

(D, D) is always Nash equilibrium at stage i; but it is not necessarily the only Nash equilibrium. When (2) is satisfied, both (C, C) and (D, D) are Nash equilibrium. (2) is a necessary and sufficient condition.

Page 19: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

19

A new model of IPD

A sufficient condition for (C,C) to be Nash equilibrium:

This condition denotes a depth-one induction, that is, if both players are likely to cooperate at both the current stage and the next stage, it is worth each player choosing to cooperate at the current stage. For example, when T=5, R=3, P=1, and S=0, the condition for (C,C) to be Nash equilibrium is,

Page 20: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

20

End game effect

• ?

• Unexpected hanging paradox.– A condemned prisoner;– Will be hanged on one weekday

in the following week;– Will be a surprise;

Page 21: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

21

End game effect

Page 22: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

22

End game effect

P4 = 1

P5 = 1

P3 = 1

P2 = 1

P1 = 1

Page 23: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

23

End game effect

Page 24: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

24

End game effect

• Process of belief change

P1 = 1/5P2 = 1/5P3 = 1/5P4 = 1/5P5 = 1/5

P1 = 0P2 = 1/4P3 = 1/4P4 = 1/4P5 = 1/4

P1 = 0P2 = 0P3 = 1/3P4 = 1/3P5 = 1/3

P1 = 0...P4 = 1/2P5 = 1/2

P1 = 0...P5 = 1

Page 25: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

25

End game effect

• Backward induction is not suitable for repeated games with uncertainty because uncertainty can be decreased during the process of games.

• End game effect has limited influence on the players’ strategies since it cannot be backward inducted.

Page 26: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

26

End game effect

• Why does the rate of cooperation in finite IPD experiments decrease as the game goes toward the end?

Page 27: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

27

Conclusions

• 1. We develop a new model for finite IPD that takes into consideration belief change.

• 2. Under the new model, the conditions of mutual cooperation are deduced. The result shows that, if the conditions are satisfied, both mutual cooperation and mutual defection are Nash equilibrium. Otherwise, mutual defection is the unique Nash equilibrium.

Page 28: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

28

Conclusions

• 3. This model could also deal with indefinite IPD and infinite IPD.

• 4. The outcome of this model conforms to experimental results.

Page 29: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

29

Conclusions

• 5. Backward induction is not suitable for repeated games with uncertainty when the beliefs of the players are changeable.

• 6. This model has the potential to apply to other repeated games of incomplete information.

Page 30: Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham

30

Thank you.