TY - GEN
T1 - Repeated triangular trade
T2 - 21st International Conference on Principles and Practice of Multi-Agent Systems, PRIMA 2018
AU - Shigedomi, Kota
AU - Sekiguchi, Tadashi
AU - Iwasaki, Atsushi
AU - Yokoo, Makoto
N1 - Funding Information:
This work was partially supported by JSPS KAKENHI (Grant Number 16KK0003, 17H00761, and 17H01787) and JST, Strategic International Collaborative Research Program, SICORP.
Funding Information:
Acknowledgements. This work was partially supported by JSPS KAKENHI (Grant Number 16KK0003, 17H00761, and 17H01787) and JST, Strategic International Collaborative Research Program, SICORP.
Publisher Copyright:
© Springer Nature Switzerland AG 2018.
PY - 2018
Y1 - 2018
N2 - We introduce a new fundamental problem called triangular trade, which is a natural extension of the well-studied prisoner’s dilemma for three (or more) players where a player cannot directly punish a seemingly defecting player. More specifically, this problem deals with a situation where the power/influence of players is one-way, players would be better off if they maintain circular cooperation, but each player has an incentive to defect. We analyze whether players can sustain such circular cooperation when they repeatedly play this game and each player observes the actions of another player with some observation errors (imperfect private monitoring). We confirm that no simple strategy can constitute an equilibrium within any reasonable parameter settings when there are only two actions: “Cooperate” and “Defect.” Thus, we introduce two additional actions: “Whistle” and “Punish,” which can be considered as a slight modification of “Cooperate.” Then, players can achieve sustainable cooperation using a simple strategy called Remote Punishment strategy (RP), which constitutes an equilibrium for a wide range of parameters. Furthermore, we show the payoff obtained by a variant of RP is optimal within a very general class of strategies that covers virtually all meaningful strategies.
AB - We introduce a new fundamental problem called triangular trade, which is a natural extension of the well-studied prisoner’s dilemma for three (or more) players where a player cannot directly punish a seemingly defecting player. More specifically, this problem deals with a situation where the power/influence of players is one-way, players would be better off if they maintain circular cooperation, but each player has an incentive to defect. We analyze whether players can sustain such circular cooperation when they repeatedly play this game and each player observes the actions of another player with some observation errors (imperfect private monitoring). We confirm that no simple strategy can constitute an equilibrium within any reasonable parameter settings when there are only two actions: “Cooperate” and “Defect.” Thus, we introduce two additional actions: “Whistle” and “Punish,” which can be considered as a slight modification of “Cooperate.” Then, players can achieve sustainable cooperation using a simple strategy called Remote Punishment strategy (RP), which constitutes an equilibrium for a wide range of parameters. Furthermore, we show the payoff obtained by a variant of RP is optimal within a very general class of strategies that covers virtually all meaningful strategies.
UR - http://www.scopus.com/inward/record.url?scp=85056460062&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85056460062&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-03098-8_15
DO - 10.1007/978-3-030-03098-8_15
M3 - Conference contribution
AN - SCOPUS:85056460062
SN - 9783030030971
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 242
EP - 257
BT - PRIMA 2018
A2 - Oren, Nir
A2 - Sakurai, Yuko
A2 - Noda, Itsuki
A2 - Cao Son, Tran
A2 - Miller, Tim
A2 - Savarimuthu, Bastin Tony
PB - Springer Verlag
Y2 - 29 October 2018 through 2 November 2018
ER -