TY - JOUR
T1 - On dynamic patrolling security games
AU - Kira, Akifumi
AU - Kamiyama, Naoyuki
AU - Anai, Hirokazu
AU - Iwashita, Hiroaki
AU - Ohori, Kotaro
N1 - Funding Information:
The authors would like to thank the anonymous referees for helpful comments and suggestions on this manuscript. Akifumi Kira was supported in part by JSPS KAKENHI Grant Numbers 26730010 and 17K12644, Japan. Naoyuki Kamiyama was supported by JST, PRESTO Grant Number JPMJPR14E1, Japan.
Publisher Copyright:
© The Operations Research Society of Japan
Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.
PY - 2019
Y1 - 2019
N2 - We consider Stackelberg patrolling security games in which a security guard and an intruder walk around a facility. In these games, at each timepoint, the guard earns a reward (intruder incurs a cost) depending on their locations at that time. The objective of the guard (resp., the intruder) is to patrol (intrude) the facility so that the total sum of rewards is maximized (minimized). We study three cases: In Case 1, the guard chooses a scheduled route first and then the intruder chooses a scheduled route after perfectly observing the guard’s choice. In Case 2, the guard randomizes her scheduled routes and then intruder observes its probability distribution and also randomize his scheduled routes. In Case 3, the guard randomizes her scheduled routes as well, but the intruder sequentially observes the location of the guard and reroutes to reach one of his targets. We show that the intruder’s best response problem in Cases 1 and 2 and Case 3 can be formulated as a shortest path problem and a Markov decision process, respectively. Moreover, the equilibrium problem in each case reduces to a polynomial-sized mixed integer linear programming, linear programming, and bilinear programming problem, respectively.
AB - We consider Stackelberg patrolling security games in which a security guard and an intruder walk around a facility. In these games, at each timepoint, the guard earns a reward (intruder incurs a cost) depending on their locations at that time. The objective of the guard (resp., the intruder) is to patrol (intrude) the facility so that the total sum of rewards is maximized (minimized). We study three cases: In Case 1, the guard chooses a scheduled route first and then the intruder chooses a scheduled route after perfectly observing the guard’s choice. In Case 2, the guard randomizes her scheduled routes and then intruder observes its probability distribution and also randomize his scheduled routes. In Case 3, the guard randomizes her scheduled routes as well, but the intruder sequentially observes the location of the guard and reroutes to reach one of his targets. We show that the intruder’s best response problem in Cases 1 and 2 and Case 3 can be formulated as a shortest path problem and a Markov decision process, respectively. Moreover, the equilibrium problem in each case reduces to a polynomial-sized mixed integer linear programming, linear programming, and bilinear programming problem, respectively.
UR - http://www.scopus.com/inward/record.url?scp=85081264388&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85081264388&partnerID=8YFLogxK
U2 - 10.15807/jorsj.62.152
DO - 10.15807/jorsj.62.152
M3 - Article
AN - SCOPUS:85081264388
SN - 0453-4514
VL - 62
SP - 152
EP - 168
JO - Journal of the Operations Research Society of Japan
JF - Journal of the Operations Research Society of Japan
IS - 4
ER -