TY - GEN
T1 - Communication for improving policy computation in distributed POMDPs
AU - Nair, Ranjit
AU - Tambe, Milind
AU - Roth, Maayan
AU - Yokoo, Makoto
PY - 2004
Y1 - 2004
N2 - Distributed Partially Observable Marfcov Decision Problems (POMDPs) are emerging as a popular approach for modeling multiagent teamwork where a group of agents work together to jointly maximize a reward function. Since the problem of finding the optimal joint policy for a distributed POMDP has been shown to be NEXP-Complete if no assumptions are made about the domain conditions, several locally optimal approaches have emerged as a viable solution. However, the use of communicative actions as part of these locally optimal algorithms has been largely ignored or has been applied only under restrictive assumptions about the domain. In this paper, we show how communicative acts can be explicitly introduced in order to find locally optimal joint policies that allow agents to coordinate better through synchronization achieved via communication. Furthermore, the introduction of communication allows us to develop a novel compact policy representation that results in savings of both space and time which are verified empirically. Finally, through the imposition of constraints on communication such as not going without communicating for more than K steps, even greater space and time savings can be obtained.
AB - Distributed Partially Observable Marfcov Decision Problems (POMDPs) are emerging as a popular approach for modeling multiagent teamwork where a group of agents work together to jointly maximize a reward function. Since the problem of finding the optimal joint policy for a distributed POMDP has been shown to be NEXP-Complete if no assumptions are made about the domain conditions, several locally optimal approaches have emerged as a viable solution. However, the use of communicative actions as part of these locally optimal algorithms has been largely ignored or has been applied only under restrictive assumptions about the domain. In this paper, we show how communicative acts can be explicitly introduced in order to find locally optimal joint policies that allow agents to coordinate better through synchronization achieved via communication. Furthermore, the introduction of communication allows us to develop a novel compact policy representation that results in savings of both space and time which are verified empirically. Finally, through the imposition of constraints on communication such as not going without communicating for more than K steps, even greater space and time savings can be obtained.
UR - http://www.scopus.com/inward/record.url?scp=4544315369&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=4544315369&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:4544315369
SN - 1581138644
SN - 9781581138641
T3 - Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS 2004
SP - 1098
EP - 1105
BT - Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS 2004
A2 - Jennings, N.R.
A2 - Sierra, C.
A2 - Sonenberg, L.
A2 - Tambe, M.
T2 - Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS 2004
Y2 - 19 July 2004 through 23 July 2004
ER -