TY - GEN
T1 - Introduction and control of subgoals in reinforcement learning
AU - Murata, Junichi
AU - Abe, Yasuomi
AU - Ota, Keisuke
PY - 2007
Y1 - 2007
N2 - Reinforcement learning (RL) can be applied to a wide class of problems because it requires no other information than perceived states and rewards to find good action policies. However, it takes a large number of trials before acquiring the optimal policy. In order to make RL faster, use of subgoals is proposed. Since errors and ambiguity are inevitable in subgoal information provided by human designers, a mechanism is proposed that controls use of subgoals. The method is applied to examples and the results show that use of subgoals is very effective in accelerating RL and that the proposed control mechanism successfully suppresses possible critical damages on the RL performance caused by errors and ambiguity in subgoal information.
AB - Reinforcement learning (RL) can be applied to a wide class of problems because it requires no other information than perceived states and rewards to find good action policies. However, it takes a large number of trials before acquiring the optimal policy. In order to make RL faster, use of subgoals is proposed. Since errors and ambiguity are inevitable in subgoal information provided by human designers, a mechanism is proposed that controls use of subgoals. The method is applied to examples and the results show that use of subgoals is very effective in accelerating RL and that the proposed control mechanism successfully suppresses possible critical damages on the RL performance caused by errors and ambiguity in subgoal information.
UR - http://www.scopus.com/inward/record.url?scp=38349094561&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=38349094561&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:38349094561
SN - 9780889866317
T3 - Proceedings of the IASTED International Conference on Artificial Intelligence and Applications, AIA 2007
SP - 329
EP - 334
BT - Proceedings of the IASTED International Conference on Artificial Intelligence and Applications, AIA 2007
T2 - IASTED International Conference on Artificial Intelligence and Applications, AIA 2007
Y2 - 12 February 2007 through 14 February 2007
ER -