TY - GEN
T1 - Expediting RL by using graphical structures
AU - Dai, Peng
AU - Strehi, Alexander L.
AU - Goldsmith, Judy
PY - 2008
Y1 - 2008
N2 - The goal of Reinforcement learning (RL) is to maximize reward (minimize cost) in a Markov decision process (MDP) without knowing the underlying model a priori. RL algorithms tend to be much slower than planning algorithms, which require the model as input. Recent results demonstrate that MDP planning can be expedited, by exploiting the graphical structure of the MDP. We present extensions to two popular RL algorithms, Q-learning and RMax, that learn and exploit the graphical structure of problems to improve overall learning speed. Use of the graphical structure of the under- lying MDP can greatly improve the speed of planning algorithms, if the underlying MDP has a nontrivial topological structure. Our experiments show that use of the apparent topological structure of an MDP speeds up reinforcement learning, even if the MDP is simply connected.
AB - The goal of Reinforcement learning (RL) is to maximize reward (minimize cost) in a Markov decision process (MDP) without knowing the underlying model a priori. RL algorithms tend to be much slower than planning algorithms, which require the model as input. Recent results demonstrate that MDP planning can be expedited, by exploiting the graphical structure of the MDP. We present extensions to two popular RL algorithms, Q-learning and RMax, that learn and exploit the graphical structure of problems to improve overall learning speed. Use of the graphical structure of the under- lying MDP can greatly improve the speed of planning algorithms, if the underlying MDP has a nontrivial topological structure. Our experiments show that use of the apparent topological structure of an MDP speeds up reinforcement learning, even if the MDP is simply connected.
UR - http://www.scopus.com/inward/record.url?scp=84899933255&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84899933255&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84899933255
SN - 9781605604701
T3 - Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS
SP - 1297
EP - 1300
BT - 7th International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS 2008
T2 - 7th International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS 2008
Y2 - 12 May 2008 through 16 May 2008
ER -