Abstract
In order for robots and intelligent agents to interact with and learn from people with no machine-learning expertise, robots should be able to learn from natural human instruction. Many human explanations consist of simple sentences without state information, yet most machine learning techniques that incorporate human guidance cannot use nonspecific explanations. This work aims to learn policies from a few sentences that aren't state specific. The proposed Object-focused advice links an object to an action, and allows a person to generalize over an object's state space. To evaluate this technique, agents were trained using Object-focused advice collected from participants in an experiment in the Mario Bros. domain. The results show that Object-focused advice performs better than when no advice is given, the agent can learn where to apply the advice in the state space, and the agent can recover from adversarial advice. Also, including warnings of what not do to in addition to advice of what actions to take improves performance.
Original language | English |
---|---|
Title of host publication | AAMAS 2016 - Proceedings of the 2016 International Conference on Autonomous Agents and Multiagent Systems |
Pages | 1447-1448 |
Number of pages | 2 |
ISBN (Electronic) | 9781450342391 |
State | Published - 2016 |
Event | 15th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2016 - Singapore, Singapore Duration: May 9 2016 → May 13 2016 |
Publication series
Name | Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS |
---|---|
ISSN (Print) | 1548-8403 |
ISSN (Electronic) | 1558-2914 |
Conference
Conference | 15th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2016 |
---|---|
Country/Territory | Singapore |
City | Singapore |
Period | 5/9/16 → 5/13/16 |
Bibliographical note
Publisher Copyright:Copyright © 2016, International Foundation for Autonomous Agents and Multiagent Systems (www.ifaamas.org). All rights reserved.
Keywords
- Advice
- Human teachers
- Reinforcement learning
ASJC Scopus subject areas
- Artificial Intelligence
- Software
- Control and Systems Engineering