Resumen
Value alignment is a property of an intelligent agent indicating that it can only pursue goals that are beneficial to humans. Successful value alignment should ensure that an artificial general intelligence cannot intentionally or unintentionally perform behaviors that adversely affect humans. This is problematic in practice since it is difficult to exhaustively enumerated by human programmers. In order for successful value alignment, we argue that values should be learned. In this paper, we hypothesize that an artificial intelligence that can read and understand stories can learn the values tacitly held by the culture from which the stories originate. We describe preliminary work on using stories to generate a value-aligned reward signal for reinforcement learning agents that prevents psychotic-appearing behavior.
| Idioma original | English |
|---|---|
| Título de la publicación alojada | WS-16-01 |
| Subtítulo de la publicación alojada | Artificial Intelligence Applied to Assistive Technologies and Smart Environments; WS-16-02: AI, Ethics, and Society; WS-16-03: Artificial Intelligence for Cyber Security; WS-16-04: Artificial Intelligence for Smart Grids and Smart Buildings; WS-16-05: Beyond NP; WS-16-06: Computer Poker and Imperfect Information Games; WS-16-07: Declarative Learning Based Programming; WS-16-08: Expanding the Boundaries of Health Informatics Using AI; WS-16-09: Incentives and Trust in Electronic Communities; WS-16-10: Knowledge Extraction from Text; WS-16-11: Multiagent Interaction without Prior Coordination; WS-16-12: Planning for Hybrid Systems; WS-16-13: Scholarly Big Data: AI Perspectives, Challenges, and Ideas; WS-16-14: Symbiotic Cognitive Systems; WS-16-15: World Wide Web and Population Health Intelligence |
| Páginas | 105-112 |
| Número de páginas | 8 |
| ISBN (versión digital) | 9781577357599 |
| Estado | Published - 2016 |
| Evento | 2016 AAAI Workshop - Phoenix, United States Duración: feb 12 2016 → feb 13 2016 |
Serie de la publicación
| Nombre | AAAI Workshop - Technical Report |
|---|---|
| Volumen | WS-16-01 - WS-16-15 |
Conference
| Conference | 2016 AAAI Workshop |
|---|---|
| País/Territorio | United States |
| Ciudad | Phoenix |
| Período | 2/12/16 → 2/13/16 |
Nota bibliográfica
Publisher Copyright:Copyright © 2016, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
ASJC Scopus subject areas
- General Engineering
Huella
Profundice en los temas de investigación de 'Using stories to teach human values to artificial agents'. En conjunto forman una huella única.Citar esto
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver