Abstract
Deep reinforcement learning has great potential to acquire complex, adaptive behaviors for autonomous agents automatically. However, the underlying neural network polices have not been widely deployed in real-world applications, especially in these safety-critical tasks (e.g., autonomous driving). One of the reasons is that the learned policy cannot perform flexible and resilient behaviors as traditional methods to adapt to diverse environments. In this paper, we consider the problem that a mobile robot learns adaptive and resilient behaviors for navigating in unseen uncertain environments while avoiding collisions. We present a novel approach for uncertainty-aware navigation by introducing an uncertainty-aware predictor to model the environmental uncertainty, and we propose a novel uncertainty-aware navigation network to learn resilient behaviors in the prior unknown environments. To train the proposed uncertainty-aware network more stably and efficiently, we present the temperature decay training paradigm, which balances exploration and exploitation during the training process. Our experimental evaluation demonstrates that our approach can learn resilient behaviors in diverse environments and generate adaptive trajectories according to environmental uncertainties.
| Original language | English |
|---|---|
| Title of host publication | 2020 IEEE International Conference on Robotics and Automation, ICRA 2020 |
| Pages | 5299-5305 |
| Number of pages | 7 |
| ISBN (Electronic) | 9781728173955 |
| DOIs | |
| State | Published - May 2020 |
| Event | 2020 IEEE International Conference on Robotics and Automation, ICRA 2020 - Paris, France Duration: May 31 2020 → Aug 31 2020 |
Publication series
| Name | Proceedings - IEEE International Conference on Robotics and Automation |
|---|---|
| ISSN (Print) | 1050-4729 |
Conference
| Conference | 2020 IEEE International Conference on Robotics and Automation, ICRA 2020 |
|---|---|
| Country/Territory | France |
| City | Paris |
| Period | 5/31/20 → 8/31/20 |
Bibliographical note
Publisher Copyright:© 2020 IEEE.
Funding
This work was partially supported by HKSAR General Research Fund (GRF) HKU 11202119, 11207818, and NSFC/RGC Joint Research Scheme (HKU103/16-NSFC61631166002)
| Funders | Funder number |
|---|---|
| HKSAR General Research Fund | |
| NSFC-RGC | HKU103/16-NSFC61631166002 |
| New England Glaucoma Research Foundation | HKU 11202119, 11207818 |
ASJC Scopus subject areas
- Software
- Artificial Intelligence
- Electrical and Electronic Engineering
- Control and Systems Engineering