TY - GEN
T1 - A new method of concurrently visualizing states, values, and actions in reinforcement based brain machine interfaces
AU - Bae, Jihye
AU - Sanchez Giraldo, Luis G.
AU - Pohlmeyer, Eric A.
AU - Sanchez, Justin C.
AU - Principe, Jose C.
N1 - Copyright:
Copyright 2013 Elsevier B.V., All rights reserved.
PY - 2013
Y1 - 2013
N2 - This paper presents the first attempt to quantify the individual performance of the subject and of the computer agent on a closed loop Reinforcement Learning Brain Machine Interface (RLBMI). The distinctive feature of the RLBMI architecture is the co-adaptation of two systems (a BMI decoder in agent and a BMI user in environment). In this work, an agent implemented using Q-learning via kernel temporal difference (KTD)(λ) decodes the neural states of a monkey and transforms them into action directions of a robotic arm. We analyze how each participant influences the overall performance both in successful and missed trials by visualizing states, corresponding action value Q, and resulting actions in two-dimensional space. With the proposed methodology, we can observe how the decoder effectively learns a good state to action mapping, and how neural states affect the prediction performance.
AB - This paper presents the first attempt to quantify the individual performance of the subject and of the computer agent on a closed loop Reinforcement Learning Brain Machine Interface (RLBMI). The distinctive feature of the RLBMI architecture is the co-adaptation of two systems (a BMI decoder in agent and a BMI user in environment). In this work, an agent implemented using Q-learning via kernel temporal difference (KTD)(λ) decodes the neural states of a monkey and transforms them into action directions of a robotic arm. We analyze how each participant influences the overall performance both in successful and missed trials by visualizing states, corresponding action value Q, and resulting actions in two-dimensional space. With the proposed methodology, we can observe how the decoder effectively learns a good state to action mapping, and how neural states affect the prediction performance.
UR - http://www.scopus.com/inward/record.url?scp=84886542945&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84886542945&partnerID=8YFLogxK
U2 - 10.1109/EMBC.2013.6610770
DO - 10.1109/EMBC.2013.6610770
M3 - Conference contribution
C2 - 24110957
AN - SCOPUS:84886542945
SN - 9781457702167
T3 - Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS
SP - 5402
EP - 5405
BT - 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2013
Y2 - 3 July 2013 through 7 July 2013
ER -