TY - GEN
T1 - Reinforcement learning via kernel temporal difference
AU - Bae, Jihye
AU - Chhatbar, Pratik
AU - Francis, Joseph T.
AU - Sanchez, Justin C.
AU - Principe, Jose C.
PY - 2011
Y1 - 2011
N2 - This paper introduces a kernel adaptive filter implemented with stochastic gradient on temporal differences, kernel Temporal Difference (TD)(λ), to estimate the state-action value function in reinforcement learning. The case λ=0 will be studied in this paper. Experimental results show the method's applicability for learning motor state decoding during a center-out reaching task performed by a monkey. The results are compared to the implementation of a time delay neural network (TDNN) trained with backpropagation of the temporal difference error. From the experiments, it is observed that kernel TD(0) allows faster convergence and a better solution than the neural network.
AB - This paper introduces a kernel adaptive filter implemented with stochastic gradient on temporal differences, kernel Temporal Difference (TD)(λ), to estimate the state-action value function in reinforcement learning. The case λ=0 will be studied in this paper. Experimental results show the method's applicability for learning motor state decoding during a center-out reaching task performed by a monkey. The results are compared to the implementation of a time delay neural network (TDNN) trained with backpropagation of the temporal difference error. From the experiments, it is observed that kernel TD(0) allows faster convergence and a better solution than the neural network.
UR - http://www.scopus.com/inward/record.url?scp=84055199049&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84055199049&partnerID=8YFLogxK
U2 - 10.1109/IEMBS.2011.6091370
DO - 10.1109/IEMBS.2011.6091370
M3 - Conference contribution
C2 - 22255624
AN - SCOPUS:84055199049
SN - 9781424441211
T3 - Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS
SP - 5662
EP - 5665
BT - 33rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS 2011
T2 - 33rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS 2011
Y2 - 30 August 2011 through 3 September 2011
ER -