Abstract
Here the Newton's Method direct action selection approach to continuous action-space reinforcement learning is extended to use an eligibility trace. This is then compared to the momentum term approach from the literature in terms of the update equations and also the success rate and number of trials required to train on two variants of the simulated Cart-Pole benchmark problem. The eligibility trace approach achieves a higher success rate with a far wider range of parameter values than the momentum approach and also trains in fewer trials on the Cart-Pole problem.
Original language | English |
---|---|
Title of host publication | 2017 9th Computer Science and Electronic Engineering (CEEC) |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Pages | 55-59 |
Number of pages | 5 |
ISBN (Print) | 978-1-5386-3008-2 |
DOIs | |
Publication status | Published - 29 Sept 2017 |
Event | 2017 9th Computer Science and Electronic Engineering (CEEC) - Colchester, UK Duration: 27 Sept 2017 → 29 Sept 2017 |
Conference
Conference | 2017 9th Computer Science and Electronic Engineering (CEEC) |
---|---|
Period | 27/09/17 → 29/09/17 |
Keywords
- reinforcement learning
- eligibility trace
- momentum
- continuous state- and action-space
- artificial neural networks