Pyramid Representations of the Set of Actions in Reinforcement Learning
Future robot systems will perform increasingly complex tasks in environments that are less and less structured and known. Robots will need to adapt their hardware and software, at first only to foreseen changes, but ultimately to more complex changes of the environment. In this paper we describe a reinforcement-based learning strategy that allows a robot to learn quickly from scratch using only its interaction with the environment, even when the reward is provided by a human observer and is therefore highly non-deterministic and noisy. To achieve this, our proposal uses a novel representation of the action space together with an ensemble of learners able to forecast the time interval before a robot failure.
Keywords: Reinforcement learning, Robotics, Ensembles, Learning and adaptation
Publication: Congress
June 18, 2021
Authors: R. Iglesias, V. Álvarez-Santos, M.A. Rodríguez, D. Santos Saavedra, C.V. Regueiro, X.M. Pardo
DOI: 10.1007/978-3-319-18833-1_22