Publicação: Intelligent control of a quadrotor with proximal policy optimization reinforcement learning
dc.contributor.author | Lopes, Guilherme Cano | |
dc.contributor.author | Ferreira, Murillo [UNESP] | |
dc.contributor.author | Da Silva Simoes, Alexandre [UNESP] | |
dc.contributor.author | Colombini, Esther Luna [UNESP] | |
dc.contributor.institution | Universidade Estadual de Campinas (UNICAMP) | |
dc.contributor.institution | Universidade Estadual Paulista (Unesp) | |
dc.date.accessioned | 2019-10-06T15:33:05Z | |
dc.date.available | 2019-10-06T15:33:05Z | |
dc.date.issued | 2018-12-24 | |
dc.description.abstract | Aerial platforms, such as quadrotors, are inherently unstable systems. Generally, the task of stabilizing the flight of a quadrotor is approached by techniques based on classic and modern control algorithms. However, recent model-free reinforcement learning algorithms have been successfully used for controlling drones. In this work we show the feasibility of applying reinforcement learning methods to optimize a stochastic control policy (during training), in order to perform the position control of the 'model-free' quadrotor. This process is achieved while maintaining a good sampling efficiency, allowing fast convergence even when using computationally expensive off-The-shelf simulators for robotics and without the necessity of any additional exploration strategy. We used the Proximal Policy Optimization (PPO) algorithm to make the agent learn a reliable control policy. The experiments for the resultant intelligent controller were performed using the V-REP simulator and the Vortex physics engine. | en |
dc.description.affiliation | Institute of Computing State University of Campinas-UNICAMP | |
dc.description.affiliation | Institute of Science and Technology of Sorocaba-ICTS São Paulo State University-UNESP | |
dc.description.affiliationUnesp | Institute of Science and Technology of Sorocaba-ICTS São Paulo State University-UNESP | |
dc.format.extent | 509-514 | |
dc.identifier | http://dx.doi.org/10.1109/LARS/SBR/WRE.2018.00094 | |
dc.identifier.citation | Proceedings - 15th Latin American Robotics Symposium, 6th Brazilian Robotics Symposium and 9th Workshop on Robotics in Education, LARS/SBR/WRE 2018, p. 509-514. | |
dc.identifier.doi | 10.1109/LARS/SBR/WRE.2018.00094 | |
dc.identifier.scopus | 2-s2.0-85061334198 | |
dc.identifier.uri | http://hdl.handle.net/11449/187338 | |
dc.language.iso | eng | |
dc.relation.ispartof | Proceedings - 15th Latin American Robotics Symposium, 6th Brazilian Robotics Symposium and 9th Workshop on Robotics in Education, LARS/SBR/WRE 2018 | |
dc.rights.accessRights | Acesso restrito | pt |
dc.source | Scopus | |
dc.subject | Control | |
dc.subject | Proximal Policy Optimization | |
dc.subject | Quadrotor | |
dc.subject | Reinforcement Learning | |
dc.title | Intelligent control of a quadrotor with proximal policy optimization reinforcement learning | en |
dc.type | Trabalho apresentado em evento | pt |
dspace.entity.type | Publication | |
unesp.campus | Universidade Estadual Paulista (UNESP), Instituto de Ciência e Tecnologia, Sorocaba | pt |