Intelligent control of a quadrotor with proximal policy optimization reinforcement learning

Lopes, Guilherme Cano; Ferreira, Murillo [UNESP]; Da Silva Simoes, Alexandre [UNESP]; Colombini, Esther Luna [UNESP]

doi:10.1109/LARS/SBR/WRE.2018.00094

Intelligent control of a quadrotor with proximal policy optimization reinforcement learning

dc.contributor.author	Lopes, Guilherme Cano
dc.contributor.author	Ferreira, Murillo [UNESP]
dc.contributor.author	Da Silva Simoes, Alexandre [UNESP]
dc.contributor.author	Colombini, Esther Luna [UNESP]
dc.contributor.institution	Universidade Estadual de Campinas (UNICAMP)
dc.contributor.institution	Universidade Estadual Paulista (Unesp)
dc.date.accessioned	2019-10-06T15:33:05Z
dc.date.available	2019-10-06T15:33:05Z
dc.date.issued	2018-12-24
dc.description.abstract	Aerial platforms, such as quadrotors, are inherently unstable systems. Generally, the task of stabilizing the flight of a quadrotor is approached by techniques based on classic and modern control algorithms. However, recent model-free reinforcement learning algorithms have been successfully used for controlling drones. In this work we show the feasibility of applying reinforcement learning methods to optimize a stochastic control policy (during training), in order to perform the position control of the 'model-free' quadrotor. This process is achieved while maintaining a good sampling efficiency, allowing fast convergence even when using computationally expensive off-The-shelf simulators for robotics and without the necessity of any additional exploration strategy. We used the Proximal Policy Optimization (PPO) algorithm to make the agent learn a reliable control policy. The experiments for the resultant intelligent controller were performed using the V-REP simulator and the Vortex physics engine.	en
dc.description.affiliation	Institute of Computing State University of Campinas-UNICAMP
dc.description.affiliation	Institute of Science and Technology of Sorocaba-ICTS São Paulo State University-UNESP
dc.description.affiliationUnesp	Institute of Science and Technology of Sorocaba-ICTS São Paulo State University-UNESP
dc.format.extent	509-514
dc.identifier	http://dx.doi.org/10.1109/LARS/SBR/WRE.2018.00094
dc.identifier.citation	Proceedings - 15th Latin American Robotics Symposium, 6th Brazilian Robotics Symposium and 9th Workshop on Robotics in Education, LARS/SBR/WRE 2018, p. 509-514.
dc.identifier.doi	10.1109/LARS/SBR/WRE.2018.00094
dc.identifier.scopus	2-s2.0-85061334198
dc.identifier.uri	http://hdl.handle.net/11449/187338
dc.language.iso	eng
dc.relation.ispartof	Proceedings - 15th Latin American Robotics Symposium, 6th Brazilian Robotics Symposium and 9th Workshop on Robotics in Education, LARS/SBR/WRE 2018
dc.rights.accessRights	Acesso restrito	pt
dc.source	Scopus
dc.subject	Control
dc.subject	Proximal Policy Optimization
dc.subject	Quadrotor
dc.subject	Reinforcement Learning
dc.title	Intelligent control of a quadrotor with proximal policy optimization reinforcement learning	en
dc.type	Trabalho apresentado em evento	pt
dspace.entity.type	Publication
relation.isOrgUnitOfPublication	0bc7c43e-b5b0-4350-9d05-74d892acf9d1
relation.isOrgUnitOfPublication.latestForDiscovery	0bc7c43e-b5b0-4350-9d05-74d892acf9d1
unesp.campus	Universidade Estadual Paulista (UNESP), Instituto de Ciência e Tecnologia, Sorocaba	pt

Coleções

Sorocaba - ICTS - Instituto de Ciência e Tecnologia

Intelligent control of a quadrotor with proximal policy optimization reinforcement learning

Arquivos

Coleções