Logo do repositório

Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning

dc.contributor.authorDe Almeida, Aline Gabriel [UNESP]
dc.contributor.authorColombini, Esther Luna
dc.contributor.authorDa Silva Simoes, Alexandre [UNESP]
dc.contributor.institutionUniversidade Estadual Paulista (UNESP)
dc.contributor.institutionUniversidade Estadual de Campinas (UNICAMP)
dc.date.accessioned2025-04-29T18:07:49Z
dc.date.issued2023-01-01
dc.description.abstractUnmanned Aerial Vehicles (UAVs) have gained significant attention in various domains due to their versatility and potential applications. Effective control of UAVs is crucial for achieving desired flight behaviors and optimizing their performance. This paper presents a comprehensive exploration of learning-based approaches for controlling UAVs with fixed-rotors and tiltrotors, specifically focusing on the Proximal Policy Optimization (PPO) and Twin-Delayed Deep Deterministic Policy Gradient (TD3) algorithms. The study aims to compare and evaluate the efficacy of these two state-of-the-art reinforcement learning algorithms in controlling UAVs with varying designs and control complexities. By utilizing PPO and TD3, we address the challenges associated with maneuvering UAVs in dynamic environments and achieving precise control under different flight conditions. We conducted extensive simulations to assess the performance of PPO and TD3 algorithms in diverse UAV scenarios, considering multiple design configurations and control requirements. The evaluation criteria encompassed stability, robustness, trajectory tracking accuracy, and control efficiency. Results demonstrate the suitability and effectiveness of both PPO and TD3 in controlling UAVs.en
dc.description.affiliationInst. of Science and Tech. of Sorocaba São Paulo State University (Unesp)
dc.description.affiliationInstitute of Computing University of Campinas (Unicamp)
dc.description.affiliationUnespInst. of Science and Tech. of Sorocaba São Paulo State University (Unesp)
dc.format.extent107-112
dc.identifierhttp://dx.doi.org/10.1109/LARS/SBR/WRE59448.2023.10333034
dc.identifier.citationProceedings - 2023 Latin American Robotics Symposium, 2023 Brazilian Symposium on Robotics, and 2023 Workshop of Robotics in Education, LARS/SBR/WRE 2023, p. 107-112.
dc.identifier.doi10.1109/LARS/SBR/WRE59448.2023.10333034
dc.identifier.scopus2-s2.0-85181114602
dc.identifier.urihttps://hdl.handle.net/11449/297822
dc.language.isoeng
dc.relation.ispartofProceedings - 2023 Latin American Robotics Symposium, 2023 Brazilian Symposium on Robotics, and 2023 Workshop of Robotics in Education, LARS/SBR/WRE 2023
dc.sourceScopus
dc.subjectProximal Policy Optimization (PPO)
dc.subjectReinforcement Learning
dc.subjectTiltrotor
dc.subjectTwin-Delayed Deep Deterministic Policy Gradient (TD3)
dc.subjectUnmanned Aerial Vehicle (UAV)
dc.titleControlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learningen
dc.typeTrabalho apresentado em eventopt
dspace.entity.typePublication
relation.isOrgUnitOfPublication0bc7c43e-b5b0-4350-9d05-74d892acf9d1
relation.isOrgUnitOfPublication.latestForDiscovery0bc7c43e-b5b0-4350-9d05-74d892acf9d1
unesp.campusUniversidade Estadual Paulista (UNESP), Instituto de Ciência e Tecnologia, Sorocabapt

Arquivos