Publicação: Benchmarking anchor-based and anchor-free state-of-the-art deep learning methods for individual tree detection in rgb high-resolution images
dc.contributor.author | Zamboni, Pedro | |
dc.contributor.author | Junior, José Marcato | |
dc.contributor.author | Silva, Jonathan de Andrade | |
dc.contributor.author | Miyoshi, Gabriela Takahashi [UNESP] | |
dc.contributor.author | Matsubara, Edson Takashi | |
dc.contributor.author | Nogueira, Keiller | |
dc.contributor.author | Gonçalves, Wesley Nunes | |
dc.contributor.institution | Federal University of Mato Grosso do Sul | |
dc.contributor.institution | Universidade Estadual Paulista (UNESP) | |
dc.contributor.institution | University of Stirling | |
dc.date.accessioned | 2022-04-29T08:30:21Z | |
dc.date.available | 2022-04-29T08:30:21Z | |
dc.date.issued | 2021-07-01 | |
dc.description.abstract | Urban forests contribute to maintaining livability and increase the resilience of cities in the face of population growth and climate change. Information about the geographical distribution of individual trees is essential for the proper management of these systems. RGB high-resolution aerial images have emerged as a cheap and efficient source of data, although detecting and mapping single trees in an urban environment is a challenging task. Thus, we propose the evaluation of novel methods for single tree crown detection, as most of these methods have not been investigated in remote sensing applications. A total of 21 methods were investigated, including anchor-based (one and two-stage) and anchor-free state-of-the-art deep-learning methods. We used two orthoimages divided into 220 non-overlapping patches of 512 × 512 pixels with a ground sample distance (GSD) of 10 cm. The orthoimages were manually annotated, and 3382 single tree crowns were identified as the ground-truth. Our findings show that the anchor-free detectors achieved the best average performance with an AP50 of 0.686. We observed that the two-stage anchor-based and anchor-free methods showed better performance for this task, emphasizing the FSAF, Double Heads, CARAFE, ATSS, and FoveaBox models. RetinaNet, which is currently commonly applied in remote sensing, did not show satisfactory performance, and Faster R-CNN had lower results than the best methods but with no statistically significant difference. Our findings contribute to a better understanding of the performance of novel deep-learning methods in remote sensing applications and could be used as an indicator of the most suitable methods in such applications. | en |
dc.description.affiliation | Faculty of Engineering Architecture and Urbanism and Geography Federal University of Mato Grosso do Sul | |
dc.description.affiliation | Faculty of Computer Science Federal University of Mato Grosso do Sul | |
dc.description.affiliation | Department of Cartography São Paulo State University (UNESP) | |
dc.description.affiliation | Computing Science and Mathematics Division Faculty of Natural Sciences University of Stirling | |
dc.description.affiliationUnesp | Department of Cartography São Paulo State University (UNESP) | |
dc.description.sponsorship | Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) | |
dc.description.sponsorship | Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) | |
dc.description.sponsorshipId | CNPq: 303559/2019-5 | |
dc.description.sponsorshipId | CNPq: 304052/2019-1 | |
dc.description.sponsorshipId | CNPq: 433783/2018-4 | |
dc.description.sponsorshipId | CAPES: 88881.311850/2018-01 | |
dc.identifier | http://dx.doi.org/10.3390/rs13132482 | |
dc.identifier.citation | Remote Sensing, v. 13, n. 13, 2021. | |
dc.identifier.doi | 10.3390/rs13132482 | |
dc.identifier.issn | 2072-4292 | |
dc.identifier.scopus | 2-s2.0-85109397264 | |
dc.identifier.uri | http://hdl.handle.net/11449/229100 | |
dc.language.iso | eng | |
dc.relation.ispartof | Remote Sensing | |
dc.source | Scopus | |
dc.subject | Convolutional neural network | |
dc.subject | Object detection | |
dc.subject | Remote sensing | |
dc.title | Benchmarking anchor-based and anchor-free state-of-the-art deep learning methods for individual tree detection in rgb high-resolution images | en |
dc.type | Artigo | |
dspace.entity.type | Publication | |
unesp.department | Cartografia - FCT | pt |