Weeds Classification with Deep Learning: An Investigation Using CNN, Vision Transformers, Pyramid Vision Transformers, and Ensemble Strategy
| dc.contributor.author | Rozendo, Guilherme Botazzo [UNESP] | |
| dc.contributor.author | Roberto, Guilherme Freire | |
| dc.contributor.author | do Nascimento, Marcelo Zanchetta | |
| dc.contributor.author | Alves Neves, Leandro [UNESP] | |
| dc.contributor.author | Lumini, Alessandra | |
| dc.contributor.institution | University of Porto (FEUP) | |
| dc.contributor.institution | Universidade Federal de Uberlândia (UFU) | |
| dc.contributor.institution | Universidade Estadual Paulista (UNESP) | |
| dc.date.accessioned | 2025-04-29T20:08:42Z | |
| dc.date.issued | 2024-01-01 | |
| dc.description.abstract | Weeds are a significant threat to agricultural production. Weed classification systems based on image analysis have offered innovative solutions to agricultural problems, with convolutional neural networks (CNNs) playing a pivotal role in this task. However, CNNs are limited in their ability to capture global relationships in images due to their localized convolutional operation. Vision Transformers (ViT) and Pyramid Vision Transformers (PVT) have emerged as viable solutions to overcome this limitation. Our study aims to determine the effectiveness of CNN, PVT, and ViT in classifying weeds in image datasets. We also examine if combining these methods in an ensemble can enhance classification performance. Our tests were conducted on significant agricultural datasets, including DeepWeeds and CottonWeedID15. The results indicate that a maximum of 3 methods in an ensemble, with only 15 epochs in training, can achieve high accuracy rates of up to 99.17%. This study demonstrates that high accuracies can be achieved with ease of implementation and only a few epochs. | en |
| dc.description.affiliation | Department of Computer Science and Engineering (DISI) - University of Bologna | |
| dc.description.affiliation | Faculty of Engineering University of Porto (FEUP) | |
| dc.description.affiliation | Faculty of Computer Science (FACOM) Federal University of Uberlândia (UFU) | |
| dc.description.affiliation | Department of Computer Science and Statistics (DCCE) São Paulo State University | |
| dc.description.affiliationUnesp | Department of Computer Science and Statistics (DCCE) São Paulo State University | |
| dc.description.sponsorship | European Commission | |
| dc.format.extent | 229-243 | |
| dc.identifier | http://dx.doi.org/10.1007/978-3-031-49018-7_17 | |
| dc.identifier.citation | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 14469 LNCS, p. 229-243. | |
| dc.identifier.doi | 10.1007/978-3-031-49018-7_17 | |
| dc.identifier.issn | 1611-3349 | |
| dc.identifier.issn | 0302-9743 | |
| dc.identifier.scopus | 2-s2.0-85178553087 | |
| dc.identifier.uri | https://hdl.handle.net/11449/307220 | |
| dc.language.iso | eng | |
| dc.relation.ispartof | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | |
| dc.source | Scopus | |
| dc.subject | CNN | |
| dc.subject | Ensemble | |
| dc.subject | Pyramid Vision Transformers | |
| dc.subject | Vision transformers | |
| dc.subject | Weeds classification | |
| dc.title | Weeds Classification with Deep Learning: An Investigation Using CNN, Vision Transformers, Pyramid Vision Transformers, and Ensemble Strategy | en |
| dc.type | Trabalho apresentado em evento | pt |
| dspace.entity.type | Publication | |
| unesp.author.orcid | 0000-0002-4123-8264[1] | |
| unesp.author.orcid | 0000-0001-5883-2983[2] | |
| unesp.author.orcid | 0000-0003-3537-0178[3] | |
| unesp.author.orcid | 0000-0001-8580-7054[4] | |
| unesp.author.orcid | 0000-0003-0290-7354[5] |
