Unsupervised Dual-Layer Aggregation for Feature Fusion on Image Retrieval Tasks
| dc.contributor.author | Moreno, Ademir [UNESP] | |
| dc.contributor.author | Guimaraes Pedronette, Daniel Carlos [UNESP] | |
| dc.contributor.institution | Universidade Estadual Paulista (UNESP) | |
| dc.date.accessioned | 2025-04-29T20:01:00Z | |
| dc.date.issued | 2024-01-01 | |
| dc.description.abstract | The revolutionary advances in image representation have led to impressive progress in many image understanding-related tasks, primarily supported by Convolutional Neural Networks (CNN) and, more recently, by Transformer models. Despite such advances, assessing the similarity among images for retrieval in unsupervised scenarios remains a challenging task, mostly grounded on traditional pairwise measures, such as the Euclidean distance. The scenario is even more challenging when different visual features are available, requiring the selection and fusion of features without any label information. In this paper, we propose an Unsupervised Dual-Layer Aggregation (UDLA) method, based on contextual similarity approaches for selecting and fusing CNN and Transformer-based visual features trained through transfer learning. In the first layer, the selected features are fused in pairs focused on precision. A sub-set of pairs is selected for a second layer aggregation focused on recall. An experimental evaluation conducted in different public datasets showed the effectiveness of the proposed approach, which achieved results significantly superior to the best-isolated feature and also superior to a recent fusion approach considered as baseline. | en |
| dc.description.affiliation | São Paulo State University (UNESP) Department of Statistics Applied Mathematics and Computing | |
| dc.description.affiliationUnesp | São Paulo State University (UNESP) Department of Statistics Applied Mathematics and Computing | |
| dc.identifier | http://dx.doi.org/10.1109/SIBGRAPI62404.2024.10716343 | |
| dc.identifier.citation | Brazilian Symposium of Computer Graphic and Image Processing. | |
| dc.identifier.doi | 10.1109/SIBGRAPI62404.2024.10716343 | |
| dc.identifier.issn | 1530-1834 | |
| dc.identifier.scopus | 2-s2.0-85207831512 | |
| dc.identifier.uri | https://hdl.handle.net/11449/304838 | |
| dc.language.iso | eng | |
| dc.relation.ispartof | Brazilian Symposium of Computer Graphic and Image Processing | |
| dc.source | Scopus | |
| dc.title | Unsupervised Dual-Layer Aggregation for Feature Fusion on Image Retrieval Tasks | en |
| dc.type | Trabalho apresentado em evento | pt |
| dspace.entity.type | Publication |

