GRAPH CONVOLUTIONAL NETWORKS AND MANIFOLD RANKING FOR MULTIMODAL VIDEO RETRIEVAL

de Almeida, Lucas Barbosa [UNESP]; Valem, Lucas Pascotti [UNESP]; Pedronette, Daniel Carlos Guimarães [UNESP]

dc.contributor.author	de Almeida, Lucas Barbosa [UNESP]
dc.contributor.author	Valem, Lucas Pascotti [UNESP]
dc.contributor.author	Pedronette, Daniel Carlos Guimarães [UNESP]
dc.contributor.institution	Universidade Estadual Paulista (UNESP)
dc.date.accessioned	2023-07-29T13:38:35Z
dc.date.available	2023-07-29T13:38:35Z
dc.date.issued	2022-01-01
dc.description.abstract	Despite the impressive advances achieved by supervised deep learning approaches on retrieval and classification tasks, acquiring labeled data for training remains a challenging bottleneck. In this scenario, it becomes imperative to develop more effective content-based retrieval approaches capable of taking advantage of multimodal information and of advances in unsupervised learning. Based on these observations, we propose two novel approaches that combine Graph Convolutional Networks (GCNs) with rank-based manifold learning methods. The GCN models were trained in an unsupervised way using the Deep Graph Infomax algorithm, and the proposed approaches employ recent rank-based manifold learning methods. Multimodal information is exploited through pre-trained CNNs, which are used via transfer learning to extract audio, image, and video features. The proposed approaches were evaluated on three public action recognition datasets. Highly effective results were obtained, with relative gains of up to +29.44% in MAP compared to baseline approaches without GCNs. The experimental evaluation also considered classical and recent baselines from the literature.	en
dc.description.affiliation	Department of Statistics, Applied Mathematics and Computing (DEMAC), São Paulo State University (UNESP)
dc.description.affiliationUnesp	Department of Statistics, Applied Mathematics and Computing (DEMAC), São Paulo State University (UNESP)
dc.description.sponsorship	Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
dc.description.sponsorshipId	FAPESP: #2018/15597-6
dc.description.sponsorshipId	FAPESP: #2020/03311-0
dc.description.sponsorshipId	FAPESP: #2020/11366-0
dc.format.extent	2811-2815
dc.identifier	http://dx.doi.org/10.1109/ICIP46576.2022.9897911
dc.identifier.citation	Proceedings - International Conference on Image Processing, ICIP, p. 2811-2815.
dc.identifier.doi	10.1109/ICIP46576.2022.9897911
dc.identifier.issn	1522-4880
dc.identifier.scopus	2-s2.0-85146715017
dc.identifier.uri	http://hdl.handle.net/11449/248246
dc.language.iso	eng
dc.relation.ispartof	Proceedings - International Conference on Image Processing, ICIP
dc.source	Scopus
dc.subject	graph convolutional networks
dc.subject	manifold learning
dc.subject	rank aggregation
dc.subject	video multimodal retrieval
dc.title	GRAPH CONVOLUTIONAL NETWORKS AND MANIFOLD RANKING FOR MULTIMODAL VIDEO RETRIEVAL	en
dc.type	Conference paper
dspace.entity.type	Publication
unesp.campus	Universidade Estadual Paulista (UNESP), Instituto de Geociências e Ciências Exatas, Rio Claro	pt
unesp.department	Estatística, Matemática Aplicada e Computação - IGCE	pt

Collections

Rio Claro - IGCE - Instituto de Geociências e Ciências Exatas
