Logo do repositório

Neural Architecture Search for Enhancing Action Video Recognition in Compressed Domains

dc.contributor.authorLamkowski, Pedro [UNESP]
dc.contributor.authorRodrigues, Douglas [UNESP]
dc.contributor.authorPassos, Leandro A. [UNESP]
dc.contributor.authorPapa, João P. [UNESP]
dc.contributor.authorAlmeida, Jurandy
dc.contributor.institutionUniversidade Estadual Paulista (UNESP)
dc.contributor.institutionUniversidade Federal de São Carlos (UFSCar)
dc.date.accessioned2025-04-29T20:08:58Z
dc.date.issued2024-01-01
dc.description.abstractVideo classification models have become one of the most widely used topics in the computer vision field, encompassing many tasks such as medical, security, industrial, and other applications. Although deep learning models have achieved great results in the video domain, such models are built to operate in the domain of RGB frame sequences. In such models, a prior step is required for decoding video data since the vast majority relies on compressed formats. Nevertheless, large amounts of computational resources are required for decoding, especially in real-time. Researchers have already tackled the task of building networks that work in the compressed domain with promising results but with architectures still very close to those used for the RGB domain. We propose an approach that employs Neural Architecture Search to explore and find the most effective architectures for the compressed domain. Our approach was tested on UCF101 and HMDB51 datasets, obtaining a computationally less complex architecture than similar methods.en
dc.description.affiliationSão Paulo State University Department of Computing, São Paulo
dc.description.affiliationFederal University of São Carlos Department of Computing, São Paulo
dc.description.affiliationUnespSão Paulo State University Department of Computing, São Paulo
dc.identifierhttp://dx.doi.org/10.1109/IWSSIP62407.2024.10634021
dc.identifier.citationInternational Conference on Systems, Signals, and Image Processing.
dc.identifier.doi10.1109/IWSSIP62407.2024.10634021
dc.identifier.issn2157-8702
dc.identifier.issn2157-8672
dc.identifier.scopus2-s2.0-85202801149
dc.identifier.urihttps://hdl.handle.net/11449/307317
dc.language.isoeng
dc.relation.ispartofInternational Conference on Systems, Signals, and Image Processing
dc.sourceScopus
dc.subjectcompressed domain
dc.subjectneural architecture search
dc.subjectvideo classification
dc.titleNeural Architecture Search for Enhancing Action Video Recognition in Compressed Domainsen
dc.typeTrabalho apresentado em eventopt
dspace.entity.typePublication

Arquivos

Coleções