Logo do repositório

Canonical cortical graph neural networks and its application for speech enhancement in audio-visual hearing aids

dc.contributor.authorPassos, Leandro A.
dc.contributor.authorPapa, João Paulo [UNESP]
dc.contributor.authorHussain, Amir
dc.contributor.authorAdeel, Ahsan
dc.contributor.institutionUniversity of Wolverhampton
dc.contributor.institutionUniversidade Estadual Paulista (UNESP)
dc.contributor.institutionEdinburgh Napier University
dc.contributor.institutiondeepCI.org 20/1 Parkside Terrace
dc.date.accessioned2023-07-29T16:03:47Z
dc.date.available2023-07-29T16:03:47Z
dc.date.issued2023-03-28
dc.description.abstractDespite the recent success of machine learning algorithms, most models face drawbacks when considering more complex tasks requiring interaction between different sources, such as multimodal input data and logical time sequences. On the other hand, the biological brain is highly sharpened in this sense, empowered to automatically manage and integrate such streams of information. In this context, this work draws inspiration from recent discoveries in brain cortical circuits to propose a more biologically plausible self-supervised machine learning approach. This combines multimodal information using intra-layer modulations together with Canonical Correlation Analysis, and a memory mechanism to keep track of temporal data, the overall approach termed Canonical Cortical Graph Neural networks. This is shown to outperform recent state-of-the-art models in terms of clean audio reconstruction and energy efficiency for a benchmark audio-visual speech dataset. The enhanced performance is demonstrated through a reduced and smother neuron firing rate distribution. suggesting that the proposed model is amenable for speech enhancement in future audio-visual hearing aid devices.en
dc.description.affiliationCMI Lab School of Engineering and Informatics University of Wolverhampton
dc.description.affiliationDepartment of Computing São Paulo State University
dc.description.affiliationSchool of Computing Edinburgh Napier University, Scotland
dc.description.affiliationdeepCI.org 20/1 Parkside Terrace
dc.description.affiliationUnespDepartment of Computing São Paulo State University
dc.format.extent196-203
dc.identifierhttp://dx.doi.org/10.1016/j.neucom.2022.11.081
dc.identifier.citationNeurocomputing, v. 527, p. 196-203.
dc.identifier.doi10.1016/j.neucom.2022.11.081
dc.identifier.issn1872-8286
dc.identifier.issn0925-2312
dc.identifier.scopus2-s2.0-85146713633
dc.identifier.urihttp://hdl.handle.net/11449/249585
dc.language.isoeng
dc.relation.ispartofNeurocomputing
dc.sourceScopus
dc.subjectCanonical correlation analysis
dc.subjectCortical circuits
dc.subjectGraph neural network
dc.subjectMultimodal learning
dc.subjectPositional encoding
dc.subjectPrior frames neighborhood
dc.titleCanonical cortical graph neural networks and its application for speech enhancement in audio-visual hearing aidsen
dc.typeArtigopt
dspace.entity.typePublication
relation.isDepartmentOfPublication872c0bbb-bf84-404e-9ca7-f87a0fe94e58
relation.isDepartmentOfPublication.latestForDiscovery872c0bbb-bf84-404e-9ca7-f87a0fe94e58
relation.isOrgUnitOfPublicationaef1f5df-a00f-45f4-b366-6926b097829b
relation.isOrgUnitOfPublication.latestForDiscoveryaef1f5df-a00f-45f4-b366-6926b097829b
unesp.campusUniversidade Estadual Paulista (UNESP), Faculdade de Ciências, Baurupt
unesp.departmentComputação - FCpt

Arquivos