Effective speaker retrieval and recognition through vector quantization and unsupervised distance learning

dc.contributor.author: De Abreu Campos, Victor [UNESP]
dc.contributor.author: Guimarães Pedronette, Daniel Carlos [UNESP]
dc.contributor.institution: Universidade Estadual Paulista (Unesp)
dc.date.accessioned: 2018-12-11T16:43:13Z
dc.date.available: 2018-12-11T16:43:13Z
dc.date.issued: 2016-06-06
dc.description.abstract: The huge amount of multimedia content accumulated daily has demanded the development of effective retrieval approaches. In this context, speaker recognition methods capable of automatically identifying a person through their voice are of great relevance. This paper presents a novel speaker recognition approach modelled in a retrieval scenario and using a recent unsupervised learning method. The proposed approach uses MFCC features and a Vector Quantization model to compute distances among audio objects. Next, a rank-based unsupervised learning method is used to improve the effectiveness of the retrieval results. Several experiments were conducted on three public datasets with different settings, such as background noise from diverse sources. Experimental results demonstrate that the proposed approach can achieve very high effectiveness. In addition, effectiveness gains of up to 27% were obtained by the unsupervised learning procedure.
dc.description.affiliation: Dept. of Statistics, Applied Math. and Computing, Universidade Estadual Paulista (UNESP)
dc.description.affiliationUnesp: Dept. of Statistics, Applied Math. and Computing, Universidade Estadual Paulista (UNESP)
dc.format.extent: 27-32
dc.identifier: http://dx.doi.org/10.1145/2927006.2927010
dc.identifier.citation: MARMI 2016 - Proceedings of the 2016 ACM 1st International Workshop on Multimedia Analysis and Retrieval for Multimodal Interaction, co-located with ICMR 2016, p. 27-32.
dc.identifier.doi: 10.1145/2927006.2927010
dc.identifier.scopus: 2-s2.0-84978747065
dc.identifier.uri: http://hdl.handle.net/11449/168820
dc.language.iso: eng
dc.relation.ispartof: MARMI 2016 - Proceedings of the 2016 ACM 1st International Workshop on Multimedia Analysis and Retrieval for Multimodal Interaction, co-located with ICMR 2016
dc.rights.accessRights: Open access
dc.source: Scopus
dc.subject: Speaker recognition
dc.subject: Unsupervised learning
dc.subject: Vector quantization
dc.title: Effective speaker retrieval and recognition through vector quantization and unsupervised distance learning
dc.type: Conference paper