Effective speaker retrieval and recognition through vector quantization and unsupervised distance learning

De Abreu Campos, Victor [UNESP]; Guimarães Pedronette, Daniel Carlos [UNESP]

dc.contributor.author	De Abreu Campos, Victor [UNESP]
dc.contributor.author	Guimarães Pedronette, Daniel Carlos [UNESP]
dc.contributor.institution	Universidade Estadual Paulista (Unesp)
dc.date.accessioned	2018-12-11T16:43:13Z
dc.date.available	2018-12-11T16:43:13Z
dc.date.issued	2016-06-06
dc.description.abstract	The huge amount of multimedia content accumulated daily has demanded the development of effective retrieval approaches. In this context, speaker recognition methods capable of automatically identifying a person through their voice are of great relevance. This paper presents a novel speaker recognition approach modelled as a retrieval task and using a recent unsupervised learning method. The proposed approach uses MFCC features and a Vector Quantization model to compute distances among audio objects. Next, a rank-based unsupervised learning method is used to improve the effectiveness of the retrieval results. Several experiments were conducted on three public datasets with different settings, such as background noise from diverse sources. Experimental results demonstrate that the proposed approach can achieve very high effectiveness. In addition, effectiveness gains of up to 27% were obtained by the unsupervised learning procedure.	en
dc.description.affiliation	Dept. of Statistics, Applied Mathematics and Computing, Universidade Estadual Paulista (UNESP)
dc.description.affiliationUnesp	Dept. of Statistics, Applied Mathematics and Computing, Universidade Estadual Paulista (UNESP)
dc.format.extent	27-32
dc.identifier	http://dx.doi.org/10.1145/2927006.2927010
dc.identifier.citation	MARMI 2016 - Proceedings of the 2016 ACM 1st International Workshop on Multimedia Analysis and Retrieval for Multimodal Interaction, co-located with ICMR 2016, p. 27-32.
dc.identifier.doi	10.1145/2927006.2927010
dc.identifier.scopus	2-s2.0-84978747065
dc.identifier.uri	http://hdl.handle.net/11449/168820
dc.language.iso	eng
dc.relation.ispartof	MARMI 2016 - Proceedings of the 2016 ACM 1st International Workshop on Multimedia Analysis and Retrieval for Multimodal Interaction, co-located with ICMR 2016
dc.rights.accessRights	Open access
dc.source	Scopus
dc.subject	Speaker recognition
dc.subject	Unsupervised learning
dc.subject	Vector quantization
dc.title	Effective speaker retrieval and recognition through vector quantization and unsupervised distance learning	en
dc.type	Conference paper
dspace.entity.type	Publication
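
The abstract describes computing distances among audio objects from MFCC features and a Vector Quantization model. The record does not give the exact formulation, so the following is only a minimal sketch under stated assumptions: each audio object is already a matrix of MFCC frames (random vectors stand in for real features here), the codebook is trained with plain k-means, and the distance is a symmetrised average quantization distortion. The names `train_codebook`, `distortion`, and `vq_distance` are hypothetical and are not taken from the paper.

```python
import numpy as np


def train_codebook(frames, k=4, iters=20, seed=0):
    """Fit a k-entry VQ codebook to feature frames with plain k-means (Lloyd) steps."""
    rng = np.random.default_rng(seed)
    # Initialise the codewords from k distinct frames.
    codebook = frames[rng.choice(len(frames), size=k, replace=False)].copy()
    for _ in range(iters):
        # Assign every frame to its nearest codeword (Euclidean distance).
        d = np.linalg.norm(frames[:, None, :] - codebook[None, :, :], axis=2)
        nearest = d.argmin(axis=1)
        for j in range(k):
            members = frames[nearest == j]
            if len(members):  # an empty cell keeps its previous codeword
                codebook[j] = members.mean(axis=0)
    return codebook


def distortion(frames, codebook):
    """Mean distance from each frame to its nearest codeword."""
    d = np.linalg.norm(frames[:, None, :] - codebook[None, :, :], axis=2)
    return float(d.min(axis=1).mean())


def vq_distance(frames_a, frames_b, k=4):
    """Symmetrised distortion between two audio objects (an assumed choice)."""
    a_vs_b = distortion(frames_a, train_codebook(frames_b, k))
    b_vs_a = distortion(frames_b, train_codebook(frames_a, k))
    return 0.5 * (a_vs_b + b_vs_a)


# Synthetic stand-ins for MFCC matrices (200 frames x 13 coefficients).
rng = np.random.default_rng(42)
speaker_x_1 = rng.normal(0.0, 1.0, size=(200, 13))
speaker_x_2 = rng.normal(0.0, 1.0, size=(200, 13))
speaker_y = rng.normal(5.0, 1.0, size=(200, 13))

same = vq_distance(speaker_x_1, speaker_x_2)
diff = vq_distance(speaker_x_1, speaker_y)
print(f"same-speaker distance: {same:.3f}, different-speaker distance: {diff:.3f}")
```

In a retrieval setting, sorting all audio objects by such a distance to a query yields the ranked list that a rank-based unsupervised method would then refine; that refinement step is not sketched here because the record does not name the method.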
