An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection

Contreras, Rodrigo Colnago [UNESP]; Viana, Monique Simplicio; Fonseca, Everthon Silva; dos Santos, Francisco Lledo; Zanin, Rodrigo Bruno; Guido, Rodrigo Capobianco [UNESP]

An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection

dc.contributor.author	Contreras, Rodrigo Colnago [UNESP]
dc.contributor.author	Viana, Monique Simplicio
dc.contributor.author	Fonseca, Everthon Silva
dc.contributor.author	dos Santos, Francisco Lledo
dc.contributor.author	Zanin, Rodrigo Bruno
dc.contributor.author	Guido, Rodrigo Capobianco [UNESP]
dc.contributor.institution	Universidade Estadual Paulista (UNESP)
dc.contributor.institution	Federal Institute of São Paulo
dc.contributor.institution	Mato Grosso State University
dc.date.accessioned	2023-07-29T16:16:17Z
dc.date.available	2023-07-29T16:16:17Z
dc.date.issued	2023-06-01
dc.description.abstract	Biometrics-based authentication has become the most well-established form of user recognition in systems that demand a certain level of security. For example, the most commonplace social activities stand out, such as access to the work environment or to one’s own bank account. Among all biometrics, voice receives special attention due to factors such as ease of collection, the low cost of reading devices, and the high quantity of literature and software packages available for use. However, these biometrics may have the ability to represent the individual impaired by the phenomenon known as dysphonia, which consists of a change in the sound signal due to some disease that acts on the vocal apparatus. As a consequence, for example, a user with the flu may not be properly authenticated by the recognition system. Therefore, it is important that automatic voice dysphonia detection techniques be developed. In this work, we propose a new framework based on the representation of the voice signal by the multiple projection of cepstral coefficients to promote the detection of dysphonic alterations in the voice through machine learning techniques. Most of the best-known cepstral coefficient extraction techniques in the literature are mapped and analyzed separately and together with measures related to the fundamental frequency of the voice signal, and its representation capacity is evaluated on three classifiers. Finally, the experiments on a subset of the Saarbruecken Voice Database prove the effectiveness of the proposed material in detecting the presence of dysphonia in the voice.	en
dc.description.affiliation	Department of Computer Science and Statistics Institute of Biosciences Letters and Exact Sciences São Paulo State University, SP
dc.description.affiliation	Federal Institute of São Paulo, SP
dc.description.affiliation	Faculty of Architecture and Engineering Mato Grosso State University, MT
dc.description.affiliationUnesp	Department of Computer Science and Statistics Institute of Biosciences Letters and Exact Sciences São Paulo State University, SP
dc.identifier	http://dx.doi.org/10.3390/s23115196
dc.identifier.citation	Sensors, v. 23, n. 11, 2023.
dc.identifier.doi	10.3390/s23115196
dc.identifier.issn	1424-8220
dc.identifier.scopus	2-s2.0-85161510694
dc.identifier.uri	http://hdl.handle.net/11449/250048
dc.language.iso	eng
dc.relation.ispartof	Sensors
dc.source	Scopus
dc.subject	cepstral analysis
dc.subject	dysphonia detection
dc.subject	machine learning
dc.subject	pattern recognition
dc.subject	voice disorder detection
dc.title	An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection	en
dc.type	Artigo
unesp.author.orcid	0000-0003-4003-7791[1]
unesp.author.orcid	0000-0002-2960-8293[2]
unesp.author.orcid	0000-0001-6202-0806[3]
unesp.author.orcid	0000-0002-7718-8203[4]
unesp.author.orcid	0000-0002-4990-0056[5]
unesp.author.orcid	0000-0002-0924-8024[6]
unesp.campus	Universidade Estadual Paulista (Unesp), Instituto de Biociências Letras e Ciências Exatas, São José do Rio Preto	pt
unesp.department	Ciências da Computação e Estatística - IBILCE	pt

Coleções

São José do Rio Preto - IBILCE - Instituto de Biociências, Letras e Ciências Exatas

An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection

Arquivos

Coleções