An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection

Contreras, Rodrigo Colnago [UNESP]; Viana, Monique Simplicio; Fonseca, Everthon Silva; dos Santos, Francisco Lledo; Zanin, Rodrigo Bruno; Guido, Rodrigo Capobianco [UNESP]

An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection

Data

2023-06-01

Autores

Contreras, Rodrigo Colnago

Viana, Monique Simplicio

Fonseca, Everthon Silva

dos Santos, Francisco Lledo

Zanin, Rodrigo Bruno

Guido, Rodrigo Capobianco

Tipo

Artigo

Resumo

Biometrics-based authentication has become the most well-established form of user recognition in systems that demand a certain level of security. For example, the most commonplace social activities stand out, such as access to the work environment or to one’s own bank account. Among all biometrics, voice receives special attention due to factors such as ease of collection, the low cost of reading devices, and the high quantity of literature and software packages available for use. However, these biometrics may have the ability to represent the individual impaired by the phenomenon known as dysphonia, which consists of a change in the sound signal due to some disease that acts on the vocal apparatus. As a consequence, for example, a user with the flu may not be properly authenticated by the recognition system. Therefore, it is important that automatic voice dysphonia detection techniques be developed. In this work, we propose a new framework based on the representation of the voice signal by the multiple projection of cepstral coefficients to promote the detection of dysphonic alterations in the voice through machine learning techniques. Most of the best-known cepstral coefficient extraction techniques in the literature are mapped and analyzed separately and together with measures related to the fundamental frequency of the voice signal, and its representation capacity is evaluated on three classifiers. Finally, the experiments on a subset of the Saarbruecken Voice Database prove the effectiveness of the proposed material in detecting the presence of dysphonia in the voice.

Palavras-chave

cepstral analysis, dysphonia detection, machine learning, pattern recognition, voice disorder detection

Idioma

Inglês

Como citar

Sensors, v. 23, n. 11, 2023.

URI

http://hdl.handle.net/11449/250048

Coleções

São José do Rio Preto - IBILCE - Instituto de Biociências, Letras e Ciências Exatas

Página do item completo

An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection

Data

Autores

Orientador

Coorientador

Pós-graduação

Curso de graduação

Título da Revista

ISSN da Revista

Título de Volume

Editor

Tipo

Direito de acesso

Resumo

Descrição

Palavras-chave

Idioma

Como citar

URI

Itens relacionados

Financiadores

Coleções