Residual Neural Network precisely quantifies dysarthria severity-level based on short-duration speech segments
dc.contributor.author | Gupta, Siddhant | |
dc.contributor.author | Patil, Ankur T. | |
dc.contributor.author | Purohit, Mirali | |
dc.contributor.author | Parmar, Mihir | |
dc.contributor.author | Patel, Maitreya | |
dc.contributor.author | Patil, Hemant A. | |
dc.contributor.author | Guido, Rodrigo Capobianco [UNESP] | |
dc.contributor.institution | Dhirubhai Ambani Institute of Information and Communication Technology (DA-IICT) | |
dc.contributor.institution | Arizona State University | |
dc.contributor.institution | Universidade Estadual Paulista (Unesp) | |
dc.date.accessioned | 2021-06-25T11:12:50Z | |
dc.date.available | 2021-06-25T11:12:50Z | |
dc.date.issued | 2021-07-01 | |
dc.description.abstract | Recently, we have witnessed Deep Learning methodologies gaining significant attention for severity-based classification of dysarthric speech. Detecting dysarthria, quantifying its severity, are of paramount importance in various real-life applications, such as the assessment of patients’ progression in treatments, which includes an adequate planning of their therapy and the improvement of speech-based interactive systems in order to handle pathologically-affected voices automatically. Notably, current speech-powered tools often deal with short-duration speech segments and, consequently, are less efficient in dealing with impaired speech, even by using Convolutional Neural Networks (CNNs). Thus, detecting dysarthria severity-level based on short speech segments might help in improving the performance and applicability of those systems. To achieve this goal, we propose a novel Residual Network (ResNet)-based technique which receives short-duration speech segments as input. Statistically meaningful objective analysis of our experiments, reported over standard Universal Access corpus, exhibits average values of 21.35% and 22.48% improvement, compared to the baseline CNN, in terms of classification accuracy and F1-score, respectively. For additional comparisons, tests with Gaussian Mixture Models and Light CNNs were also performed. Overall, the values of 98.90% and 98.00% for classification accuracy and F1-score, respectively, were obtained with the proposed ResNet approach, confirming its efficacy and reassuring its practical applicability. | en |
dc.description.affiliation | Speech Research Lab Dhirubhai Ambani Institute of Information and Communication Technology (DA-IICT) | |
dc.description.affiliation | Arizona State University | |
dc.description.affiliation | Instituto de Biociências Letras e Ciências Exatas Unesp - Univ Estadual Paulista (São Paulo State University), Rua Cristóvão Colombo 2265, Jd Nazareth | |
dc.description.affiliationUnesp | Instituto de Biociências Letras e Ciências Exatas Unesp - Univ Estadual Paulista (São Paulo State University), Rua Cristóvão Colombo 2265, Jd Nazareth | |
dc.description.sponsorship | Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP) | |
dc.description.sponsorship | Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) | |
dc.description.sponsorshipId | FAPESP: 2019/04475-0 | |
dc.description.sponsorshipId | CNPq: 306808/2018-8 | |
dc.format.extent | 105-117 | |
dc.identifier | http://dx.doi.org/10.1016/j.neunet.2021.02.008 | |
dc.identifier.citation | Neural Networks, v. 139, p. 105-117. | |
dc.identifier.doi | 10.1016/j.neunet.2021.02.008 | |
dc.identifier.issn | 1879-2782 | |
dc.identifier.issn | 0893-6080 | |
dc.identifier.scopus | 2-s2.0-85102061061 | |
dc.identifier.uri | http://hdl.handle.net/11449/208481 | |
dc.language.iso | eng | |
dc.relation.ispartof | Neural Networks | |
dc.source | Scopus | |
dc.subject | CNN | |
dc.subject | Dysarthria | |
dc.subject | ResNet | |
dc.subject | Severity-level | |
dc.subject | Short-speech segments | |
dc.title | Residual Neural Network precisely quantifies dysarthria severity-level based on short-duration speech segments | en |
dc.type | Artigo | |
unesp.author.orcid | 0000-0002-0924-8024[7] | |
unesp.campus | Universidade Estadual Paulista (Unesp), Instituto de Biociências Letras e Ciências Exatas, São José do Rio Preto | pt |
unesp.department | Ciências da Computação e Estatística - IBILCE | pt |