Integrating Audio Signal Processing and Deep Learning Algorithms for Gait Pattern Classification in Brazilian Gaited Horses

Alves, Anderson Antonio Carvalho; Andrietta, Lucas Tassoni; Lopes, Rafael Zinni; Bussiman, Fernando Oliveira; Silva, Fabyano Fonseca e; Carvalheiro, Roberto [UNESP]; Brito, Luiz Fernando; Balieiro, Júlio César de Carvalho; Albuquerque, Lucia Galvão [UNESP]; Ventura, Ricardo Vieira

doi:10.3389/fanim.2021.681557

Integrating Audio Signal Processing and Deep Learning Algorithms for Gait Pattern Classification in Brazilian Gaited Horses

dc.contributor.author	Alves, Anderson Antonio Carvalho
dc.contributor.author	Andrietta, Lucas Tassoni
dc.contributor.author	Lopes, Rafael Zinni
dc.contributor.author	Bussiman, Fernando Oliveira
dc.contributor.author	Silva, Fabyano Fonseca e
dc.contributor.author	Carvalheiro, Roberto [UNESP]
dc.contributor.author	Brito, Luiz Fernando
dc.contributor.author	Balieiro, Júlio César de Carvalho
dc.contributor.author	Albuquerque, Lucia Galvão [UNESP]
dc.contributor.author	Ventura, Ricardo Vieira
dc.contributor.institution	Science and Technology of Maranhão (IFMA)
dc.contributor.institution	Universidade de São Paulo (USP)
dc.contributor.institution	Federal University of Viçosa
dc.contributor.institution	Universidade Estadual Paulista (UNESP)
dc.contributor.institution	National Council for Scientific and Technological Development (CNPq)
dc.contributor.institution	Purdue University
dc.date.accessioned	2025-04-29T18:07:49Z
dc.date.issued	2021-01-01
dc.description.abstract	This study focused on assessing the usefulness of using audio signal processing in the gaited horse industry. A total of 196 short-time audio files (4 s) were collected from video recordings of Brazilian gaited horses. These files were converted into waveform signals (196 samples by 80,000 columns) and divided into training (N = 164) and validation (N = 32) datasets. Twelve single-valued audio features were initially extracted to summarize the training data according to the gait patterns (Marcha Batida—MB and Marcha Picada—MP). After preliminary analyses, high-dimensional arrays of the Mel Frequency Cepstral Coefficients (MFCC), Onset Strength (OS), and Tempogram (TEMP) were extracted and used as input information in the classification algorithms. A principal component analysis (PCA) was performed using the 12 single-valued features set and each audio-feature dataset—AFD (MFCC, OS, and TEMP) for prior data visualization. Machine learning (random forest, RF; support vector machine, SVM) and deep learning (multilayer perceptron neural networks, MLP; convolution neural networks, CNN) algorithms were used to classify the gait types. A five-fold cross-validation scheme with 10 repetitions was employed for assessing the models' predictive performance. The classification performance across models and AFD was also validated with independent observations. The models and AFD were compared based on the classification accuracy (ACC), specificity (SPEC), sensitivity (SEN), and area under the curve (AUC). In the logistic regression analysis, five out of the 12 audio features extracted were significant (p < 0.05) between the gait types. ACC averages ranged from 0.806 to 0.932 for MFCC, from 0.758 to 0.948 for OS and, from 0.936 to 0.968 for TEMP. Overall, the TEMP dataset provided the best classification accuracies for all models. The most suitable method for audio-based horse gait pattern classification was CNN. Both cross and independent validation schemes confirmed that high values of ACC, SPEC, SEN, and AUC are expected for yet-to-be-observed labels, except for MFCC-based models, in which clear overfitting was observed. Using audio-generated data for describing gait phenotypes in Brazilian horses is a promising approach, as the two gait patterns were correctly distinguished. The highest classification performance was achieved by combining CNN and the rhythmic-descriptive AFD.	en
dc.description.affiliation	Department of Education Federal Institute of Education Science and Technology of Maranhão (IFMA)
dc.description.affiliation	Department of Animal Nutrition and Production School of Veterinary Medicine and Animal Science University of São Paulo
dc.description.affiliation	Department of Animal Science Federal University of Viçosa
dc.description.affiliation	Department of Animal Science School of Agricultural and Veterinary Sciences Säo Paulo State University (UNESP)
dc.description.affiliation	National Council for Scientific and Technological Development (CNPq)
dc.description.affiliation	Department of Animal Sciences Purdue University
dc.description.affiliationUnesp	Department of Animal Science School of Agricultural and Veterinary Sciences Säo Paulo State University (UNESP)
dc.description.sponsorship	Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
dc.identifier	http://dx.doi.org/10.3389/fanim.2021.681557
dc.identifier.citation	Frontiers in Animal Science, v. 2.
dc.identifier.doi	10.3389/fanim.2021.681557
dc.identifier.issn	2673-6225
dc.identifier.scopus	2-s2.0-85131139764
dc.identifier.uri	https://hdl.handle.net/11449/297825
dc.language.iso	eng
dc.relation.ispartof	Frontiers in Animal Science
dc.source	Scopus
dc.subject	audio-feature
dc.subject	convolutional neural network
dc.subject	four-beat gaited
dc.subject	horse gait
dc.subject	sound analysis
dc.title	Integrating Audio Signal Processing and Deep Learning Algorithms for Gait Pattern Classification in Brazilian Gaited Horses	en
dc.type	Artigo	pt
dspace.entity.type	Publication
relation.isOrgUnitOfPublication	3d807254-e442-45e5-a80b-0f6bf3a26e48
relation.isOrgUnitOfPublication.latestForDiscovery	3d807254-e442-45e5-a80b-0f6bf3a26e48
unesp.campus	Universidade Estadual Paulista (UNESP), Faculdade de Ciências Agrárias e Veterinárias, Jaboticabal	pt

Coleções

Jaboticabal - FCAV - Faculdade de Ciências Agrárias e Veterinárias

Integrating Audio Signal Processing and Deep Learning Algorithms for Gait Pattern Classification in Brazilian Gaited Horses

Arquivos

Coleções