Integrating Audio Signal Processing and Deep Learning Algorithms for Gait Pattern Classification in Brazilian Gaited Horses
| dc.contributor.author | Alves, Anderson Antonio Carvalho | |
| dc.contributor.author | Andrietta, Lucas Tassoni | |
| dc.contributor.author | Lopes, Rafael Zinni | |
| dc.contributor.author | Bussiman, Fernando Oliveira | |
| dc.contributor.author | Silva, Fabyano Fonseca e | |
| dc.contributor.author | Carvalheiro, Roberto [UNESP] | |
| dc.contributor.author | Brito, Luiz Fernando | |
| dc.contributor.author | Balieiro, Júlio César de Carvalho | |
| dc.contributor.author | Albuquerque, Lucia Galvão [UNESP] | |
| dc.contributor.author | Ventura, Ricardo Vieira | |
| dc.contributor.institution | Science and Technology of Maranhão (IFMA) | |
| dc.contributor.institution | Universidade de São Paulo (USP) | |
| dc.contributor.institution | Federal University of Viçosa | |
| dc.contributor.institution | Universidade Estadual Paulista (UNESP) | |
| dc.contributor.institution | National Council for Scientific and Technological Development (CNPq) | |
| dc.contributor.institution | Purdue University | |
| dc.date.accessioned | 2025-04-29T18:07:49Z | |
| dc.date.issued | 2021-01-01 | |
| dc.description.abstract | This study focused on assessing the usefulness of using audio signal processing in the gaited horse industry. A total of 196 short-time audio files (4 s) were collected from video recordings of Brazilian gaited horses. These files were converted into waveform signals (196 samples by 80,000 columns) and divided into training (N = 164) and validation (N = 32) datasets. Twelve single-valued audio features were initially extracted to summarize the training data according to the gait patterns (Marcha Batida—MB and Marcha Picada—MP). After preliminary analyses, high-dimensional arrays of the Mel Frequency Cepstral Coefficients (MFCC), Onset Strength (OS), and Tempogram (TEMP) were extracted and used as input information in the classification algorithms. A principal component analysis (PCA) was performed using the 12 single-valued features set and each audio-feature dataset—AFD (MFCC, OS, and TEMP) for prior data visualization. Machine learning (random forest, RF; support vector machine, SVM) and deep learning (multilayer perceptron neural networks, MLP; convolution neural networks, CNN) algorithms were used to classify the gait types. A five-fold cross-validation scheme with 10 repetitions was employed for assessing the models' predictive performance. The classification performance across models and AFD was also validated with independent observations. The models and AFD were compared based on the classification accuracy (ACC), specificity (SPEC), sensitivity (SEN), and area under the curve (AUC). In the logistic regression analysis, five out of the 12 audio features extracted were significant (p < 0.05) between the gait types. ACC averages ranged from 0.806 to 0.932 for MFCC, from 0.758 to 0.948 for OS and, from 0.936 to 0.968 for TEMP. Overall, the TEMP dataset provided the best classification accuracies for all models. The most suitable method for audio-based horse gait pattern classification was CNN. Both cross and independent validation schemes confirmed that high values of ACC, SPEC, SEN, and AUC are expected for yet-to-be-observed labels, except for MFCC-based models, in which clear overfitting was observed. Using audio-generated data for describing gait phenotypes in Brazilian horses is a promising approach, as the two gait patterns were correctly distinguished. The highest classification performance was achieved by combining CNN and the rhythmic-descriptive AFD. | en |
| dc.description.affiliation | Department of Education Federal Institute of Education Science and Technology of Maranhão (IFMA) | |
| dc.description.affiliation | Department of Animal Nutrition and Production School of Veterinary Medicine and Animal Science University of São Paulo | |
| dc.description.affiliation | Department of Animal Science Federal University of Viçosa | |
| dc.description.affiliation | Department of Animal Science School of Agricultural and Veterinary Sciences Säo Paulo State University (UNESP) | |
| dc.description.affiliation | National Council for Scientific and Technological Development (CNPq) | |
| dc.description.affiliation | Department of Animal Sciences Purdue University | |
| dc.description.affiliationUnesp | Department of Animal Science School of Agricultural and Veterinary Sciences Säo Paulo State University (UNESP) | |
| dc.description.sponsorship | Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) | |
| dc.identifier | http://dx.doi.org/10.3389/fanim.2021.681557 | |
| dc.identifier.citation | Frontiers in Animal Science, v. 2. | |
| dc.identifier.doi | 10.3389/fanim.2021.681557 | |
| dc.identifier.issn | 2673-6225 | |
| dc.identifier.scopus | 2-s2.0-85131139764 | |
| dc.identifier.uri | https://hdl.handle.net/11449/297825 | |
| dc.language.iso | eng | |
| dc.relation.ispartof | Frontiers in Animal Science | |
| dc.source | Scopus | |
| dc.subject | audio-feature | |
| dc.subject | convolutional neural network | |
| dc.subject | four-beat gaited | |
| dc.subject | horse gait | |
| dc.subject | sound analysis | |
| dc.title | Integrating Audio Signal Processing and Deep Learning Algorithms for Gait Pattern Classification in Brazilian Gaited Horses | en |
| dc.type | Artigo | pt |
| dspace.entity.type | Publication | |
| relation.isOrgUnitOfPublication | 3d807254-e442-45e5-a80b-0f6bf3a26e48 | |
| relation.isOrgUnitOfPublication.latestForDiscovery | 3d807254-e442-45e5-a80b-0f6bf3a26e48 | |
| unesp.campus | Universidade Estadual Paulista (UNESP), Faculdade de Ciências Agrárias e Veterinárias, Jaboticabal | pt |

