Natural Language Processing to Extract Information from Portuguese-Language Medical Records
dc.contributor.author | da Rocha, Naila Camila [UNESP] | |
dc.contributor.author | Barbosa, Abner Macola Pacheco [UNESP] | |
dc.contributor.author | Schnr, Yaron Oliveira [UNESP] | |
dc.contributor.author | Machado-Rugolo, Juliana [UNESP] | |
dc.contributor.author | de Andrade, Luis Gustavo Modelli [UNESP] | |
dc.contributor.author | Corrente, José Eduardo | |
dc.contributor.author | de Arruda Silveira, Liciana Vaz [UNESP] | |
dc.contributor.institution | Universidade Estadual Paulista (UNESP) | |
dc.contributor.institution | Fundação para o Desenvolvimento Médico e Hospitalar (FAMESP) | |
dc.date.accessioned | 2023-07-29T12:48:26Z | |
dc.date.available | 2023-07-29T12:48:26Z | |
dc.date.issued | 2023-01-01 | |
dc.description.abstract | Studies that use medical records are often impeded due to the information presented in narrative fields. However, recent studies have used artificial intelligence to extract and process secondary health data from electronic medical records. The aim of this study was to develop a neural network that uses data from unstructured medical records to capture information regarding symptoms, diagnoses, medications, conditions, exams, and treatment. Data from 30,000 medical records of patients hospitalized in the Clinical Hospital of the Botucatu Medical School (HCFMB), São Paulo, Brazil, were obtained, creating a corpus with 1200 clinical texts. A natural language algorithm for text extraction and convolutional neural networks for pattern recognition were used to evaluate the model with goodness-of-fit indices. The results showed good accuracy, considering the complexity of the model, with an F-score of 63.9% and a precision of 72.7%. The patient condition class reached a precision of 90.3% and the medication class reached 87.5%. The proposed neural network will facilitate the detection of relationships between diseases and symptoms and prevalence and incidence, in addition to detecting the identification of clinical conditions, disease evolution, and the effects of prescribed medications. | en |
dc.description.affiliation | Department of Biostatistics Institute of Biosciences Universidade Estadual Paulista (UNESP) | |
dc.description.affiliation | Medical School Universidade Estadual Paulista (UNESP) | |
dc.description.affiliation | Health Technology Assessment Center (Clinical Hospital of the Botucatu Medical School) | |
dc.description.affiliation | Research Support Office Fundação para o Desenvolvimento Médico e Hospitalar (FAMESP) | |
dc.description.affiliationUnesp | Department of Biostatistics Institute of Biosciences Universidade Estadual Paulista (UNESP) | |
dc.description.affiliationUnesp | Medical School Universidade Estadual Paulista (UNESP) | |
dc.description.affiliationUnesp | Health Technology Assessment Center (Clinical Hospital of the Botucatu Medical School) | |
dc.identifier | http://dx.doi.org/10.3390/data8010011 | |
dc.identifier.citation | Data, v. 8, n. 1, 2023. | |
dc.identifier.doi | 10.3390/data8010011 | |
dc.identifier.issn | 2306-5729 | |
dc.identifier.scopus | 2-s2.0-85146769393 | |
dc.identifier.uri | http://hdl.handle.net/11449/246711 | |
dc.language.iso | eng | |
dc.relation.ispartof | Data | |
dc.source | Scopus | |
dc.subject | medical records | |
dc.subject | named entity recognition | |
dc.subject | neural networks | |
dc.title | Natural Language Processing to Extract Information from Portuguese-Language Medical Records | en |
dc.type | Artigo | |
unesp.author.orcid | 0000-0002-1684-2574[1] | |
unesp.author.orcid | 0000-0003-3668-8911[2] | |
unesp.author.orcid | 0000-0003-3984-4959[4] | |
unesp.author.orcid | 0000-0001-8931-5495[7] |