When External Knowledge Does Not Aggregate in Named Entity Recognition

dc.contributor.authorPrivatto, Pedro Ivo Monteiro [UNESP]
dc.contributor.authorGuilherme, Ivan Rizzo [UNESP]
dc.contributor.institutionUniversidade Estadual Paulista (UNESP)
dc.date.accessioned2022-05-01T11:54:06Z
dc.date.available2022-05-01T11:54:06Z
dc.date.issued2021-01-01
dc.description.abstractIn the different areas of knowledge, textual data are important sources of information. This way, Information Extraction methods have been developed to identify and structure information present in textual documents. In particular there is the Named Entity Recognition (NER) task, which consists of using methods to identify Named Entities, such as Person, Place, among others, in texts, using techniques from Natural Language Processing and Machine Learning. Recent works explored the use of external sources of knowledge to boost the Machine Learning models with sets of domain specific relevant information for the NER task. This work aims to evaluate the aggregation of external knowledge, in the form of Gazetter and Knowledge Graphs, for NER task. Our approach is composed of two steps: i) generation of embeddings, ii) definition and training of the Machine Learning methods. The experiments were conducted on four English datasets, and their results show that the applied strategies for external knowledge integration did not bring great gains to the models, as expressed by F1-Score metric. In the performed experiments, there was an F1-score increase in 17 of the 32 cases where external knowledge was used, but in most cases the gains were lesser than 0.5% in F1-score. In some scenarios the aggregated external knowledge does not capture relevant content, thus not being necessarily beneficial to the methodology.en
dc.description.affiliationInstitute of Geosciences and Exact Sciences UNESP - São Paulo State University, SP
dc.description.affiliationUnespInstitute of Geosciences and Exact Sciences UNESP - São Paulo State University, SP
dc.format.extent616-627
dc.identifierhttp://dx.doi.org/10.1007/978-3-030-91699-2_42
dc.identifier.citationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 13074 LNAI, p. 616-627.
dc.identifier.doi10.1007/978-3-030-91699-2_42
dc.identifier.issn1611-3349
dc.identifier.issn0302-9743
dc.identifier.scopus2-s2.0-85121798857
dc.identifier.urihttp://hdl.handle.net/11449/233938
dc.language.isoeng
dc.relation.ispartofLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
dc.sourceScopus
dc.subjectInformation extraction
dc.subjectKnowledge embeddings
dc.subjectNamed entity recognition
dc.titleWhen External Knowledge Does Not Aggregate in Named Entity Recognitionen
dc.typeTrabalho apresentado em evento
unesp.campusUniversidade Estadual Paulista (Unesp), Instituto de Geociências e Ciências Exatas, Rio Claropt
unesp.departmentEstatística, Matemática Aplicada e Computação - IGCEpt

Arquivos