Logotipo do repositório
 

Publicação:
PetroBERT: A Domain Adaptation Language Model for Oil and Gas Applications in Portuguese

dc.contributor.authorRodrigues, Rafael B. M. [UNESP]
dc.contributor.authorPrivatto, Pedro I. M. [UNESP]
dc.contributor.authorde Sousa, Gustavo José [UNESP]
dc.contributor.authorMurari, Rafael P. [UNESP]
dc.contributor.authorAfonso, Luis C. S. [UNESP]
dc.contributor.authorPapa, João P. [UNESP]
dc.contributor.authorPedronette, Daniel C. G. [UNESP]
dc.contributor.authorGuilherme, Ivan R. [UNESP]
dc.contributor.authorPerrout, Stephan R.
dc.contributor.authorRiente, Aliel F.
dc.contributor.institutionUniversidade Estadual Paulista (UNESP)
dc.contributor.institutionPetróleo Brasileiro S.A. - Petrobras
dc.contributor.institutionCentro de Pesquisas da Petróleo Brasileiro S.A. - CENPES/Petrobras
dc.date.accessioned2022-05-01T15:46:21Z
dc.date.available2022-05-01T15:46:21Z
dc.date.issued2022-01-01
dc.description.abstractThis work proposes the PetroBERT, which is a BERT-based model adapted to the oil and gas exploration domain in Portuguese. PetroBERT was pre-trained using the Petrolês corpus and a private daily drilling report corpus over BERT multilingual and BERTimbau. The proposed model was evaluated in the NER and sentence classification tasks and achieved interesting results, which shows its potential for such a domain. To the best of our knowledge, this is the first BERT-based model to the oil and gas context.en
dc.description.affiliationUNESP - São Paulo State University School of Technology and Sciences
dc.description.affiliationUNESP - São Paulo State University Institute of Geosciences and Exact Sciences
dc.description.affiliationUNESP - São Paulo State University School of Sciences
dc.description.affiliationPetróleo Brasileiro S.A. - Petrobras
dc.description.affiliationCentro de Pesquisas da Petróleo Brasileiro S.A. - CENPES/Petrobras
dc.description.affiliationUnespUNESP - São Paulo State University School of Technology and Sciences
dc.description.affiliationUnespUNESP - São Paulo State University Institute of Geosciences and Exact Sciences
dc.description.affiliationUnespUNESP - São Paulo State University School of Sciences
dc.format.extent101-109
dc.identifierhttp://dx.doi.org/10.1007/978-3-030-98305-5_10
dc.identifier.citationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 13208 LNAI, p. 101-109.
dc.identifier.doi10.1007/978-3-030-98305-5_10
dc.identifier.issn1611-3349
dc.identifier.issn0302-9743
dc.identifier.scopus2-s2.0-85127159496
dc.identifier.urihttp://hdl.handle.net/11449/234320
dc.language.isoeng
dc.relation.ispartofLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
dc.sourceScopus
dc.subjectBERT
dc.subjectDomain adaption
dc.subjectOil and gas
dc.titlePetroBERT: A Domain Adaptation Language Model for Oil and Gas Applications in Portugueseen
dc.typeTrabalho apresentado em evento
dspace.entity.typePublication
unesp.author.orcid0000-0001-8776-4529[1]
unesp.author.orcid0000-0003-0567-4082[2]
unesp.author.orcid0000-0001-8407-4901[3]
unesp.author.orcid0000-0002-5543-3896[5]
unesp.author.orcid0000-0002-6494-7514[6]
unesp.author.orcid0000-0002-2867-4838[7]
unesp.author.orcid0000-0002-3610-3779[8]
unesp.campusUniversidade Estadual Paulista (UNESP), Faculdade de Ciências, Baurupt
unesp.departmentComputação - FCpt

Arquivos