Publicação: PetroBERT: A Domain Adaptation Language Model for Oil and Gas Applications in Portuguese
dc.contributor.author | Rodrigues, Rafael B. M. [UNESP] | |
dc.contributor.author | Privatto, Pedro I. M. [UNESP] | |
dc.contributor.author | de Sousa, Gustavo José [UNESP] | |
dc.contributor.author | Murari, Rafael P. [UNESP] | |
dc.contributor.author | Afonso, Luis C. S. [UNESP] | |
dc.contributor.author | Papa, João P. [UNESP] | |
dc.contributor.author | Pedronette, Daniel C. G. [UNESP] | |
dc.contributor.author | Guilherme, Ivan R. [UNESP] | |
dc.contributor.author | Perrout, Stephan R. | |
dc.contributor.author | Riente, Aliel F. | |
dc.contributor.institution | Universidade Estadual Paulista (UNESP) | |
dc.contributor.institution | Petróleo Brasileiro S.A. - Petrobras | |
dc.contributor.institution | Centro de Pesquisas da Petróleo Brasileiro S.A. - CENPES/Petrobras | |
dc.date.accessioned | 2022-05-01T15:46:21Z | |
dc.date.available | 2022-05-01T15:46:21Z | |
dc.date.issued | 2022-01-01 | |
dc.description.abstract | This work proposes the PetroBERT, which is a BERT-based model adapted to the oil and gas exploration domain in Portuguese. PetroBERT was pre-trained using the Petrolês corpus and a private daily drilling report corpus over BERT multilingual and BERTimbau. The proposed model was evaluated in the NER and sentence classification tasks and achieved interesting results, which shows its potential for such a domain. To the best of our knowledge, this is the first BERT-based model to the oil and gas context. | en |
dc.description.affiliation | UNESP - São Paulo State University School of Technology and Sciences | |
dc.description.affiliation | UNESP - São Paulo State University Institute of Geosciences and Exact Sciences | |
dc.description.affiliation | UNESP - São Paulo State University School of Sciences | |
dc.description.affiliation | Petróleo Brasileiro S.A. - Petrobras | |
dc.description.affiliation | Centro de Pesquisas da Petróleo Brasileiro S.A. - CENPES/Petrobras | |
dc.description.affiliationUnesp | UNESP - São Paulo State University School of Technology and Sciences | |
dc.description.affiliationUnesp | UNESP - São Paulo State University Institute of Geosciences and Exact Sciences | |
dc.description.affiliationUnesp | UNESP - São Paulo State University School of Sciences | |
dc.format.extent | 101-109 | |
dc.identifier | http://dx.doi.org/10.1007/978-3-030-98305-5_10 | |
dc.identifier.citation | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 13208 LNAI, p. 101-109. | |
dc.identifier.doi | 10.1007/978-3-030-98305-5_10 | |
dc.identifier.issn | 1611-3349 | |
dc.identifier.issn | 0302-9743 | |
dc.identifier.scopus | 2-s2.0-85127159496 | |
dc.identifier.uri | http://hdl.handle.net/11449/234320 | |
dc.language.iso | eng | |
dc.relation.ispartof | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | |
dc.source | Scopus | |
dc.subject | BERT | |
dc.subject | Domain adaption | |
dc.subject | Oil and gas | |
dc.title | PetroBERT: A Domain Adaptation Language Model for Oil and Gas Applications in Portuguese | en |
dc.type | Trabalho apresentado em evento | |
dspace.entity.type | Publication | |
unesp.author.orcid | 0000-0001-8776-4529[1] | |
unesp.author.orcid | 0000-0003-0567-4082[2] | |
unesp.author.orcid | 0000-0001-8407-4901[3] | |
unesp.author.orcid | 0000-0002-5543-3896[5] | |
unesp.author.orcid | 0000-0002-6494-7514[6] | |
unesp.author.orcid | 0000-0002-2867-4838[7] | |
unesp.author.orcid | 0000-0002-3610-3779[8] | |
unesp.campus | Universidade Estadual Paulista (UNESP), Faculdade de Ciências, Bauru | pt |
unesp.department | Computação - FC | pt |