Mining scientific articles powered by machine learning techniques

Gulo, Carlos A.S.J.; Rúbio, Thiago R.P.M.; Tabassum, Shazia; Prado, Simone G.D. [UNESP]

dc.contributor.author	Gulo, Carlos A.S.J.
dc.contributor.author	Rúbio, Thiago R.P.M.
dc.contributor.author	Tabassum, Shazia
dc.contributor.author	Prado, Simone G.D. [UNESP]
dc.contributor.institution	Universidade do Porto
dc.contributor.institution	UNEMAT
dc.contributor.institution	LIAAD
dc.contributor.institution	Universidade Estadual Paulista (UNESP)
dc.date.accessioned	2022-04-28T19:03:32Z
dc.date.available	2022-04-28T19:03:32Z
dc.date.issued	2015-09-01
dc.description.abstract	Literature review is one of the most important phases of research. Scientists must identify the gaps and challenges in a given area, and the scientific literature, as the accumulated result of prior knowledge, should provide enough information to do so. The problem is where to find the best and most important articles that allow one to ascertain the state of the art in that specific domain. A feasible literature review consists of locating, appraising, and synthesising the best empirical evidence in the pool of available publications, guided by one or more research questions. Nevertheless, searching for interesting articles in electronic databases is not assured to retrieve the most relevant content. Indeed, existing search engines try to recommend articles only by looking for occurrences of given keywords. In fact, the relevance of a paper should depend on many other factors, such as adequacy to the theme, the specific tools used, or even the test strategy, which makes automatic recommendation of articles a challenging problem. Our approach allows researchers to browse huge article collections and quickly find the publications of particular interest by using machine learning techniques. The proposed solution automatically classifies and prioritises scientific papers by relevance. Using previous samples manually classified by domain experts, we apply a Naive Bayes classifier to predict relevant articles from real-world journal repositories such as IEEE Xplore or ACM Digital. Results suggest that our model can recommend, classify and rank the most relevant articles of a particular scientific field of interest. In our experiments, we achieved 98.22% accuracy in recommending articles that are present in an expert classification list, indicating a good prediction of relevance. The recommended papers are, at least, worth reading. We plan to extend our model to accept user filters and other inputs to improve predictions.	en
dc.description.affiliation	Departamento de Engenharia Informática Faculdade de Engenharia Universidade do Porto
dc.description.affiliation	PIXEL Research Group UNEMAT
dc.description.affiliation	LIACC - Artificial Intelligence and Computing Science Laboratory Universidade do Porto
dc.description.affiliation	LIAAD
dc.description.affiliation	Departamento de Computação Faculdade de Ciências Universidade Estadual Paulista
dc.description.affiliationUnesp	Departamento de Computação Faculdade de Ciências Universidade Estadual Paulista
dc.description.sponsorship	Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
dc.description.sponsorshipId	CAPES: BEX 1338/14-5
dc.format.extent	21-28
dc.identifier	http://dx.doi.org/10.4230/OASIcs.ICCSW.2015.21
dc.identifier.citation	OpenAccess Series in Informatics, v. 49, p. 21-28.
dc.identifier.doi	10.4230/OASIcs.ICCSW.2015.21
dc.identifier.issn	2190-6807
dc.identifier.scopus	2-s2.0-84965036752
dc.identifier.uri	http://hdl.handle.net/11449/220612
dc.language.iso	eng
dc.relation.ispartof	OpenAccess Series in Informatics
dc.source	Scopus
dc.subject	Machine learning
dc.subject	Ranking
dc.subject	Systematic literature review
dc.subject	Text categorisation
dc.subject	Text classification
dc.title	Mining scientific articles powered by machine learning techniques	en
dc.type	Conference paper	en
dspace.entity.type	Publication
unesp.campus	Universidade Estadual Paulista (UNESP), Faculdade de Ciências, Bauru	pt
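
The abstract above describes training a Naive Bayes classifier on expert-labelled samples and using it to classify and rank articles by relevance. This record does not give the features, training data, or implementation the authors used, so what follows is only a minimal sketch of the general technique: a multinomial Naive Bayes with add-one smoothing over word counts, written with the Python standard library. The class name `NaiveBayesRanker`, the labels `relevant` and `irrelevant`, and all example texts are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a string and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayesRanker:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing.

    Illustrative only; not the implementation described in the paper.
    """

    def fit(self, texts, labels):
        self.classes = sorted(set(labels))
        self.doc_counts = Counter(labels)
        self.word_counts = {c: Counter() for c in self.classes}
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = set()
        for counts in self.word_counts.values():
            self.vocab.update(counts)
        self.total_words = {
            c: sum(self.word_counts[c].values()) for c in self.classes
        }
        return self

    def log_scores(self, text):
        """Return the unnormalised log posterior of each class."""
        n_docs = sum(self.doc_counts.values())
        scores = {}
        for c in self.classes:
            score = math.log(self.doc_counts[c] / n_docs)  # class prior
            denom = self.total_words[c] + len(self.vocab)
            for tok in tokenize(text):
                if tok in self.vocab:  # skip words never seen in training
                    score += math.log((self.word_counts[c][tok] + 1) / denom)
            scores[c] = score
        return scores

    def prob(self, text, cls):
        """Return the normalised posterior probability of one class."""
        scores = self.log_scores(text)
        top = max(scores.values())  # subtract the max for numerical stability
        weights = {c: math.exp(s - top) for c, s in scores.items()}
        return weights[cls] / sum(weights.values())

    def rank(self, texts, cls="relevant"):
        """Sort texts from most to least probable for the given class."""
        return sorted(texts, key=lambda t: self.prob(t, cls), reverse=True)


# Hypothetical expert-labelled samples (titles only, for brevity).
train_texts = [
    "naive bayes text classification of scientific articles",
    "ranking papers for systematic literature review",
    "text categorisation with machine learning",
    "protein folding in yeast cells",
    "soil erosion after heavy rainfall",
    "market prices of agricultural commodities",
]
train_labels = ["relevant"] * 3 + ["irrelevant"] * 3

model = NaiveBayesRanker().fit(train_texts, train_labels)

# Hypothetical unlabelled candidates retrieved from a repository search.
candidates = [
    "rainfall and soil chemistry",
    "machine learning for ranking scientific articles",
]
ranked = model.rank(candidates)
for title in ranked:
    print(f"{model.prob(title, 'relevant'):.3f}  {title}")
```

Ranking by posterior probability, not by the hard class label alone, is one simple way to turn a classifier into a prioritised reading list. A real system would likely use abstracts or full text and a much larger labelled set.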

Collections

Bauru - FC - Faculdade de Ciências
