Logotipo do repositório
 

Publicação:
The nature of scientific datasets in South American repositories: a survey of formats and extensions

dc.contributor.authorRodrigues, Marcello Mundim
dc.contributor.authorLourenco, Cintia de Azevedo
dc.contributor.authorDias, Guilherme Ataide [UNESP]
dc.contributor.institutionUniversidade Federal de Minas Gerais (UFMG)
dc.contributor.institutionUniv Fed Paraiba
dc.contributor.institutionUniversidade Estadual Paulista (UNESP)
dc.date.accessioned2022-11-30T13:38:57Z
dc.date.available2022-11-30T13:38:57Z
dc.date.issued2022-01-01
dc.description.abstractObjective: identifying the scientific data repositories created and managed by Higher Education Institutions and/or South American research and funding agencies; identifying and describing the formats and extensions of files that compile the scientific datasets deposited in these repositories. Methods: eight repositories retrieved by RE3DATA were selected for investigation. A population (N) of 1.115 scientific datasets was obtained. By using Stratified Random Sampling, the resulting sample (n) value was 258 datasets, which corresponds to 23,15% of the population (N). Data surveyed from the samples were condensed into tables and charts. Results: it was noticed that the nature of the scientific datasets investigated is centered on textual and numerical data, saved in text files and tables, respectively. Also, the datasets may be either homogeneous (one or more files saved in a unique format and extension, e.g.: image format in.jpg) or heterogeneous (files saved in different formats and extensions, content of the data, as observed in the .gpx and gdb extensions, which refer to geospatial data, therefore, alphanumeric data. Conclusions: There is a growing need of describing the nature of data, as well as the formats and extensions of files. This kind of descriptive metadata would be valuable to potential users, as it would allow a greater understanding of the context of the data, focusing on data reuse.en
dc.description.affiliationUniv Fed Minas Gerais, Doutorando Gestao & Org Conhecimento, Belo Horizonte, MG, Brazil
dc.description.affiliationUniv Fed Minas Gerais, Ciencia Informacao, Belo Horizonte, MG, Brazil
dc.description.affiliationUniv Fed Minas Gerais, Escola Ciencia Informacao, Belo Horizonte, MG, Brazil
dc.description.affiliationUniv Fed Paraiba, Dept Ciencia Informacao, Joao Pessoa, Paraiba, Brazil
dc.description.affiliationUniv Estadual Paulista, Ciencia Informacao, Sao Paulo, Brazil
dc.description.affiliationUnespUniv Estadual Paulista, Ciencia Informacao, Sao Paulo, Brazil
dc.format.extent26
dc.identifierhttp://dx.doi.org/10.5007/1518-2924.2022.e85148
dc.identifier.citationEncontros Bibli-revista Eletronica De Biblioteconomia E Ciencia Da Informacao. Florianopolis: Univ Federal Santa Catarina, v. 27, 26 p., 2022.
dc.identifier.doi10.5007/1518-2924.2022.e85148
dc.identifier.issn1518-2924
dc.identifier.urihttp://hdl.handle.net/11449/237571
dc.identifier.wosWOS:000804414500004
dc.language.isopor
dc.publisherUniv Federal Santa Catarina
dc.relation.ispartofEncontros Bibli-revista Eletronica De Biblioteconomia E Ciencia Da Informacao
dc.sourceWeb of Science
dc.subjectScientific data
dc.subjectDatasets
dc.subjectData repositories
dc.subjectFormats and extensions
dc.subjectSurvey
dc.titleThe nature of scientific datasets in South American repositories: a survey of formats and extensionsen
dc.typeArtigopt
dcterms.rightsHolderUniv Federal Santa Catarina
dspace.entity.typePublication
unesp.campusUniversidade Estadual Paulista (UNESP), Faculdade de Filosofia e Ciências, Maríliapt

Arquivos