Publicação: The nature of scientific datasets in South American repositories: a survey of formats and extensions
dc.contributor.author | Rodrigues, Marcello Mundim | |
dc.contributor.author | Lourenco, Cintia de Azevedo | |
dc.contributor.author | Dias, Guilherme Ataide [UNESP] | |
dc.contributor.institution | Universidade Federal de Minas Gerais (UFMG) | |
dc.contributor.institution | Univ Fed Paraiba | |
dc.contributor.institution | Universidade Estadual Paulista (UNESP) | |
dc.date.accessioned | 2022-11-30T13:38:57Z | |
dc.date.available | 2022-11-30T13:38:57Z | |
dc.date.issued | 2022-01-01 | |
dc.description.abstract | Objective: identifying the scientific data repositories created and managed by Higher Education Institutions and/or South American research and funding agencies; identifying and describing the formats and extensions of files that compile the scientific datasets deposited in these repositories. Methods: eight repositories retrieved by RE3DATA were selected for investigation. A population (N) of 1.115 scientific datasets was obtained. By using Stratified Random Sampling, the resulting sample (n) value was 258 datasets, which corresponds to 23,15% of the population (N). Data surveyed from the samples were condensed into tables and charts. Results: it was noticed that the nature of the scientific datasets investigated is centered on textual and numerical data, saved in text files and tables, respectively. Also, the datasets may be either homogeneous (one or more files saved in a unique format and extension, e.g.: image format in.jpg) or heterogeneous (files saved in different formats and extensions, content of the data, as observed in the .gpx and gdb extensions, which refer to geospatial data, therefore, alphanumeric data. Conclusions: There is a growing need of describing the nature of data, as well as the formats and extensions of files. This kind of descriptive metadata would be valuable to potential users, as it would allow a greater understanding of the context of the data, focusing on data reuse. | en |
dc.description.affiliation | Univ Fed Minas Gerais, Doutorando Gestao & Org Conhecimento, Belo Horizonte, MG, Brazil | |
dc.description.affiliation | Univ Fed Minas Gerais, Ciencia Informacao, Belo Horizonte, MG, Brazil | |
dc.description.affiliation | Univ Fed Minas Gerais, Escola Ciencia Informacao, Belo Horizonte, MG, Brazil | |
dc.description.affiliation | Univ Fed Paraiba, Dept Ciencia Informacao, Joao Pessoa, Paraiba, Brazil | |
dc.description.affiliation | Univ Estadual Paulista, Ciencia Informacao, Sao Paulo, Brazil | |
dc.description.affiliationUnesp | Univ Estadual Paulista, Ciencia Informacao, Sao Paulo, Brazil | |
dc.format.extent | 26 | |
dc.identifier | http://dx.doi.org/10.5007/1518-2924.2022.e85148 | |
dc.identifier.citation | Encontros Bibli-revista Eletronica De Biblioteconomia E Ciencia Da Informacao. Florianopolis: Univ Federal Santa Catarina, v. 27, 26 p., 2022. | |
dc.identifier.doi | 10.5007/1518-2924.2022.e85148 | |
dc.identifier.issn | 1518-2924 | |
dc.identifier.uri | http://hdl.handle.net/11449/237571 | |
dc.identifier.wos | WOS:000804414500004 | |
dc.language.iso | por | |
dc.publisher | Univ Federal Santa Catarina | |
dc.relation.ispartof | Encontros Bibli-revista Eletronica De Biblioteconomia E Ciencia Da Informacao | |
dc.source | Web of Science | |
dc.subject | Scientific data | |
dc.subject | Datasets | |
dc.subject | Data repositories | |
dc.subject | Formats and extensions | |
dc.subject | Survey | |
dc.title | The nature of scientific datasets in South American repositories: a survey of formats and extensions | en |
dc.type | Artigo | pt |
dcterms.rightsHolder | Univ Federal Santa Catarina | |
dspace.entity.type | Publication | |
unesp.campus | Universidade Estadual Paulista (UNESP), Faculdade de Filosofia e Ciências, Marília | pt |