Hybrid visualization approach to show documents similarity and content in a single view

Carregando...
Imagem de Miniatura

Data

2018-05-23

Autores

Andreotti, Andre Luiz Dias [UNESP]
Silva, Lenon Fachiano [UNESP]
Eler, Danilo Medeiros [UNESP]

Título da Revista

ISSN da Revista

Título de Volume

Editor

Resumo

Multidimensional projection techniques can be employed to project datasets from a higher to a lower dimensional space (e.g., 2D space). These techniques can be used to present the relationships of dataset instances based on distance by grouping or separating clusters of instances in the projected space. Several works have used multidimensional projections to aid in the exploration of document collections. Even though the projection techniques can organize a dataset, the user needs to read each document to understand the cluster generation. Alternatively, techniques such as topic extraction or tag clouds can be employed to present a summary of the document contents. To minimize the exploratory work and to aid in cluster analysis, this work proposes a new hybrid visualization to show both document relationship and content in a single view, employing multidimensional projections to relate documents and tag clouds. We show the effectiveness of the proposed approach in the exploration of two document collections composed by world news.

Descrição

Palavras-chave

Document pre-processing, Document similarity, Hybrid visualization, Multidimensional projection, Tag cloud, Text mining

Como citar

Information (Switzerland), v. 9, n. 6, 2018.