Improving semi-supervised learning through optimum connectivity

Carregando...
Imagem de Miniatura

Data

2016-12-01

Orientador

Coorientador

Pós-graduação

Curso de graduação

Título da Revista

ISSN da Revista

Título de Volume

Editor

Elsevier B.V.

Tipo

Artigo

Direito de acesso

Acesso abertoAcesso Aberto

Resumo

The annotation of large data sets by a classifier is a problem whose challenge increases as the number of labeled samples used to train the classifier reduces in comparison to the number of unlabeled samples. In this context, semi-supervised learning methods aim at discovering and labeling informative samples among the unlabeled ones, such that their addition to the correct class in the training set can improve classification performance. We present a semi-supervised learning approach that connects unlabeled and labeled samples as nodes of a minimum-spanning tree and partitions the tree into an optimum-path forest rooted at the labeled nodes. It is suitable when most samples from a same class are more closely connected through sequences of nearby samples than samples from distinct classes, which is usually the case in data sets with a reasonable relation between number of samples and feature space dimension. The proposed solution is validated by using several data sets and state-of-the-art methods as baselines. (C) 2016 Elsevier Ltd. All rights reserved.

Descrição

Idioma

Inglês

Como citar

Pattern Recognition. Oxford: Elsevier Sci Ltd, v. 60, p. 72-85, 2016.

Itens relacionados