Pedronette, Daniel Carlos Guimarães [UNESP]
Latecki, Longin Jan
2021-06-25
2021-06-25
2021-03-01
Information Processing and Management, v. 58, n. 2, 2021.
0306-4573
http://hdl.handle.net/11449/206925
Graph Convolutional Networks (GCNs) have been established as a fundamental approach for representation learning on graphs, based on convolution operations on non-Euclidean domains defined by graph-structured data. GCNs and their variants have achieved state-of-the-art results on classification tasks, especially in semi-supervised learning scenarios. A central challenge in semi-supervised classification is how to exploit as much as possible of the useful information encoded in the unlabeled data. In this paper, we address this issue through a novel self-training approach for improving the accuracy of GCNs on semi-supervised classification tasks. A margin score is used within a rank-based model to identify the most confident sample predictions. These predictions are then used as an expanded labeled set in a second-stage training step. Our approach is suitable for different GCN models. Moreover, we also propose a rank aggregation of the labeled sets obtained by different GCN models. The experimental evaluation considers four GCN variations and traditional benchmarks extensively used in the literature. Significant accuracy gains were achieved for all evaluated models, reaching results comparable or superior to the state of the art. The best results were achieved by rank aggregation self-training on combinations of the four GCN models.
eng
Graph convolutional networks
Rank model
Self-training
Semi-supervised learning
Rank-based self-training for graph convolutional networks
Article
10.1016/j.ipm.2020.102443
2-s2.0-85097135780
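
The self-training idea summarized in the abstract above can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not define the margin score or the aggregation rule, so the sketch assumes that the margin is the gap between the two highest class probabilities and that rankings are fused with a Borda-style sum of positions. All function names, node identifiers, and probabilities are hypothetical.

```python
from typing import Dict, List


def margin(probs: List[float]) -> float:
    """Assumed margin score: gap between the two highest class probabilities."""
    top = sorted(probs, reverse=True)
    return top[0] - top[1]


def rank_by_margin(predictions: Dict[str, List[float]]) -> List[str]:
    """Order unlabeled nodes from most to least confident prediction."""
    return sorted(predictions, key=lambda n: margin(predictions[n]), reverse=True)


def expand_labeled_set(
    labeled: Dict[str, int],
    predictions: Dict[str, List[float]],
    k: int,
) -> Dict[str, int]:
    """Add the k top-ranked nodes, with their predicted class, to the labeled set.

    The result would be the training set of a second-stage training step.
    """
    expanded = dict(labeled)
    for node in rank_by_margin(predictions)[:k]:
        probs = predictions[node]
        expanded[node] = probs.index(max(probs))  # pseudo-label = argmax class
    return expanded


def aggregate_rankings(rankings: List[List[str]]) -> List[str]:
    """Borda-style fusion (an assumption): lower summed position ranks first."""
    score: Dict[str, int] = {}
    for ranking in rankings:
        for position, node in enumerate(ranking):
            score[node] = score.get(node, 0) + position
    return sorted(score, key=lambda n: score[n])


# Hypothetical softmax outputs of one first-stage GCN for three unlabeled nodes.
predictions = {
    "a": [0.90, 0.05, 0.05],
    "b": [0.40, 0.35, 0.25],
    "c": [0.10, 0.70, 0.20],
}
labeled = {"x": 2}  # hypothetical initial labeled set

ranking = rank_by_margin(predictions)
expanded = expand_labeled_set(labeled, predictions, k=2)

# Hypothetical rankings produced by three different GCN models.
fused = aggregate_rankings([["a", "c", "b"], ["c", "a", "b"], ["a", "b", "c"]])
```

In this sketch, `k` controls how many pseudo-labeled nodes are added before the second training stage. Choosing it too large risks adding wrong labels, which is why the nodes are ranked by confidence first.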