Ensembles of Classifiers and Quantifiers with Data Fusion for Quantification Learning
Carregando...
Data
Orientador
Coorientador
Pós-graduação
Curso de graduação
Título da Revista
ISSN da Revista
Título de Volume
Editor
Tipo
Trabalho apresentado em evento
Direito de acesso
Resumo
Quantification is a supervised Machine Learning task that estimates the class distribution in an unlabeled test set. Quantification has practical applications in various fields, including medical research, environmental monitoring, and quality control. For instance, medical research often estimates the prevalence of a particular disease in a population. Despite being a thriving research area, most existing quantification methods are limited to binary-class problems. Moreover, recent experimental evidence suggests that modern state-of-the-art quantifiers do not perform well for multi-class problems, which are prevalent in quantification. This paper proposes two novel multi-class ensemble quantifiers, FMC-SQ and FMC-MQ, that use data fusion methods at the classifier and quantifier levels. We conducted experiments with 12 state-of-the-art (single and ensemble) quantifiers to evaluate our models on 31 multi-class datasets. Our experimental results indicate that FMC-MQ is the best-performing quantifier outperforming other single and ensemble methods. Also, aggregating quantifier outputs seem to be a more promising research direction than aggregating classification scores for quantification.
Descrição
Palavras-chave
class probability estimation, ensembles, machine learning, multi-class, prevalence estimation, Quantification
Idioma
Inglês
Citação
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 14276 LNAI, p. 3-17.