Logo do repositório
 

MC-SQ: A Highly Accurate Ensemble for Multi-class Quantification

dc.contributor.authorDonyavi, Zahra
dc.contributor.authorSerapio, Adriane [UNESP]
dc.contributor.authorBatista, Gustavo
dc.contributor.authorShekhar, S.
dc.contributor.authorZhou, Z. H.
dc.contributor.authorChiang, Y. Y.
dc.contributor.authorStiglic, G.
dc.contributor.institutionUniv New South Wales
dc.contributor.institutionUniversidade Estadual Paulista (UNESP)
dc.date.accessioned2025-04-29T20:11:31Z
dc.date.issued2023-01-01
dc.description.abstractQuantification research proposes methods to estimate the class distribution in an independent sample. Many areas, such as epidemiology, sentiment analysis, political research and ecological surveillance, rely on quantification methods to estimate aggregated quantities. For instance, epidemiologists are often concerned with the dynamics of the number of disease cases across space and time. Thus, while classification predicts individual subjects, quantification is the class of methods that directly estimate the number of cases. Quantification is a thriving research area, and the community has proposed several approaches in the last decade. Nevertheless, most quantification research has focused on binary-class quantifiers, expecting these approaches to extend to multi-class using the one-versus-all (OVA) approach. However, there is enough empirical evidence indicating the performance of OVA multi-class quantifiers is subpar. This paper has two main contributions. First, we demonstrate why OVA quantifiers are doomed to underperform in multiclass settings due to a distribution shift they cannot han-dle. Second, we propose a new class of quantifiers based on ensemble learning that boosts the performance of the base quantifiers in the binary and, more importantly, multi-class settings. In one of the most comprehensive experimental setups ever attempted in quantification research, we show that our ensembles are the best-performing quantifiers compared with 33 state-of-the-art (single and ensemble) quantifiers and rank first in a recent quantification competition.en
dc.description.affiliationUniv New South Wales, Sydney, NSW, Australia
dc.description.affiliationSao Paulo State Univ, Sao Paulo, Brazil
dc.description.affiliationUnespSao Paulo State Univ, Sao Paulo, Brazil
dc.description.sponsorshipFundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
dc.description.sponsorshipIdFAPESP: 2021/12278-0
dc.format.extent622-630
dc.identifier.citationProceedings Of The 2023 Siam International Conference On Data Mining, Sdm. Philadelphia: Siam, p. 622-630, 2023.
dc.identifier.urihttps://hdl.handle.net/11449/308210
dc.identifier.wosWOS:001284687600080
dc.language.isoeng
dc.publisherSiam
dc.relation.ispartofProceedings Of The 2023 Siam International Conference On Data Mining, Sdm
dc.sourceWeb of Science
dc.titleMC-SQ: A Highly Accurate Ensemble for Multi-class Quantificationen
dc.typeTrabalho apresentado em eventopt
dcterms.rightsHolderSiam
dspace.entity.typePublication

Arquivos

Coleções