Logo do repositório

MC-SQ: A Highly Accurate Ensemble for Multi-class Quantification

dc.contributor.authorDonyavi, Zahra
dc.contributor.authorSerapio, Adriane [UNESP]
dc.contributor.authorBatista, Gustavo
dc.contributor.institutionUniversity of New South Wales
dc.contributor.institutionUniversidade Estadual Paulista (UNESP)
dc.date.accessioned2025-04-29T20:09:30Z
dc.date.issued2023-01-01
dc.description.abstractQuantification research proposes methods to estimate the class distribution in an independent sample. Many areas, such as epidemiology, sentiment analysis, political research and ecological surveillance, rely on quantification methods to estimate aggregated quantities. For instance, epidemiologists are often concerned with the dynamics of the number of disease cases across space and time. Thus, while classification predicts individual subjects, quantification is the class of methods that directly estimate the number of cases. Quantification is a thriving research area, and the community has proposed several approaches in the last decade. Nevertheless, most quantification research has focused on binary-class quantifiers, expecting these approaches to extend to multi-class using the one-versus-all (OVA) approach. However, there is enough empirical evidence indicating the performance of OVA multi-class quantifiers is subpar. This paper has two main contributions. First, we demonstrate why OVA quantifiers are doomed to underperform in multi-class settings due to a distribution shift they cannot handle. Second, we propose a new class of quantifiers based on ensemble learning that boosts the performance of the base quantifiers in the binary and, more importantly, multi-class settings. In one of the most comprehensive experimental setups ever attempted in quantification research, we show that our ensembles are the best-performing quantifiers compared with 33 state-of-the-art (single and ensemble) quantifiers and rank first in a recent quantification competition.en
dc.description.affiliationUniversity of New South Wales
dc.description.affiliationSão Paulo State University
dc.description.affiliationUnespSão Paulo State University
dc.description.sponsorshipFundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
dc.description.sponsorshipIdFAPESP: 2021/12278-0
dc.format.extent622-630
dc.identifierhttp://dx.doi.org/10.1137/1.9781611977653.ch70
dc.identifier.citation2023 SIAM International Conference on Data Mining, SDM 2023, p. 622-630.
dc.identifier.doi10.1137/1.9781611977653.ch70
dc.identifier.scopus2-s2.0-85174257065
dc.identifier.urihttps://hdl.handle.net/11449/307468
dc.language.isoeng
dc.relation.ispartof2023 SIAM International Conference on Data Mining, SDM 2023
dc.sourceScopus
dc.titleMC-SQ: A Highly Accurate Ensemble for Multi-class Quantificationen
dc.typeTrabalho apresentado em eventopt
dspace.entity.typePublication

Arquivos

Coleções