MC-SQ: A Highly Accurate Ensemble for Multi-class Quantification
dc.contributor.author | Donyavi, Zahra | |
dc.contributor.author | Serapio, Adriane [UNESP] | |
dc.contributor.author | Batista, Gustavo | |
dc.contributor.author | Shekhar, S. | |
dc.contributor.author | Zhou, Z. H. | |
dc.contributor.author | Chiang, Y. Y. | |
dc.contributor.author | Stiglic, G. | |
dc.contributor.institution | Univ New South Wales | |
dc.contributor.institution | Universidade Estadual Paulista (UNESP) | |
dc.date.accessioned | 2025-04-29T20:11:31Z | |
dc.date.issued | 2023-01-01 | |
dc.description.abstract | Quantification research proposes methods to estimate the class distribution in an independent sample. Many areas, such as epidemiology, sentiment analysis, political research and ecological surveillance, rely on quantification methods to estimate aggregated quantities. For instance, epidemiologists are often concerned with the dynamics of the number of disease cases across space and time. Thus, while classification predicts individual subjects, quantification is the class of methods that directly estimate the number of cases. Quantification is a thriving research area, and the community has proposed several approaches in the last decade. Nevertheless, most quantification research has focused on binary-class quantifiers, expecting these approaches to extend to multi-class using the one-versus-all (OVA) approach. However, there is enough empirical evidence indicating the performance of OVA multi-class quantifiers is subpar. This paper has two main contributions. First, we demonstrate why OVA quantifiers are doomed to underperform in multiclass settings due to a distribution shift they cannot han-dle. Second, we propose a new class of quantifiers based on ensemble learning that boosts the performance of the base quantifiers in the binary and, more importantly, multi-class settings. In one of the most comprehensive experimental setups ever attempted in quantification research, we show that our ensembles are the best-performing quantifiers compared with 33 state-of-the-art (single and ensemble) quantifiers and rank first in a recent quantification competition. | en |
dc.description.affiliation | Univ New South Wales, Sydney, NSW, Australia | |
dc.description.affiliation | Sao Paulo State Univ, Sao Paulo, Brazil | |
dc.description.affiliationUnesp | Sao Paulo State Univ, Sao Paulo, Brazil | |
dc.description.sponsorship | Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP) | |
dc.description.sponsorshipId | FAPESP: 2021/12278-0 | |
dc.format.extent | 622-630 | |
dc.identifier.citation | Proceedings Of The 2023 Siam International Conference On Data Mining, Sdm. Philadelphia: Siam, p. 622-630, 2023. | |
dc.identifier.uri | https://hdl.handle.net/11449/308210 | |
dc.identifier.wos | WOS:001284687600080 | |
dc.language.iso | eng | |
dc.publisher | Siam | |
dc.relation.ispartof | Proceedings Of The 2023 Siam International Conference On Data Mining, Sdm | |
dc.source | Web of Science | |
dc.title | MC-SQ: A Highly Accurate Ensemble for Multi-class Quantification | en |
dc.type | Trabalho apresentado em evento | pt |
dcterms.rightsHolder | Siam | |
dspace.entity.type | Publication |