O2PF: Oversampling via optimum-path forest for breast cancer detection

dc.contributor.authorPassos, Leandro [UNESP]
dc.contributor.authorJodas, Danilo [UNESP]
dc.contributor.authorRibeiro, Luiz [UNESP]
dc.contributor.authorMoreira, Thierry [UNESP]
dc.contributor.authorPapa, Joao [UNESP]
dc.contributor.institutionUniversidade Estadual Paulista (UNESP)
dc.date.accessioned2022-04-28T19:29:21Z
dc.date.available2022-04-28T19:29:21Z
dc.date.issued2020-07-01
dc.description.abstractBreast cancer is among the most deadly diseases, distressing mostly women worldwide. Although traditional methods for detection have presented themselves as valid for the task, they still commonly present low accuracies and demand considerable time and effort from professionals. Therefore, a computer-aided diagnosis (CAD) system capable of providing early detection becomes hugely desirable. In the last decade, machine learning-based techniques have been of paramount importance in this context, since they are capable of extracting essential information from data and reasoning about it. However, such approaches still suffer from imbalanced data, specifically on medical issues, where the number of healthy people samples is, in general, considerably higher than the number of patients. Therefore this paper proposes the O2PF, a data oversampling method based on the unsupervised Optimum-Path Forest Algorithm. Experiments conducted over the full oversampling scenario state the robustness of the model, which is compared against three well-established oversampling methods considering three breast cancer and three general-purpose tasks for medical issues datasets.en
dc.description.affiliationSao Paulo State University Department of Computing
dc.description.affiliationUnespSao Paulo State University Department of Computing
dc.format.extent498-503
dc.identifierhttp://dx.doi.org/10.1109/CBMS49503.2020.00100
dc.identifier.citationProceedings - IEEE Symposium on Computer-Based Medical Systems, v. 2020-July, p. 498-503.
dc.identifier.doi10.1109/CBMS49503.2020.00100
dc.identifier.issn1063-7125
dc.identifier.scopus2-s2.0-85091143461
dc.identifier.urihttp://hdl.handle.net/11449/221556
dc.language.isoeng
dc.relation.ispartofProceedings - IEEE Symposium on Computer-Based Medical Systems
dc.sourceScopus
dc.subjectData imbalance
dc.subjectOptimum-path forest
dc.subjectOversampling
dc.titleO2PF: Oversampling via optimum-path forest for breast cancer detectionen
dc.typeTrabalho apresentado em evento

Arquivos

Coleções