Addressing the overlapping data problem in classification using the one-vs-one decomposition strategy

dc.contributor.author: Sáez, José Antonio
dc.contributor.author: Galar Idoate, Mikel
dc.contributor.author: Krawczyk, Bartosz
dc.contributor.department: Institute of Smart Cities - ISC (en)
dc.date.accessioned: 2020-02-18T13:10:04Z
dc.date.available: 2020-02-18T13:10:04Z
dc.date.issued: 2019
dc.description.abstract: Learning good-performing classifiers from data with easily separable classes is not usually a difficult task for most of the algorithms. However, problems affecting classifier performance may arise when samples from different classes share similar characteristics or are overlapped, since the boundaries of each class may not be clearly defined. In order to address this problem, the majority of existing works in the literature propose to either adapt well-known algorithms to reduce the negative impact of overlapping or modify the original data by introducing/removing features which decrease the overlapping region. However, these approaches may present some drawbacks: the changes in specific algorithms may not be useful for other methods and modifying the original data can produce variable results depending on data characteristics and the technique used later. An unexplored and interesting research line to deal with the overlapping phenomenon consists of decomposing the problem into several binary subproblems to reduce its complexity, diminishing the negative effects of overlapping. Based on this novel idea in the field of overlapping data, this paper proposes the usage of the One-vs-One (OVO) strategy to alleviate the presence of overlapping, without modifying existing algorithms or data conformations as suggested by previous works. To test the suitability of the OVO approach with overlapping data, and due to the lack of proposals in the specialized literature, this research also introduces a novel scheme to artificially induce overlapping in real-world datasets, which enables us to simulate different types and levels of overlapping among the classes. The results obtained show that the methods using the OVO achieve better performances when considering data with overlapped classes than those dealing with all classes at the same time. (en)
dc.description.sponsorship: The work of J. A. Sáez was supported by the Juan de la Cierva-formación Fellowship from the Spanish Ministry of Economy, Industry and Competitiveness under Grant FJCI-2015-25547. The work of M. Galar was supported in part by the Spanish Ministry of Economy, Industry and Competitiveness (MINECO), in part by the AEI/FEDER, and in part by the UE under Project TIN2016-77356-P. The work of B. Krawczyk was supported by the Polish National Science Center under Grant UMO-2015/19/B/ST6/01597. (en)
dc.format.extent: 15 p.
dc.format.mimetype: application/pdf (en)
dc.identifier.doi: 10.1109/ACCESS.2019.2925300
dc.identifier.issn: 2169-3536
dc.identifier.uri: https://academica-e.unavarra.es/handle/2454/36275
dc.language.iso: eng (en)
dc.publisher: IEEE (en)
dc.relation.ispartof: IEEE Access, 2019, 7, 83396-83411 (en)
dc.relation.projectID: info:eu-repo/grantAgreement/ES/1PE/TIN2016-77356-P/
dc.relation.publisherversion: https://doi.org/10.1109/ACCESS.2019.2925300
dc.rights: This work is licensed under a Creative Commons Attribution 3.0 License (en)
dc.rights.accessRights: info:eu-repo/semantics/openAccess
dc.rights.uri: https://creativecommons.org/licenses/by/3.0/
dc.subject: Classification (en)
dc.subject: Data generation (en)
dc.subject: Decomposition strategies (en)
dc.subject: One-vs-one (en)
dc.subject: Overlapping data (en)
dc.title: Addressing the overlapping data problem in classification using the one-vs-one decomposition strategy (en)
dc.type: info:eu-repo/semantics/article
dc.type.version: info:eu-repo/semantics/publishedVersion
dspace.entity.type: Publication
relation.isAuthorOfPublication: 44c7a308-9c21-49ef-aa03-b45c2c5a06fd
relation.isAuthorOfPublication.latestForDiscovery: 44c7a308-9c21-49ef-aa03-b45c2c5a06fd

Files

Original bundle
Name: 2019100193_Saez_AddressingOverlapping.pdf
Size: 1.39 MB
Format: Adobe Portable Document Format