JUCS - Journal of Universal Computer Science 26(6): 698-719, doi: 10.3897/jucs.2020.037
Undersampling Instance Selection for Hybrid and Incomplete Imbalanced Data
expand article infoOscar Camacho-Nieto, Cornelio Yáñez-Márquez§, Yenny Villuendas-Rey
‡ CIDETEC-IPN, Ciudad de Mexico, Mexico§ CIC-IPN, Ciudad de Mexico, Mexico
Open Access
Abstract
This paper proposes a novel undersampling method, for dealing with imbalanced datasets. The proposal is based on a novel instance importance measure (also introduced in this paper), and is able to balance hybrid and incomplete data. The numerical experiments carried out show the proposed undersampling algorithm outperforms others algorithms of the state of art, in well-known imbalanced datasets.
Keywords
undersampling, imbalanced data, hybrid and incomplete data