JUCS - Journal of Universal Computer Science 28(5): 499-539, doi: 10.3897/jucs.78218
An Enhanced Evolutionary Based Feature Selection Approach Using Grey Wolf Optimizer for the Classification of High-dimensional Biological Data
Thaer Thaher‡, Mohammed Awad§, Mohammed Aldasht|, Alaa Sheta¶, Hamza Turabieh#, Hamouda Chantar¤
‡ Department of Engineering and Technology Sciences, Arab American University, Jenin, Palestine
§ Department of Computer Systems Engineering, Arab American University, Jenin, Palestine
| Department of Computer Engineering, Palestine Polytechnic University, Hebron, Palestine
¶ Computer Science Department, Southern Connecticut State University, CT, United States of America
# Department of Information Technology, College of Computers and Information Technology, Taif University, Taif, Saudi Arabia
¤ Faculty of Information Technology, Sebha University, Sebha 18758, Libya
Open Access
Feature selection (FS) is a pre-processing step that eliminates redundant and less informative features to improve the performance of data mining techniques. It is also considered one of the key success factors for classification problems on high-dimensional datasets. This paper proposes an efficient wrapper feature selection method based on the Grey Wolf Optimizer (GWO). GWO is a recent metaheuristic algorithm that has been widely employed to solve diverse optimization problems. However, GWO mainly steers the search toward the leading wolves, which makes it prone to falling into local optima, especially on high-dimensional problems such as many biological datasets. To overcome this shortcoming, an enhanced variant of GWO called EGWO is introduced, which adopts two enhancements. First, the transition parameter concept is incorporated to move GWO from the exploration phase to the exploitation phase, and several adaptive non-linear decreasing formulas are introduced to control the transition parameters. Second, a random-based search strategy is exploited to strengthen diversity during the search process. Two binarization schemes using S-shaped and V-shaped transfer functions are incorporated to map the continuous search space into a binary one for FS. The efficiency of the proposed EGWO is validated on ten high-dimensional, low-sample biological datasets. Our experiments show the promising performance of EGWO compared to the original GWO approach and other state-of-the-art techniques in terms of dimensionality reduction and the enhancement of classification performance.
Feature selection, Enhanced grey wolf optimizer, Binary grey wolf optimizer, Classification, Biological data, Exploration, Metaheuristics
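The two mechanisms named in the abstract, a non-linearly decreasing transition parameter and S-shaped/V-shaped binarization, can be illustrated with a minimal Python sketch. It is not the paper's formulation: the function names, the `power` exponent, and the specific decay formula are assumptions made for illustration. The sigmoid and |tanh| transfer functions, and the V-shaped "flip the bit" update rule, are conventions commonly used in binary metaheuristics and may differ from the variants the authors evaluate.

```python
import math
import random


def s_shaped(x):
    """S-shaped (sigmoid) transfer function, written to avoid overflow."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def v_shaped(x):
    """V-shaped transfer function |tanh(x)|; equals 0 at x = 0."""
    return abs(math.tanh(x))


def binarize_s(position, rng):
    """S-shaped rule: set bit d to 1 with probability S(x_d)."""
    return [1 if rng.random() < s_shaped(x) else 0 for x in position]


def binarize_v(position, current_bits, rng):
    """V-shaped rule (assumed convention): flip bit d with probability V(x_d)."""
    return [
        1 - b if rng.random() < v_shaped(x) else b
        for x, b in zip(position, current_bits)
    ]


def transition_parameter(t, max_iter, power=2.0):
    """Hypothetical non-linear decay of the GWO parameter a from 2 to 0.

    Standard GWO uses the linear schedule a = 2 - 2 * t / max_iter;
    the exponent `power` here is an illustrative assumption.
    """
    return 2.0 * (1.0 - (t / max_iter) ** power)


if __name__ == "__main__":
    rng = random.Random(0)
    wolf = [rng.uniform(-4.0, 4.0) for _ in range(8)]  # continuous position
    mask = binarize_s(wolf, rng)                       # 1 = feature selected
    print("S-shaped mask:", mask)
    print("V-shaped mask:", binarize_v(wolf, mask, rng))
    print("a at t = 0, 50, 100:",
          [transition_parameter(t, 100) for t in (0, 50, 100)])
```

With `power` greater than 1, the parameter stays near 2 for longer, which prolongs exploration before the search shifts toward exploitation. With `power` equal to 1, the schedule reduces to the linear decay of standard GWO.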