학술논문

SFE: A Simple, Fast, and Efficient Feature Selection Algorithm for High-Dimensional Data

Document Type

Periodical

Author

Ahadzadeh, B.; Abdar, M.; Safara, F.; Khosravi, A.; Menhaj, M.B.; Suganthan, P.N.

Source

IEEE Transactions on Evolutionary Computation IEEE Trans. Evol. Computat. Evolutionary Computation, IEEE Transactions on. 27(6):1896-1911 Dec, 2023

Subject

Computing and Processing
Feature extraction
Machine learning algorithms
Classification algorithms
Search problems
Computational efficiency
Filtering algorithms
Electronic mail
Evolutionary computational (EC) methods
feature selection (FS)
high-dimensional dataset
particle swarm optimization (PSO)

Language

ISSN

1089-778X
1941-0026

Abstract

In this article, a new feature selection (FS) algorithm, called simple, fast, and efficient (SFE), is proposed for high-dimensional datasets. The SFE algorithm performs its search process using a search agent and two operators: 1) nonselection and 2) selection. It comprises two phases: 1) exploration and 2) exploitation. In the exploration phase, the nonselection operator performs a global search in the entire problem search space for the irrelevant, redundant, trivial, and noisy features and changes the status of the features from selected mode to nonselected mode. In the exploitation phase, the selection operator searches the problem search space for the features with a high impact on the classification results and changes the status of the features from nonselected mode to selected mode. The proposed SFE is successful in FS from high-dimensional datasets. However, after reducing the dimensionality of a dataset, its performance cannot be increased significantly. In these situations, an evolutionary computational method could be used to find a more efficient subset of features in the new and reduced search space. To overcome this issue, this article proposes a hybrid algorithm, SFE-PSO (particle swarm optimization) to find an optimal feature subset. The efficiency and effectiveness of the SFE and the SFE-PSO for FS are compared on 40 high-dimensional datasets. Their performances were compared with six recently proposed FS algorithms. The results obtained indicate that the two proposed algorithms significantly outperform the other algorithms and can be used as efficient and effective algorithms in selecting features from high-dimensional datasets.

Online Access

Full Text (IEEE) Web of Science JCR 저널정보 Scopus Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송