Classification with Machine Learning Algorithms after Hybrid Feature Selection in Imbalanced Data Sets

Bibliographic Details
Main Authors: Meryem Pulat, Ipek Deveci Kocakoç
Format: Article
Language: English
Published: Wrocław University of Science and Technology 2024-01-01
Series: Operations Research and Decisions
Online Access: https://ord.pwr.edu.pl/assets/papers_archive/ord2024vol34no4_10.pdf
Description
Summary: The efficacy of machine learning algorithms significantly depends on the adequacy and relevance of features in the data set. Hence, feature selection precedes the classification process. In this study, a hybrid feature selection approach, integrating filter and wrapper methods, was employed. This approach not only enhances classification accuracy, surpassing the results achievable with filter methods alone, but also reduces processing time compared to exclusive reliance on wrapper methods. Results indicate a general improvement in algorithm performance with the application of the hybrid feature selection approach. The study utilized the Taiwanese Bankruptcy and Statlog (German Credit Data) datasets from the UCI Machine Learning Repository. These datasets exhibit an unbalanced distribution, necessitating data preprocessing that considers this unbalance. After acknowledging the datasets' unbalanced nature, feature selection and subsequent classification processes were executed. (original abstract)
ISSN: 2081-8858, 2391-6060
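
Illustrative sketch: the summary describes a hybrid of filter and wrapper feature selection applied to imbalanced data, but this record does not state which filter, wrapper, classifier, or rebalancing step the authors used. The Python example below is therefore only a minimal sketch under assumptions, not the authors' method. It assumes scikit-learn, a synthetic 90/10 imbalanced data set in place of the UCI data, an ANOVA F-test (`f_classif`) as the filter, forward sequential selection as the wrapper, and class-weighted logistic regression as the classifier. The function name `hybrid_selection_demo` and all parameter values are hypothetical.

```python
from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest, SequentialFeatureSelector, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import balanced_accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler


def hybrid_selection_demo(seed=0):
    """Filter-then-wrapper feature selection on synthetic imbalanced data."""
    # Assumption: synthetic stand-in data with roughly a 90/10 class split.
    X, y = make_classification(
        n_samples=600, n_features=20, n_informative=5, n_redundant=2,
        weights=[0.9, 0.1], random_state=seed,
    )
    # Stratify so the minority class appears in both train and test splits.
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.3, stratify=y, random_state=seed
    )

    # Assumption: class weighting stands in for the imbalance-aware preprocessing.
    clf = LogisticRegression(class_weight="balanced", max_iter=1000)

    pipe = Pipeline([
        ("scale", StandardScaler()),
        # Filter step: cheap univariate ranking narrows 20 features to 10.
        ("filter", SelectKBest(f_classif, k=10)),
        # Wrapper step: forward search over the 10 survivors, scored by
        # cross-validated balanced accuracy of the classifier itself.
        ("wrapper", SequentialFeatureSelector(
            clf, n_features_to_select=5, direction="forward",
            scoring="balanced_accuracy", cv=3,
        )),
        ("clf", clf),
    ])
    pipe.fit(X_train, y_train)

    n_selected = int(pipe.named_steps["wrapper"].get_support().sum())
    score = balanced_accuracy_score(y_test, pipe.predict(X_test))
    return n_selected, score
```

Calling `hybrid_selection_demo()` returns the number of features kept by the wrapper step and the balanced accuracy on the held-out split. Running the filter first means the wrapper searches 10 candidates instead of 20, which is the processing-time saving the summary attributes to the hybrid approach.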