IA-KNNR: A Novel Imbalance-Aware Approach for Handling Multi-Label Class Imbalance Problem

Multi-label learning (MLL) is a supervised learning where the classifier needs to learn from the data where one instance can belong to more than one class (label). Due to its wide application, MLL has attracted great attention from academia and researchers. However, due to the complex nature of mult...

Full description

Saved in:
Bibliographic Details
Main Authors: Himanshu Suyal, Shiv Naresh Shivhare, Gulshan Shrivastava, Rohit Singh, Ansh Singhal
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/11075775/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1839625308223832064
author Himanshu Suyal
Shiv Naresh Shivhare
Gulshan Shrivastava
Rohit Singh
Ansh Singhal
author_facet Himanshu Suyal
Shiv Naresh Shivhare
Gulshan Shrivastava
Rohit Singh
Ansh Singhal
author_sort Himanshu Suyal
collection DOAJ
description Multi-label learning (MLL) is a supervised learning where the classifier needs to learn from the data where one instance can belong to more than one class (label). Due to its wide application, MLL has attracted great attention from academia and researchers. However, due to the complex nature of multi-label data (MLD), it suffers from the class imbalance problem, where data distribution is heterogeneous, and learning from that imbalanced data is challenging. To address this, we propose IA-KNNR, a novel imbalance-aware resampling framework that leverages distance-weighted K-nearest neighbors, label similarity, and a dynamic weighting strategy that accounts for the co-occurrence patterns between frequent and infrequent labels. In addition, DBSCAN-based prototype selection is used to select the most appropriate training data to reduce the complexity of the training time. Extensive experiments on ten multi-label benchmark datasets demonstrate that IA-KNNR performs significantly better than state-of-the-art approaches, up to 26.4%.
format Article
id doaj-art-954736c51a3b482ab23f6db18e79fd1c
institution Matheson Library
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-954736c51a3b482ab23f6db18e79fd1c2025-07-17T23:01:53ZengIEEEIEEE Access2169-35362025-01-011311999912001710.1109/ACCESS.2025.358614611075775IA-KNNR: A Novel Imbalance-Aware Approach for Handling Multi-Label Class Imbalance ProblemHimanshu Suyal0https://orcid.org/0000-0002-8731-5800Shiv Naresh Shivhare1https://orcid.org/0000-0002-6306-1113Gulshan Shrivastava2https://orcid.org/0000-0003-3671-4921Rohit Singh3https://orcid.org/0000-0002-7626-8885Ansh Singhal4https://orcid.org/0009-0008-0922-8200School of Computer Science Engineering and Technology, Bennett University, Greater Noida, Uttar Pradesh, IndiaSchool of Computer Science Engineering and Technology, Bennett University, Greater Noida, Uttar Pradesh, IndiaSchool of Computer Science Engineering and Technology, Bennett University, Greater Noida, Uttar Pradesh, IndiaSchool of Computer Science and Engineering, Manipal University Jaipur, Jaipur, Rajasthan, IndiaSchool of Computer Science Engineering and Technology, Bennett University, Greater Noida, Uttar Pradesh, IndiaMulti-label learning (MLL) is a supervised learning where the classifier needs to learn from the data where one instance can belong to more than one class (label). Due to its wide application, MLL has attracted great attention from academia and researchers. However, due to the complex nature of multi-label data (MLD), it suffers from the class imbalance problem, where data distribution is heterogeneous, and learning from that imbalanced data is challenging. To address this, we propose IA-KNNR, a novel imbalance-aware resampling framework that leverages distance-weighted K-nearest neighbors, label similarity, and a dynamic weighting strategy that accounts for the co-occurrence patterns between frequent and infrequent labels. In addition, DBSCAN-based prototype selection is used to select the most appropriate training data to reduce the complexity of the training time. Extensive experiments on ten multi-label benchmark datasets demonstrate that IA-KNNR performs significantly better than state-of-the-art approaches, up to 26.4%.https://ieeexplore.ieee.org/document/11075775/Multi-label classificationimbalanced learningoversamplingsustainable innovation
spellingShingle Himanshu Suyal
Shiv Naresh Shivhare
Gulshan Shrivastava
Rohit Singh
Ansh Singhal
IA-KNNR: A Novel Imbalance-Aware Approach for Handling Multi-Label Class Imbalance Problem
IEEE Access
Multi-label classification
imbalanced learning
oversampling
sustainable innovation
title IA-KNNR: A Novel Imbalance-Aware Approach for Handling Multi-Label Class Imbalance Problem
title_full IA-KNNR: A Novel Imbalance-Aware Approach for Handling Multi-Label Class Imbalance Problem
title_fullStr IA-KNNR: A Novel Imbalance-Aware Approach for Handling Multi-Label Class Imbalance Problem
title_full_unstemmed IA-KNNR: A Novel Imbalance-Aware Approach for Handling Multi-Label Class Imbalance Problem
title_short IA-KNNR: A Novel Imbalance-Aware Approach for Handling Multi-Label Class Imbalance Problem
title_sort ia knnr a novel imbalance aware approach for handling multi label class imbalance problem
topic Multi-label classification
imbalanced learning
oversampling
sustainable innovation
url https://ieeexplore.ieee.org/document/11075775/
work_keys_str_mv AT himanshusuyal iaknnranovelimbalanceawareapproachforhandlingmultilabelclassimbalanceproblem
AT shivnareshshivhare iaknnranovelimbalanceawareapproachforhandlingmultilabelclassimbalanceproblem
AT gulshanshrivastava iaknnranovelimbalanceawareapproachforhandlingmultilabelclassimbalanceproblem
AT rohitsingh iaknnranovelimbalanceawareapproachforhandlingmultilabelclassimbalanceproblem
AT anshsinghal iaknnranovelimbalanceawareapproachforhandlingmultilabelclassimbalanceproblem