Benchmarking the First Realistic Dataset for Speech Separation

This paper presents a thorough benchmarking analysis of a recently introduced realistic dataset for speech separation tasks. The dataset contains audio mixtures that replicate real-life scenarios and is accompanied by ground truths, making it a valuable resource for researchers. Although the datase...

Full description

Saved in:
Bibliographic Details
Main Authors: Rawad MELHEM, Oumayma AL DAKKAK, Assef JAFAR
Format: Article
Language:English
Published: Institute of Fundamental Technological Research Polish Academy of Sciences 2025-07-01
Series:Archives of Acoustics
Subjects:
Online Access:https://acoustics.ippt.pan.pl/index.php/aa/article/view/4180
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This paper presents a thorough benchmarking analysis of a recently introduced realistic dataset for speech separation tasks. The dataset contains audio mixtures that replicate real-life scenarios and is accompanied by ground truths, making it a valuable resource for researchers. Although the dataset construction methodology was recently disclosed, its benchmarking and detailed performance analysis have not yet been conducted. In this study, we evaluate the performance of four speech separation models using two distinct testing sets, ensuring a robust evaluation. Our findings underscore the dataset’s efficacy to advance speech separation research within authentic environments. Furthermore, we propose a novel approach for assessing metrics in real-world speech separation systems, where ground truths are unavailable. This method aims to improve accuracy evaluations and refine models for practical applications.We make the dataset publicly available to encourage innovation and collaboration in the field.
ISSN:0137-5075
2300-262X