ASVBM: Structural variant benchmarking with local joint analysis for multiple callsets

Accurate benchmarking of structural variant (SV) detection is essential for advancing the development and application of human whole-genome sequencing (WGS). A fundamental challenge in benchmarking SV detection results is determining whether two SVs represent the same event. Differences in the varia...

Full description

Saved in:
Bibliographic Details
Main Authors: Peizheng Mu, Xiangyan Feng, Lanxin Tong, Jie Huang, Chaoqun Zhu, Fei Wang, Wei Quan, Yuanjun Ma, Yucui Dong, Xiao Zhu
Format: Article
Language:English
Published: Elsevier 2025-01-01
Series:Computational and Structural Biotechnology Journal
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2001037025002612
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Accurate benchmarking of structural variant (SV) detection is essential for advancing the development and application of human whole-genome sequencing (WGS). A fundamental challenge in benchmarking SV detection results is determining whether two SVs represent the same event. Differences in the variation-awareness and strategic implementation of aligners inherently constrain SV detection algorithms that rely on alignment-based approaches. Traditional benchmarking, which primarily focuses on comparing and matching individual variants, makes it difficult to capture the relationships between multiple adjacent variants. We introduced ASVBM, an improved benchmarking framework that introduces the notion of latent positives and leverages a joint analysis and validation strategy based on local variants. This performance improvement arose from the discovery that multiple smaller variants are nearly equivalent to a larger variant. We comprehensively evaluated the performance of six state-of-the-art variant calling pipelines using real WGS datasets. According to multiple matching criteria, ASVBM employs a joint analysis strategy to uncover potential equivalences between the callset and the benchmark set, thereby reducing false mismatches caused by differences in variant representation. ASVBM is available at https://github.com/zhuxiao/asvbm.
ISSN:2001-0370