Dual-Attention-Based Enhanced Unified Net for Precise GTV Segmentation of Nasopharyngeal Carcinoma in 3D MR Images

Accurate gross tumor volume (GTV) segmentation is essential for effective radiotherapy in nasopharyngeal carcinoma (NPC). However, challenges arise due to the nasopharyngeal region’s complex anatomy and the annotated data scarcity. Our study presents a dual-attention-based enhanced unifie...

Full description

Saved in:
Bibliographic Details
Main Authors: Hassan Ali Khan, Gong Xueqing, Muhammad Shoib Amin, Zeeshan Bin Siddique, Muhammad Ahtsam Naeem
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/11037670/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Accurate gross tumor volume (GTV) segmentation is essential for effective radiotherapy in nasopharyngeal carcinoma (NPC). However, challenges arise due to the nasopharyngeal region&#x2019;s complex anatomy and the annotated data scarcity. Our study presents a dual-attention-based enhanced unified network (DAEU-Net) designed for precise NPC GTV segmentation utilizing 3D T1, T2, and T1C-weighted MR images. Our approach involves splitting large-scale MR data into multiple patches and then training every patch independently. This approach effectively captures localized and detailed information without downscaling the image resolution. The DAEU-Net integrates channel-attention and pixel-attention modules within the encoder section, eliminating background noise and reducing information loss by enhancing the network&#x2019;s focus on detailed features. The decoder section incorporates bottleneck residual blocks to enhance the computing efficiency and robustness of the network. The proposed methodology surpasses the state-of-the-art models with a respective average symmetric surface distance (ASSD) of <inline-formula> <tex-math notation="LaTeX">$0.920\pm 0.386$ </tex-math></inline-formula> mm, <inline-formula> <tex-math notation="LaTeX">$0.987\pm 0.421$ </tex-math></inline-formula> mm, and <inline-formula> <tex-math notation="LaTeX">$1.043\pm 0.457$ </tex-math></inline-formula> mm and a dice similarity coefficient (DSC) of 0.896, 0.871, and 0.851, respectively. Multi-viewed animated MR images in three orthogonal dimensions (axial, sagittal, and coronal) with predicted NPC tumors and real GTV masks were shown to assist in comprehending the tumor&#x2019;s precise location. Our approach can significantly improves NPC tumor delineation, aids in automated tumor lesion segmentation, and reduces the annotation workload for oncologists.
ISSN:2169-3536