跳到主要导航 跳到搜索 跳到主要内容

Self-supervised saliency-guided dual-attention fusion for benign-malignant thyroid nodule diagnosis in ultrasound

  • Xi'an University of Architecture and Technology
  • Northwestern Polytechnical University Xian
  • Tongji University

科研成果: 期刊稿件文章同行评审

摘要

Accurate benign–malignant diagnosis of thyroid nodules from ultrasound images remains challenging due to low contrast, speckle noise, and substantial inter-center variability. Most existing deep learning approaches operate directly on raw ultrasound images and rely on fully supervised learning, lacking explicit mechanisms to leverage lesion-level localization for diagnostic guidance. Although some methods incorporate segmentation to assist classification, they typically depend on fully supervised pixel-level annotations, which are often unavailable in real-world clinical settings and limit generalization to data from different clinical centers. To address these limitations, we propose a self-supervised saliency-guided dual-attention fusion framework for thyroid nodule diagnosis in ultrasound images. The proposed method learns a saliency mask to highlight diagnostically informative regions and employs self-supervised reconstruction, boundary consistency, and affinity constraints to regularize saliency learning without requiring pixel-level annotations. In addition, a lightweight dual-attention fusion module is designed to integrate original and saliency-enhanced features. Unlike conventional attention mechanisms, this module combines variance-guided channel attention with patch-wise spatial attention, enabling selective emphasis on informative channels in the original image while capturing lesion-specific contextual features guided by the saliency map. This design facilitates the learning of more discriminative representations for benign–malignant classification. Extensive experiments conducted on three multi-center thyroid ultrasound datasets, under both single-center and cross-center evaluation protocols, demonstrate that the proposed framework consistently outperforms state-of-the-art methods, with particularly notable gains in multi-center generalization performance. Code is available at https://github.com/guangguangLi/MultiModalClassifier.

源语言英语
文章编号133641
期刊Neurocomputing
685
DOI
出版状态已出版 - 7 7月 2026

指纹

探究 'Self-supervised saliency-guided dual-attention fusion for benign-malignant thyroid nodule diagnosis in ultrasound' 的科研主题。它们共同构成独一无二的指纹。

引用此