Skip to main navigation Skip to search Skip to main content

Addressing data heterogeneity in distributed medical imaging with heterosync learning

  • Ultrasound Engineering Institute, Medical Industry Branch of China Association Plant Engineering (UE-MICAP)
  • Sun Yat-sen University
  • Guangxi University for Nationalities
  • The First Affiliated Hospital of Guangxi Medical University
  • Sixth Affiliated Hospital of Sun Yat-sen University (Guangdong Gastrointestinal Hospital)

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

Data heterogeneity critically limits distributed artificial intelligence (AI) in medical imaging. We propose HeteroSync Learning (HSL), a privacy-preserving framework that addresses heterogeneity through: (1) Shared Anchor Task (SAT) for cross-node representation alignment, and (2) an Auxiliary Learning Architecture coordinating SAT with local primary tasks. Validated via large-scale simulations (feature/label/quantity/combined heterogeneity) and a real-world multi-center thyroid cancer study, HSL outperforms local learning, 12 benchmark methods (FedAvg, FedProx, SplitAVG, FedRCL, FedCOME, etc.), and foundation models (e.g., CLIP) by better stability and up to 40% in area under the curve (AUC), matching central learning performance. HSL achieves 0.846 AUC on the out-of-distribution pediatric thyroid cancer data (outperforming others by 5.1-28.2%), demonstrating superior generalization. Visualizations confirm HSL successfully homogenizes heterogeneous distributions. This work provides an effective solution for distributed medical AI, enabling equitable collaboration across institutions and advancing healthcare AI democratization.

Original languageEnglish
Article number9416
JournalNature Communications
Volume16
Issue number1
DOIs
StatePublished - Dec 2025

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Fingerprint

Dive into the research topics of 'Addressing data heterogeneity in distributed medical imaging with heterosync learning'. Together they form a unique fingerprint.

Cite this