ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Impression Generation on Multi-Institution and Multi-System Data

  • Tianyang Zhong
  • , Wei Zhao
  • , Yutong Zhang
  • , Yi Pan
  • , Peixin Dong
  • , Zuowei Jiang
  • , Hanqi Jiang
  • , Yifan Zhou
  • , Xiaoyan Kui
  • , Youlan Shang
  • , Lin Zhao
  • , Li Yang
  • , Yaonai Wei
  • , Zhuoyi Li
  • , Jiadong Zhang
  • , Longtao Yang
  • , Hao Chen
  • , Huan Zhao
  • , Yuxiao Liu
  • , Ning Zhu
  • Yiwei Li, Yisong Wang, Jiaqi Yao, Jiaqi Wang, Ying Zeng, Lei He, Chao Zheng, Zhixue Zhang, Ming Li, Zhengliang Liu, Haixing Dai, Zihao Wu, Shu Zhang, Xiaoyan Cai, Xintao Hu, Shijie Zhao, Xi Jiang, Xin Zhang, Wei Liu, Xiang Li, Lei Guo, Dinggang Shen, Junwei Han, Tianming Liu, Jun Liu, Tuo Zhang

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Achieving clinical level performance and widespread deployment for generating radiology impressions encounters a giant challenge for conventional artificial intelligence models tailored to specific diseases and organs. Concurrent with the increasing accessibility of radiology reports and advancements in modern general AI techniques, the emergence and potential of deployable radiology AI exploration have been bolstered. Here, we present ChatRadio-Valuer, the first general radiology diagnosis large language model for localized deployment within hospitals and being close to clinical use for multi-institution and multi-system diseases. ChatRadio-Valuer achieved 15 state-of-the-art results across five human systems and six institutions in clinical-level events (n = 332,673) through rigorous and full-spectrum assessment, including engineering metrics, clinical validation, and efficiency evaluation. Notably, it exceeded OpenAI's GPT-3.5 and GPT-4 models, achieving superior performance in comprehensive disease diagnosis compared to the average level of radiology experts. Besides, ChatRadio-Valuer supports zero-shot transfer learning, greatly boosting its effectiveness as a radiology assistant, while ensuring adherence to privacy standards and being readily utilized for large-scale patient populations. Our expeditions suggest the development of localized LLMs would become an imperative avenue in hospital applications.

Original languageEnglish
Pages (from-to)1050-1061
Number of pages12
JournalIEEE Transactions on Biomedical Engineering
Volume73
Issue number3
DOIs
StatePublished - 2026

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Radiology impression
  • generalization
  • large language model
  • localization and deployment

Fingerprint

Dive into the research topics of 'ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Impression Generation on Multi-Institution and Multi-System Data'. Together they form a unique fingerprint.

Cite this