TY - JOUR
T1 - An Iterative Optimizing Framework for Radiology Report Summarization With ChatGPT
AU - Ma, Chong
AU - Wu, Zihao
AU - Wang, Jiaqi
AU - Xu, Shaochen
AU - Wei, Yaonai
AU - Liu, Zhengliang
AU - Zeng, Fang
AU - Jiang, Xi
AU - Guo, Lei
AU - Cai, Xiaoyan
AU - Zhang, Shu
AU - Zhang, Tuo
AU - Zhu, Dajiang
AU - Shen, Dinggang
AU - Liu, Tianming
AU - Li, Xiang
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2024
Y1 - 2024
N2 - The 'Impression' section of a radiology report is a critical basis for communication between radiologists and other physicians. Typically written by radiologists, this part is derived from the 'Findings' section, which can be laborious and error-prone. Although deep-learning-based models, such as bidirectional encoder representation from transformers (BERT), have achieved promising results in automatic impression generation (AIG), such models often require substantial amounts of medical data and have poor generalization performance. Recently, large language models (LLMs) like Chat Generative Pre-trained Transformer (ChatGPT) have shown strong generalization capabilities and performance, but their performance in specific domains, such as radiology, remains under-investigated and potentially limited. To address this limitation, we propose ImpressionGPT, leveraging the contextual learning capabilities of LLMs through our dynamic prompt and iterative optimization algorithm to accomplish the AIG task. ImpressionGPT initially employs a small amount of domain-specific data to create a dynamic prompt, extracting contextual semantic information closely related to the test data. Subsequently, the iterative optimization algorithm automatically evaluates the output of LLMs and provides optimization suggestions, continuously refining the output results. The proposed ImpressionGPT model achieves superior performance of AIG task on both the Medical Information Mart for Intensive Care - Chest X-ray database (MIMIC-CXR) and Open Access Biomedical Image Search Engine (OpenI) datasets without requiring additional training data or fine-tuning the LLMs. This work presents a paradigm for localizing LLMs that can be applied in a wide range of similar application scenarios, bridging the gap between general-purpose LLMs and the specific language processing needs of various domains.
AB - The 'Impression' section of a radiology report is a critical basis for communication between radiologists and other physicians. Typically written by radiologists, this part is derived from the 'Findings' section, which can be laborious and error-prone. Although deep-learning-based models, such as bidirectional encoder representation from transformers (BERT), have achieved promising results in automatic impression generation (AIG), such models often require substantial amounts of medical data and have poor generalization performance. Recently, large language models (LLMs) like Chat Generative Pre-trained Transformer (ChatGPT) have shown strong generalization capabilities and performance, but their performance in specific domains, such as radiology, remains under-investigated and potentially limited. To address this limitation, we propose ImpressionGPT, leveraging the contextual learning capabilities of LLMs through our dynamic prompt and iterative optimization algorithm to accomplish the AIG task. ImpressionGPT initially employs a small amount of domain-specific data to create a dynamic prompt, extracting contextual semantic information closely related to the test data. Subsequently, the iterative optimization algorithm automatically evaluates the output of LLMs and provides optimization suggestions, continuously refining the output results. The proposed ImpressionGPT model achieves superior performance of AIG task on both the Medical Information Mart for Intensive Care - Chest X-ray database (MIMIC-CXR) and Open Access Biomedical Image Search Engine (OpenI) datasets without requiring additional training data or fine-tuning the LLMs. This work presents a paradigm for localizing LLMs that can be applied in a wide range of similar application scenarios, bridging the gap between general-purpose LLMs and the specific language processing needs of various domains.
KW - ChatGPT
KW - dynamic prompt
KW - iterative optimization
KW - radiology report summarization
UR - http://www.scopus.com/inward/record.url?scp=85185383267&partnerID=8YFLogxK
U2 - 10.1109/TAI.2024.3364586
DO - 10.1109/TAI.2024.3364586
M3 - 文章
AN - SCOPUS:85185383267
SN - 2691-4581
VL - 5
SP - 4163
EP - 4175
JO - IEEE Transactions on Artificial Intelligence
JF - IEEE Transactions on Artificial Intelligence
IS - 8
ER -