TY - JOUR
T1 - Learning to Summarize Chinese Radiology Findings with a Pre-Trained Encoder
AU - Jiang, Zuowei
AU - Cai, Xiaoyan
AU - Yang, Libin
AU - Gao, Dehong
AU - Zhao, Wei
AU - Han, Junwei
AU - Liu, Jun
AU - Shen, Dinggang
AU - Liu, Tianming
N1 - Publisher Copyright:
© 1964-2012 IEEE.
PY - 2023/12/1
Y1 - 2023/12/1
N2 - Automatic radiology report summarization has been an attractive research problem towards computer-aided diagnosis to alleviate physicians' workload in recent years. However, existing methods for English radiology report summarization using deep learning techniques cannot be directly applied to Chinese radiology reports due to limitations of the related corpus. In response to this, we propose an abstractive summarization approach for Chinese chest radiology report. Our approach involves the construction of a pre-training corpus using a Chinese medical-related pre-training dataset, and the collection of Chinese chest radiology reports from Department of Radiology at the Second Xiangya Hospital as the fine-tuning corpus. To improve the initialization of the encoder, we introduce a new task-oriented pre-training objective called Pseudo Summary Objective on the pre-training corpus. We then develop a Chinese pre-trained language model called Chinese medical BERT (CMBERT), which is used to initialize the encoder and fine-tuned on the abstractive summarization task. In testing our approach on a real large-scale hospital dataset, we observe that the performance of our proposed approach achieves outstanding improvement compared with other abstractive summarization models. This highlights the effectiveness of our approach in addressing the limitations of previous methods for Chinese radiology report summarization. Overall, our proposed approach demonstrates a promising direction for the automatic summarization of Chinese chest radiology reports, offering a viable solution to alleviate physicians' workload in the field of computer-aided diagnosis.
AB - Automatic radiology report summarization has been an attractive research problem towards computer-aided diagnosis to alleviate physicians' workload in recent years. However, existing methods for English radiology report summarization using deep learning techniques cannot be directly applied to Chinese radiology reports due to limitations of the related corpus. In response to this, we propose an abstractive summarization approach for Chinese chest radiology report. Our approach involves the construction of a pre-training corpus using a Chinese medical-related pre-training dataset, and the collection of Chinese chest radiology reports from Department of Radiology at the Second Xiangya Hospital as the fine-tuning corpus. To improve the initialization of the encoder, we introduce a new task-oriented pre-training objective called Pseudo Summary Objective on the pre-training corpus. We then develop a Chinese pre-trained language model called Chinese medical BERT (CMBERT), which is used to initialize the encoder and fine-tuned on the abstractive summarization task. In testing our approach on a real large-scale hospital dataset, we observe that the performance of our proposed approach achieves outstanding improvement compared with other abstractive summarization models. This highlights the effectiveness of our approach in addressing the limitations of previous methods for Chinese radiology report summarization. Overall, our proposed approach demonstrates a promising direction for the automatic summarization of Chinese chest radiology reports, offering a viable solution to alleviate physicians' workload in the field of computer-aided diagnosis.
KW - Abstractive summarization
KW - Chinese chest radiology report
KW - pre-trained language model
KW - task-oriented pre-training objective
UR - http://www.scopus.com/inward/record.url?scp=85162732532&partnerID=8YFLogxK
U2 - 10.1109/TBME.2023.3280987
DO - 10.1109/TBME.2023.3280987
M3 - 文章
C2 - 37314905
AN - SCOPUS:85162732532
SN - 0018-9294
VL - 70
SP - 3277
EP - 3287
JO - IEEE Transactions on Biomedical Engineering
JF - IEEE Transactions on Biomedical Engineering
IS - 12
ER -