TY - JOUR
T1 - BioRAGent
T2 - natural language biomedical querying with retrieval-augmented multiagent systems
AU - Bi, Manlian
AU - Bao, Zhijie
AU - Xie, Dongna
AU - Xie, Xiaohan
AU - Yang, Changxiao
AU - Wang, Tao
AU - Wang, Yongtian
AU - Peng, Jiajie
N1 - Publisher Copyright:
© The Author(s) 2025. Published by Oxford University Press.
PY - 2025/9/1
Y1 - 2025/9/1
N2 - Understanding the roles of genes, phenotypes, and diseases is crucial for advancing biomedical research. However, efficient and accessible retrieval of biomedical knowledge remains a challenge due to the complexity of the relevant data. We introduce BioRAGent, an intelligent biomedical assistant that combines Tool-augmented retrieval-augmented generation (RAG) with a multiagent system. Leveraging the ability of large language models, BioRAGent facilitates natural language queries about genes, phenotypes, diseases, and their interrelationships. BioRAGent employs three specialized agents: Guide (query optimization), Retriever (data retrieval), and Reviewer (answer validation) to access authoritative biomedical databases and to generate accurate responses. We evaluate the performance of BioRAGent on a benchmark of eleven single-hop and three multi-hop tasks, demonstrating superior results compared with state-of-the-art models. User evaluations highlight the practicality and robust user experience of BioRAGent, particularly in handling complex multi-hop queries. Moreover, ablation experiments validate the contribution of each agent in improving retrieval accuracy.
AB - Understanding the roles of genes, phenotypes, and diseases is crucial for advancing biomedical research. However, efficient and accessible retrieval of biomedical knowledge remains a challenge due to the complexity of the relevant data. We introduce BioRAGent, an intelligent biomedical assistant that combines Tool-augmented retrieval-augmented generation (RAG) with a multiagent system. Leveraging the ability of large language models, BioRAGent facilitates natural language queries about genes, phenotypes, diseases, and their interrelationships. BioRAGent employs three specialized agents: Guide (query optimization), Retriever (data retrieval), and Reviewer (answer validation) to access authoritative biomedical databases and to generate accurate responses. We evaluate the performance of BioRAGent on a benchmark of eleven single-hop and three multi-hop tasks, demonstrating superior results compared with state-of-the-art models. User evaluations highlight the practicality and robust user experience of BioRAGent, particularly in handling complex multi-hop queries. Moreover, ablation experiments validate the contribution of each agent in improving retrieval accuracy.
KW - biomedical knowledge retrieval
KW - large language models
KW - multiagent system
KW - retrieval-augmented generation
UR - https://www.scopus.com/pages/publications/105018527985
U2 - 10.1093/bib/bbaf539
DO - 10.1093/bib/bbaf539
M3 - 文章
C2 - 41081729
AN - SCOPUS:105018527985
SN - 1467-5463
VL - 26
JO - Briefings in Bioinformatics
JF - Briefings in Bioinformatics
IS - 5
M1 - bbaf539
ER -