TY - GEN
T1 - MTGCDA
T2 - 2025 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2025
AU - Liang, Sizhe
AU - Wang, Lei
AU - You, Zhuhong
AU - Yu, Changqing
AU - Shi, Tailong
AU - Jiang, Chen
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Circular RNA (circRNA) has a stable structure and tissue-specific expression, which is of great value in the diagnosis and treatment of diseases. However, complex biological relationships and heterogeneous data hinder the prediction of circRNA-disease associations, resulting in challenges such as weak semantic representation and information loss. To address this problem, we propose a transformer-based multi-source heterogeneous graph model MTGCDA. The model first aggregates various biological data sources related to circRNA and disease to construct a heterogeneous graph containing various types of nodes and relationships. By applying a specialized heterogeneous graph neural network, the unique structural and contextual properties of different biological entities are captured. Subsequently, the circular RNA and disease node embeddings derived from multi-layer heterogeneous graph convolutional networks are combined to form a comprehensive joint representation. The fused embeddings are then processed by a CatBoost classifier to accurately estimate the likelihood of potential associations. Experiments on the CircR2Disease dataset show that MTGCDA achieves an AUC of 0.9756, outperforming existing methods. In addition, 9 of the 10 best predictions have been validated by literature, demonstrating the effectiveness and biological relevance of the model.
AB - Circular RNA (circRNA) has a stable structure and tissue-specific expression, which is of great value in the diagnosis and treatment of diseases. However, complex biological relationships and heterogeneous data hinder the prediction of circRNA-disease associations, resulting in challenges such as weak semantic representation and information loss. To address this problem, we propose a transformer-based multi-source heterogeneous graph model MTGCDA. The model first aggregates various biological data sources related to circRNA and disease to construct a heterogeneous graph containing various types of nodes and relationships. By applying a specialized heterogeneous graph neural network, the unique structural and contextual properties of different biological entities are captured. Subsequently, the circular RNA and disease node embeddings derived from multi-layer heterogeneous graph convolutional networks are combined to form a comprehensive joint representation. The fused embeddings are then processed by a CatBoost classifier to accurately estimate the likelihood of potential associations. Experiments on the CircR2Disease dataset show that MTGCDA achieves an AUC of 0.9756, outperforming existing methods. In addition, 9 of the 10 best predictions have been validated by literature, demonstrating the effectiveness and biological relevance of the model.
UR - https://www.scopus.com/pages/publications/105033538953
U2 - 10.1109/BIBM66473.2025.11356712
DO - 10.1109/BIBM66473.2025.11356712
M3 - 会议稿件
AN - SCOPUS:105033538953
T3 - Proceedings - 2025 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2025
SP - 1076
EP - 1081
BT - Proceedings - 2025 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2025
A2 - Liu, Juan
A2 - Huang, Jingshan
A2 - Wang, Xiaowo
A2 - Zhang, Fa
A2 - Zou, Xiufen
A2 - Tian, Tian
A2 - Hu, Xiaohua
A2 - Hu, Bin
A2 - Xiong, Yi
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 15 December 2025 through 18 December 2025
ER -