TY - GEN
T1 - Identification of lncRNA-related protein-coding genes using multi-omics data based on deep learning and matrix completion
AU - Gao, Meihong
AU - Shang, Xuequn
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Long noncoding RNAs (lncRNAs) can regulate the expression of protein-coding genes (PCGs) to cause disease. Identifying lncRNA-PCG associations (LGAs) is beneficial in revealing the pathogenic mechanism of lncRNA. Nevertheless, it remains challenging due to the heterogeneity of lncRNA expression and the complexity of its regulatory patterns. Biological experiments have been designed to identify LGAs, but they cannot be used on a large scale due to time and financial constraints. Therefore, the design of computational methods becomes crucial for LGA research. Here, we propose a new computational model, DNNMC, to reveal potential LGAs based on deep neural networks and inductive matrix completion using association and multi-omics data. We first integrated LGA and multi-omics similarity to construct lncRNA and PCG similarity networks. Subsequently, deep graph convolutional networks were used for feature learning of lncRNAs and PCGs. These learned features and the known LGA matrix were finally used as input to the inductive matrix completion module for predicting potential LGAs. Experimental results on three datasets demonstrated that DNNMC outperformed other machine learning methods in predicting LGA relationships. Furthermore, multi-omics features were shown to improve the performance of LGA identification. In conclusion, we propose a new LGA prediction method, DNNMC, which can effectively complete the LGA prediction task and help to reveal the regulatory mechanism of lncRNAs in diseases.
AB - Long noncoding RNAs (lncRNAs) can regulate the expression of protein-coding genes (PCGs) to cause disease. Identifying lncRNA-PCG associations (LGAs) is beneficial in revealing the pathogenic mechanism of lncRNA. Nevertheless, it remains challenging due to the heterogeneity of lncRNA expression and the complexity of its regulatory patterns. Biological experiments have been designed to identify LGAs, but they cannot be used on a large scale due to time and financial constraints. Therefore, the design of computational methods becomes crucial for LGA research. Here, we propose a new computational model, DNNMC, to reveal potential LGAs based on deep neural networks and inductive matrix completion using association and multi-omics data. We first integrated LGA and multi-omics similarity to construct lncRNA and PCG similarity networks. Subsequently, deep graph convolutional networks were used for feature learning of lncRNAs and PCGs. These learned features and the known LGA matrix were finally used as input to the inductive matrix completion module for predicting potential LGAs. Experimental results on three datasets demonstrated that DNNMC outperformed other machine learning methods in predicting LGA relationships. Furthermore, multi-omics features were shown to improve the performance of LGA identification. In conclusion, we propose a new LGA prediction method, DNNMC, which can effectively complete the LGA prediction task and help to reveal the regulatory mechanism of lncRNAs in diseases.
KW - deep neural networks
KW - inductive matrix completion
KW - Long noncoding RNAs
KW - protein-coding genes
KW - similarity networks
UR - http://www.scopus.com/inward/record.url?scp=85146723596&partnerID=8YFLogxK
U2 - 10.1109/BIBM55620.2022.9995428
DO - 10.1109/BIBM55620.2022.9995428
M3 - 会议稿件
AN - SCOPUS:85146723596
T3 - Proceedings - 2022 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2022
SP - 3307
EP - 3314
BT - Proceedings - 2022 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2022
A2 - Adjeroh, Donald
A2 - Long, Qi
A2 - Shi, Xinghua
A2 - Guo, Fei
A2 - Hu, Xiaohua
A2 - Aluru, Srinivas
A2 - Narasimhan, Giri
A2 - Wang, Jianxin
A2 - Kang, Mingon
A2 - Mondal, Ananda M.
A2 - Liu, Jin
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2022
Y2 - 6 December 2022 through 8 December 2022
ER -