Identification of lncRNA-related protein-coding genes using multi-omics data based on deep learning and matrix completion

Meihong Gao, Xuequn Shang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Long noncoding RNAs (lncRNAs) can regulate the expression of protein-coding genes (PCGs) to cause disease. Identifying lncRNA-PCG associations (LGAs) is beneficial in revealing the pathogenic mechanism of lncRNA. Nevertheless, it remains challenging due to the heterogeneity of lncRNA expression and the complexity of its regulatory patterns. Biological experiments have been designed to identify LGAs, but they cannot be used on a large scale due to time and financial constraints. Therefore, the design of computational methods becomes crucial for LGA research. Here, we propose a new computational model, DNNMC, to reveal potential LGAs based on deep neural networks and inductive matrix completion using association and multi-omics data. We first integrated LGA and multi-omics similarity to construct lncRNA and PCG similarity networks. Subsequently, deep graph convolutional networks were used for feature learning of lncRNAs and PCGs. These learned features and the known LGA matrix were finally used as input to the inductive matrix completion module for predicting potential LGAs. Experimental results on three datasets demonstrated that DNNMC outperformed other machine learning methods in predicting LGA relationships. Furthermore, multi-omics features were shown to improve the performance of LGA identification. In conclusion, we propose a new LGA prediction method, DNNMC, which can effectively complete the LGA prediction task and help to reveal the regulatory mechanism of lncRNAs in diseases.

Original languageEnglish
Title of host publicationProceedings - 2022 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2022
EditorsDonald Adjeroh, Qi Long, Xinghua Shi, Fei Guo, Xiaohua Hu, Srinivas Aluru, Giri Narasimhan, Jianxin Wang, Mingon Kang, Ananda M. Mondal, Jin Liu
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages3307-3314
Number of pages8
ISBN (Electronic)9781665468190
DOIs
StatePublished - 2022
Event2022 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2022 - Las Vegas, United States
Duration: 6 Dec 20228 Dec 2022

Publication series

NameProceedings - 2022 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2022

Conference

Conference2022 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2022
Country/TerritoryUnited States
CityLas Vegas
Period6/12/228/12/22

Keywords

  • deep neural networks
  • inductive matrix completion
  • Long noncoding RNAs
  • protein-coding genes
  • similarity networks

Fingerprint

Dive into the research topics of 'Identification of lncRNA-related protein-coding genes using multi-omics data based on deep learning and matrix completion'. Together they form a unique fingerprint.

Cite this