TY - JOUR
T1 - Prediction of drug-target interactions by ensemble learning method from protein sequence and drug fingerprint
AU - Zhan, Xinke
AU - You, Zhu Hong
AU - Cai, Jinfan
AU - Li, Liping
AU - Yu, Changqing
AU - Pan, Jie
AU - Kong, Jiangkun
N1 - Publisher Copyright:
© 2020 Institute of Electrical and Electronics Engineers Inc.. All rights reserved.
PY - 2020
Y1 - 2020
N2 - Predicting the target-drug interactions (DITs) is of great important for screening new drug candidate and understanding biological processes. However, identifying the drug-target interactions through traditional experiments is still costly, laborious and complicated. Thus, there is a great need for developing reliable computational methods to effectively predict DTIs. In this study, we report a novel computational method combining local optimal oriented pattern (LOOP), Position Specific Scoring Matrix (PSSM) and Rotation Forest (RF) for predicting DTI. Specifically, the target protein sequence is firstly transformed as the PSSM, in which the evolutionary information of protein is retained. Then, the LOOP is used to extract the feature vectors from PSSM, and the sub-structure information of drug molecule is represented as fingerprint features. Finally, RF classifier is adopted to infer the potential drug-target interactions. When the experiment is carried out on four benchmark datasets including enzyme, ion channel, G protein-coupled receptors (GPCRs), and nuclear receptor, we achieved the high average prediction accuracies of 89.09%, 87.53%, 82.05%, and 73.33% respectively. For further evaluating the proposed method, we compare the prediction performance of the proposed method with the state-of-the-art support vector machine (SVM) and K-Nearest Neighbor (KNN). The comprehensive experimental results illustrate that the proposed method is reliable and efficiency for predicting DTIs. It is anticipated that the proposed method can become a useful tool for predicting a large-scale potential DTIs.
AB - Predicting the target-drug interactions (DITs) is of great important for screening new drug candidate and understanding biological processes. However, identifying the drug-target interactions through traditional experiments is still costly, laborious and complicated. Thus, there is a great need for developing reliable computational methods to effectively predict DTIs. In this study, we report a novel computational method combining local optimal oriented pattern (LOOP), Position Specific Scoring Matrix (PSSM) and Rotation Forest (RF) for predicting DTI. Specifically, the target protein sequence is firstly transformed as the PSSM, in which the evolutionary information of protein is retained. Then, the LOOP is used to extract the feature vectors from PSSM, and the sub-structure information of drug molecule is represented as fingerprint features. Finally, RF classifier is adopted to infer the potential drug-target interactions. When the experiment is carried out on four benchmark datasets including enzyme, ion channel, G protein-coupled receptors (GPCRs), and nuclear receptor, we achieved the high average prediction accuracies of 89.09%, 87.53%, 82.05%, and 73.33% respectively. For further evaluating the proposed method, we compare the prediction performance of the proposed method with the state-of-the-art support vector machine (SVM) and K-Nearest Neighbor (KNN). The comprehensive experimental results illustrate that the proposed method is reliable and efficiency for predicting DTIs. It is anticipated that the proposed method can become a useful tool for predicting a large-scale potential DTIs.
KW - Drug-target interaction
KW - Local optimal oriented pattern
KW - Position specific scoring matrix
KW - Rotation forest
UR - http://www.scopus.com/inward/record.url?scp=85102806441&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2020.3026479
DO - 10.1109/ACCESS.2020.3026479
M3 - 文章
AN - SCOPUS:85102806441
SN - 2169-3536
VL - 8
SP - 185465
EP - 185476
JO - IEEE Access
JF - IEEE Access
ER -