计算机科学
生物标志物
图形
代表(政治)
生物标志物发现
源代码
人工智能
疾病
关系(数据库)
机器学习
数据挖掘
理论计算机科学
生物
蛋白质组学
基因
操作系统
政治
法学
生物化学
政治学
作者
Minghao Yang,Zhian Huang,Wenhao Gu,Kun Han,Wenying Pan,Xiao Yang,Zexuan Zhu
摘要
The associations between biomarkers and human diseases play a key role in understanding complex pathology and developing targeted therapies. Wet lab experiments for biomarker discovery are costly, laborious and time-consuming. Computational prediction methods can be used to greatly expedite the identification of candidate biomarkers.Here, we present a novel computational model named GTGenie for predicting the biomarker-disease associations based on graph and text features. In GTGenie, a graph attention network is utilized to characterize diverse similarities of biomarkers and diseases from heterogeneous information resources. Meanwhile, a pretrained BERT-based model is applied to learn the text-based representation of biomarker-disease relation from biomedical literature. The captured graph and text features are then integrated in a bimodal fusion network to model the hybrid entity representation. Finally, inductive matrix completion is adopted to infer the missing entries for reconstructing relation matrix, with which the unknown biomarker-disease associations are predicted. Experimental results on HMDD, HMDAD and LncRNADisease data sets showed that GTGenie can obtain competitive prediction performance with other state-of-the-art methods.The source code of GTGenie and the test data are available at: https://github.com/Wolverinerine/GTGenie.
科研通智能强力驱动
Strongly Powered by AbleSci AI