计算机科学
人工智能
机器学习
边距(机器学习)
矩阵分解
图形
利用
代表(政治)
模式识别(心理学)
理论计算机科学
法学
政治学
政治
特征向量
物理
量子力学
计算机安全
作者
Peng Han,Pengling Yang,Peilin Zhao,Shuo Shang,Yong Liu,Jiayu Zhou,Xin Gao,Panos Kalnis
出处
期刊:Knowledge Discovery and Data Mining
日期:2019-07-25
被引量:106
标识
DOI:10.1145/3292500.3330912
摘要
Discovering disease-gene association is a fundamental and critical biomedical task, which assists biologists and physicians to discover pathogenic mechanism of syndromes. With various clinical biomarkers measuring the similarities among genes and disease phenotypes, network-based semi-supervised learning (NSSL) has been commonly utilized by these studies to address this class-imbalanced large-scale data issue. However, most existing NSSL approaches are based on linear models and suffer from two major limitations: 1) They implicitly consider a local-structure representation for each candidate; 2) They are unable to capture nonlinear associations between diseases and genes. In this paper, we propose a new framework for disease-gene association task by combining Graph Convolutional Network (GCN) and matrix factorization, named GCN-MF. With the help of GCN, we could capture non-linear interactions and exploit measured similarities. Moreover, we define a margin control loss function to reduce the effect of sparsity. Empirical results demonstrate that the proposed deep learning algorithm outperforms all other state-of-the-art methods on most of metrics.
科研通智能强力驱动
Strongly Powered by AbleSci AI