计算机科学
人工神经网络
特征(语言学)
图形
人工智能
图论
模式识别(心理学)
理论计算机科学
数学
组合数学
语言学
哲学
作者
Yuan Zhang,Juan Wang,Jiajie Xing,X. T. Chen
标识
DOI:10.1109/jbhi.2025.3549509
摘要
Predicting gene-disease associations is essential for understanding disease pathogenesis and determining therapeutic targets. While prior methods have integrated diverse biological information to make predictions, they still encounter several challenges. First, incomplete and sparse gene-disease association data constrain model performance. Second, integrating heterogeneous data sources is not straightforward. To address these challenges, we propose a novel method, DAVGAE, which combines data augmentation, Variational Graph Auto-Encoders (VGAE), and attention mechanisms. DAVGAE integrates both the biological and topological features of genes and diseases to address challenges such as data sparsity and heterogeneity. By leveraging these features, it calculates cosine similarity scores for gene-disease pairs and applies a novel data augmentation strategy to enhance association data by selecting gene-disease associations with higher similarity scores. Using a four-layer Graph Neural Network (GNN) encoder, DAVGAE effectively learns robust and discriminative representations for genes and diseases within the association network. Finally, an inner product decoder predicts association scores for all gene-disease pairs. Comprehensive experiments on three gene-disease association datasets reveal that DAVGAE outperforms baseline models in predicting gene-disease associations. DAVGAE is freely available at https://github.com/imustu/DAVGAE.
科研通智能强力驱动
Strongly Powered by AbleSci AI