计算机科学
人工智能
图形
核(代数)
模式识别(心理学)
机器学习
数据挖掘
理论计算机科学
数学
组合数学
作者
Shengpeng Yu,Hong Wang,Meifang Hua,Cheng Liang,Yanshen Sun
标识
DOI:10.1016/j.eswa.2024.124092
摘要
Predicting microbe-disease associations (MDA) is crucial for proactively demystifying diseases causes and preventing them. Traditional prediction methods endure labor-intensive, time-consuming, and expensive. Therefore, this paper proposes CasMF-GCL, a novel Graph Contrastive Learning model based on sparse relation augmentation and a Cascaded Multi-kernel Fusion Network for MDA prediction. CasMF-GCL maximizes the use of sparse correlation information and eliminates data noise through an association augmentation technique based on low-rank sparse matrix completion. We employ diverse strategies to enhance the features of heterogeneous networks by exploring the similarity between microbes and diseases. Our model employs a new multi-layer graph convolutional network variant with a cascaded multi-kernel fusion mechanism, enabling weighted coding of local and global features for diseases and microbes. To further improve performance, we incorporate a self-supervised contrastive learning schema using multi-grain disease and microbe features and large-scale relation augmentations. Extensive experiments on two renowned datasets demonstrate that CasMF-GCL outperforms current state-of-the-art methods in seven indexes. The AUC values from 5-fold cross-validation are 0.997 and 0.989 for the two datasets, respectively. Ablation studies confirm the effectiveness of graph data augmentation, the power of the contrastive strategy, and the indispensability of the cascaded multi-kernel fusion network. Furthermore, case studies validate the prediction performance of CasMF-GCL.
科研通智能强力驱动
Strongly Powered by AbleSci AI