计算机科学
财产(哲学)
人工智能
自然语言处理
认识论
哲学
作者
Jie Zhou,Qichang Zhao,Pengcheng Shu,Xiang Tang,Jianxin Wang
标识
DOI:10.1109/jbhi.2025.3616354
摘要
Accurate prediction of molecular properties is essential for drug discovery. While single modal molecular representations have shown promising results, their generalizability remains limited due to data sparsity and the inherent incompleteness of single modal characterizations. To overcome these limitations, we propose GSST, a novel multimodal pretraining framework that integrates graph-based molecular representations with SMILES sequences through learnable soft SMILES tokens. At the core of GSST is the G2S-Former module, which injects topological information from the graph-based representation into the soft SMILES tokens to enable effective cross-modal interaction while preserving modality-specific features. Extensive experiments on the MoleculeNet and MoleculeACE benchmarks demonstrate that GSST consistently outperforms state-of-the-art methods in molecular property prediction and activity cliff assessment. These results underscore the importance of effective multimodal alignment in capturing shared molecular patterns and alleviating the challenges posed by limited labeled data. GSST represents a scalable and high-throughput approach with significant potential to advance drug discovery.
科研通智能强力驱动
Strongly Powered by AbleSci AI