Diffusion
Modal verb
Statistical physics
Computer science
Materials science
Physics
Thermodynamics
Polymer chemistry
Authors
Huaisheng Zhu, Teng Xiao, Vasant Honavar
Source
Journal: Cornell University - arXiv
Date: 2024-03-11
Citations: 1
Identifier
DOI:10.48550/arxiv.2403.07179
Abstract
Generating molecules with desired properties is a critical task with broad applications in drug discovery and materials design. Inspired by recent advances in large language models, there is growing interest in using natural language descriptions of molecules to generate molecules with the desired properties. Most existing methods focus on generating molecules that precisely match the text description. However, practical applications call for methods that generate diverse, and ideally novel, molecules with the desired properties. We propose 3M-Diffusion, a novel multi-modal molecular graph generation method, to address this challenge. 3M-Diffusion first encodes molecular graphs into a graph latent space aligned with text descriptions. It then reconstructs the molecular structure and atomic attributes based on the given text descriptions using the molecule decoder. Finally, it learns a probabilistic mapping from the text space to the latent molecular graph space using a diffusion model. The results of our extensive experiments on several datasets demonstrate that 3M-Diffusion can generate high-quality, novel, and diverse molecular graphs that semantically match the textual description provided.