片段(逻辑)
指纹(计算)
计算机科学
人工智能
指纹识别
计算生物学
计算机视觉
算法
生物
作者
Tim Cofala,Oliver Krämer
标识
DOI:10.1145/3512290.3528824
摘要
For in silico drug discovery various representations have been established regarding storing and processing molecular data. The choice of representation has a great impact on employed methods and algorithms. Molecular fingerprints in the form of fixed-size bit vectors are a widely used representation which captures structural features of a molecule and enables a straight-forward way of estimating molecule similarities. However, since fingerprints are not invertible, they are rarely utilized for molecule generation tasks. This study presents an approach to the reconstruction of molecules from their fingerprint representation that is based on genetic algorithms. The algorithm assembles molecules from BRICS fragments and therefore only generates valid molecular structures. We demonstrate that the genetic algorithm is able to construct molecules similar to the specified target, or even reconstruct the original molecule. Furthermore, to illustrate how this genetic algorithm unlocks fingerprints as a representation for other in silico drug discovery methods, a novel Transformer neural language model trained on molecular fingerprints is introduced as a molecule generation model.
科研通智能强力驱动
Strongly Powered by AbleSci AI