计算机科学
人工智能
机器学习
匹配(统计)
虚拟筛选
人工神经网络
功能(生物学)
数据挖掘
模式识别(心理学)
药物发现
生物信息学
数学
进化生物学
生物
统计
作者
Raed Khashan,Alexander Tropsha,Weifan Zheng
标识
DOI:10.1002/minf.202100248
摘要
Abstract Accurate prediction of binding poses is crucial to structure‐based drug design. We employ two powerful artificial intelligence (AI) approaches, data‐mining and machine‐learning, to design artificial neural network (ANN) based pose‐scoring function. It is a simple machine‐learning‐based statistical function that employs frequent geometric and chemical patterns of interacting atoms at protein‐ligand interfaces. The patterns are derived by mining interfaces of “native” protein‐ligand complexes. Each interface is represented by a graph where nodes are atoms and edges connect protein‐ligand interfacial atoms located within certain cutoff distance of each other. Applying frequent subgraph mining to these interfaces provides “native” frequent patterns of interacting atoms. Subsequently, given a pose for a protein‐ligand complex of interest, the pose‐scoring function (the information‐processing unit or neuron) calculates the degree of matching between the interaction patterns present at the pose's interface and the native frequent patterns. The pose‐scoring function takes into account the frequency of occurrence of the matching native patterns, the size of the match, and the degree of geometrical similarity between pose‐specific and matching native frequent patterns. This novel “multi‐body interaction” pose‐scoring function (MBI‐Score) was validated using two databases, PDBbind and Astex‐85, and it outperformed seven commonly used commercial scoring functions. MBI‐Score is available at www.khashanlab.org/mbi‐score .
科研通智能强力驱动
Strongly Powered by AbleSci AI