人工智能
支持向量机
计算生物学
计算机科学
机器学习
数量结构-活动关系
主成分分析
生物
作者
Abir Sarraj-Laabidi,Habib Messai,Asma Hammami-Semmar,Nabil Semmar
标识
DOI:10.2174/1568026617666170719165552
摘要
Metabolisms represent highly organized systems characterized by strong regulations satisfying the mass conservation principle. This makes a whole chemical resource to be competitively shared between several ways at both intra-and inter-molecular scales. Whole resource sharing can be statistically associated with a constant sum-unit constraint which represents the basis of simplex mixture rule. In this work, a new simplex-based simulation approach was developed to learn scaffold information on metabolic processes controlling molecular diversity from a wide set of observed chemical structures. Starting from a dataset of chemical structures classified into p clusters, a machine learning process was applied by linearly combining the p clusters j and randomly sampling a constant number (n) of molecules according to different clusters’ weights (wj/w) given by Scheffe’s mixture matrix. At the output of mixture design, molecular linear combinations lead to calculate barycentric molecules integrating the characteristics of the different weighted clusters. The N mixtures-design was iterated by bootstrap technique leading to extensive exploration of chemical variability between and within clusters. Finally, the K response matrices issued from K iterated mixture designs were averaged to calculate a smoothed matrix containing scaffold information on regulation processes responsible for molecular diversification at inter- and intra-molecular (atomic) scales. This matrix was used as a backbone for graphical analysis of positive and negative trends between atomic characteristics (chemical substitutions) at both mentioned scales. This new simplex approach was illustrated by cycloartane-based saponins of Astragalus genus by combining three desmosylation clusters characterized by relative glycosylation levels of different aglycones’ carbons.
科研通智能强力驱动
Strongly Powered by AbleSci AI