管道(软件)
计算机科学
两亲性
嵌入
化学
吸附
生化工程
鉴定(生物学)
生物系统
肺表面活性物质
纳米技术
块(置换群论)
可视化
指纹(计算)
蛋白质-蛋白质相互作用
工艺工程
蛋白质结构
计算生物学
蛋白质折叠
人工智能
数据可视化
表征(材料科学)
植物蛋白
作者
Simha Sridharan,Rammile Ettelaie,Rik Sarkar,Maryam Afzali,Haji Dela,Brent S. Murray,Thomas A. Hazlehurst,Nicholas J Watson,Anwesha Sarkar
标识
DOI:10.26434/chemrxiv.15000208/v1
摘要
Designing amphiphilic block copolymers from fossil fuel-derived hydrocarbons is a cornerstone of colloid chemistry, enabling the creation of advanced polymeric surfactants. Despite the high attractiveness of low carbon-emitting plant proteins to defossilise surfactant processing, attempts in identifying plant proteins that will act as appropriate surfactants are somewhat hit and miss. Here, we present a data-driven pipeline to fingerprint plant protein surfactants by harnessing the diblock-like signature of a classic surfactant, integrating protein sequence data with statistical thermodynamics-based calculations, finetuned by machine learning. We demonstrate that protein sequence-level features do not in themselves allow for the prediction of adsorbed configuration. Instead, segmenting adsorbed proteins into blocks enabled identification of hundreds of plant proteins possessing a diblock-like signature, validated experimentally for optimal surface properties. Thus, by embedding adsorption configuration information into the classification process, we streamline the discovery of plant protein surfactants at scale, unlikely to be realised by purely empirical screening.
科研通智能强力驱动
Strongly Powered by AbleSci AI