Multimodal regularized linear models with flux balance analysis for mechanistic integration of omics data

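Computer science; Data integration; Regularization (linguistics); Flux balance analysis; Upload; Data mining; Data type; Machine learning; Artificial intelligence; Bioinformatics; Biology; Operating system; Programming language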
Authors
Giuseppe Magazzù,Guido Zampieri,Claudio Angione
Identifier
DOI:10.1093/bioinformatics/btab324
Abstract

Motivation: High-throughput biological data, thanks to technological advances, have become cheaper to collect, leading to the availability of vast amounts of omic data of different types. In parallel, the in silico reconstruction and modeling of metabolic systems is now acknowledged as a key tool to complement experimental data on a large scale. The integration of this model- and data-driven information is therefore emerging as a new challenge in systems biology, with no clear guidance on how to better take advantage of the inherent multisource and multiomic nature of these data types while preserving mechanistic interpretation.

Results: Here, we investigate different regularization techniques for high-dimensional data derived from the integration of gene expression profiles with metabolic flux data, extracted from strain-specific metabolic models, to improve cellular growth rate predictions. To this end, we propose ad hoc extensions of previous regularization frameworks, including group, view-specific and principal component regularization, and experimentally compare them using data from 1143 Saccharomyces cerevisiae strains. We observe a divergence between methods in terms of regression accuracy and integration effectiveness based on the type of regularization employed. In multiomic regression tasks, when learning from experimental and model-generated omic data, our results demonstrate the competitiveness and ease of interpretation of multimodal regularized linear models compared to data-hungry methods based on neural networks.

Availability and implementation: All data, models and code produced in this work are available on GitHub at https://github.com/Angione-Lab/HybridGroupIPFLasso_pc2Lasso.

Supplementary information: Supplementary data are available at Bioinformatics online.
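To make the idea of view-specific regularization concrete, the following is a minimal sketch of a Lasso in which each data view (for example, gene expression versus model-derived fluxes) gets its own L1 penalty strength. It is an illustration under stated assumptions, not the authors' implementation: the function names, the toy data, the penalty values, and the choice of plain coordinate descent are all hypothetical and are not taken from the paper or its repository.

```python
def soft_threshold(r, lam):
    """Soft-thresholding operator used by Lasso coordinate descent."""
    if r > lam:
        return r - lam
    if r < -lam:
        return r + lam
    return 0.0


def view_penalized_lasso(X, y, views, view_penalties, n_sweeps=200):
    """Minimize (1/2n)*||y - Xb||^2 + sum_j lam[view(j)] * |b_j|.

    X: list of rows; y: list of targets; views[j]: view label of column j;
    view_penalties: dict mapping a view label to its penalty strength.
    No intercept is fitted, so columns are assumed to be centered
    (an assumption of this sketch).
    """
    n, p = len(X), len(X[0])
    b = [0.0] * p
    resid = list(y)  # residual y - Xb, with b = 0 initially
    # Mean squared value of each column (the coordinate-wise curvature).
    z = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_sweeps):
        for j in range(p):
            if z[j] == 0.0:
                continue  # an all-zero column carries no information
            # Covariance of column j with the partial residual (b_j removed).
            rho = sum(X[i][j] * (resid[i] + X[i][j] * b[j]) for i in range(n)) / n
            new_bj = soft_threshold(rho, view_penalties[views[j]]) / z[j]
            delta = new_bj - b[j]
            if delta != 0.0:
                for i in range(n):
                    resid[i] -= X[i][j] * delta
                b[j] = new_bj
    return b


# Toy data (hypothetical): column 0 stands in for a gene expression feature,
# column 1 for a flux feature; the target depends on column 0 only.
X = [[-1.0, 0.5], [0.0, -1.0], [1.0, 0.5], [2.0, 1.0], [-2.0, -1.0]]
y = [2.0 * row[0] for row in X]
views = ["expression", "flux"]
coef = view_penalized_lasso(X, y, views, {"expression": 0.0, "flux": 1e6})
print(coef)
```

Assigning a separate penalty to each view lets one modality be shrunk more aggressively than the other, which is the general intuition behind view-specific schemes. In practice the per-view penalties would be chosen by cross-validation rather than fixed by hand as in this toy call.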
