计算机科学
水准点(测量)
标杆管理
推论
生物信息学
染色质
数据挖掘
表达式(计算机科学)
计算生物学
人工智能
基因
生物
业务
营销
生物化学
程序设计语言
地理
大地测量学
作者
Xiuwei Zhang,Hechen Li,Ziqi Zhang,Michael Squires,Xi Chen
出处
期刊:Research Square - Research Square
日期:2023-09-19
被引量:4
标识
DOI:10.21203/rs.3.rs-3301625/v1
摘要
Abstract Simulated single-cell data is essential for designing and evaluating computational methods in the absence of experimental ground truth. Existing simulators typically focus on modeling one or two specific biological factors or mechanisms that affect the output data, which limits their capacity to simulate the complexity and multi-modality in real data. Here, we present scMultiSim, an in silico simulator that generates multi-modal single-cell data, including gene expression, chromatin accessibility, RNA velocity, and spatial cell locations while accounting for the relationships between modalities. scMultiSim jointly models various biological factors that affect the output data, including cell identity, within-cell gene regulatory networks (GRNs), cell-cell interactions (CCIs), and chromatin accessibility, while also incorporating technical noises. Moreover, it allows users to adjust each factor's effect easily. We validated scMultiSim’s simulated biological effects and demonstrated its applications by benchmarking a wide range of computational tasks, including multi-modal and multi-batch data integration, RNA velocity estimation, GRN inference and CCI inference using spatially resolved gene expression data, many of them were not benchmarked before due to the lack of proper tools. Compared to existing simulators, scMultiSim can benchmark a much broader range of existing computational problems and even new potential tasks.
科研通智能强力驱动
Strongly Powered by AbleSci AI