Supervised learning of high-confidence phenotypic subpopulations from single-cell data

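Phenotype; Computational biology; Feature selection; Dimensionality reduction; Computer science; Scalability; Categorical variable; Biology; Machine learning; Artificial intelligence; Gene; Genetics; Database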
Authors
Tao Ren,Canping Chen,Alexey V. Danilov,Susan Liu,Xiangnan Guan,Shunyi Du,Xiwei Wu,Mara H. Sherman,Paul T. Spellman,Lisa M. Coussens,Andrew C. Adey,Gordon B. Mills,Lingyun Wu,Zheng Xia
Source
Journal: Nature Machine Intelligence [Springer Nature]
Volume (issue): 5 (5): 528-541 Cited by: 3
Identifier
DOI:10.1038/s42256-023-00656-y
Abstract

Accurately identifying phenotype-relevant cell subsets from heterogeneous cell populations is crucial for delineating the underlying mechanisms driving biological or clinical phenotypes. Here by deploying a Learning with Rejection strategy, we developed a novel supervised learning framework called PENCIL to identify subpopulations associated with categorical or continuous phenotypes from single-cell data. By embedding a feature selection function into this flexible framework, for the first time, we were able to simultaneously select informative features and identify cell subpopulations, enabling accurate identification of phenotypic subpopulations otherwise missed by methods incapable of concurrent gene selection. Furthermore, the regression mode of PENCIL presents a novel ability for supervised phenotypic trajectory learning of subpopulations from single-cell data. We conducted comprehensive simulations to evaluate PENCIL's versatility in simultaneous gene selection, subpopulation identification and phenotypic trajectory prediction. PENCIL is fast and scalable to analyse one million cells within 1 h. Using the classification mode, PENCIL detected T-cell subpopulations associated with melanoma immunotherapy outcomes. Moreover, when applied to single-cell RNA sequencing of a patient with mantle cell lymphoma with drug treatment across multiple timepoints, the regression mode of PENCIL revealed a transcriptional treatment response trajectory. Collectively, our work introduces a scalable and flexible infrastructure to accurately identify phenotype-associated subpopulations from single-cell data.

To detect phenotype-related cell subpopulations from single-cell data, appropriate feature sets need to be chosen or learned simultaneously. Ren et al. present here a tool based on Learning with Rejection, a method that during training learns features from cells that can be predicted with high confidence, while cells that the model is not yet certain about are rejected.
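
The rejection idea described above can be illustrated with a minimal sketch. It evaluates a generic 0-1 learning-with-rejection objective, in which each cell is either kept and scored on its prediction or rejected at a fixed cost. This is not PENCIL's implementation: the function name `rejection_loss`, the `cost` value and the toy scores are assumptions made for illustration, and PENCIL's actual loss, gene selection and regression mode are not reproduced here.

```python
import numpy as np

def rejection_loss(h_scores, r_scores, labels, cost=0.3):
    """Empirical 0-1 learning-with-rejection loss (illustrative only).

    h_scores: predictor outputs; the sign gives the predicted class (+1 / -1).
    r_scores: rejector outputs; r > 0 keeps the cell, r <= 0 rejects it.
    labels:   true phenotype labels in {+1, -1}.
    cost:     hypothetical fixed penalty paid for each rejected cell.
    """
    h_scores = np.asarray(h_scores, dtype=float)
    r_scores = np.asarray(r_scores, dtype=float)
    labels = np.asarray(labels, dtype=float)

    accepted = r_scores > 0                    # cells the model commits to
    misclassified = (h_scores * labels) <= 0   # wrong sign counts as an error
    # Accepted cells pay 1 if misclassified, 0 otherwise; rejected cells pay `cost`.
    per_cell = np.where(accepted, misclassified.astype(float), cost)
    return per_cell.mean(), accepted

# Toy example: four cells with made-up scores and labels.
h = [2.0, -1.5, 0.2, -0.1]
r = [1.0, 0.8, -0.5, -0.2]
y = [1, -1, -1, 1]
loss, accepted = rejection_loss(h, r, y, cost=0.3)
print(loss, accepted)
```

In this toy case the two low-confidence cells are rejected, so each pays the rejection cost instead of a full misclassification penalty. With a cost below 0.5, abstaining on uncertain cells is cheaper than guessing, which is what lets training concentrate on the confidently predicted subpopulation.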