GNN-based embedding for clustering scRNA-seq data

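Cluster analysis; Computer science; Data mining; Graph; Embedding; Autoencoder; Profiling (computer programming); Graph embedding; Artificial intelligence; Machine learning; Artificial neural network; Theoretical computer science; Operating system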
Authors
Madalina Ciortan,Matthieu Defrance
Source
Journal: Bioinformatics [Oxford University Press]
Volume (issue): 38 (4): 1037-1044 Cited by: 11
Identifier
DOI:10.1093/bioinformatics/btab787
Abstract

Motivation: Single-cell RNA sequencing (scRNA-seq) provides transcriptomic profiling for individual cells, allowing researchers to study the heterogeneity of tissues, recognize rare cell identities and discover new cellular subtypes. Clustering analysis is usually used to predict cell class assignments and infer cell identities. However, the high sparsity of scRNA-seq data, accentuated by dropout events, generates challenges that have motivated the development of numerous dedicated clustering methods. Nevertheless, there is still no consensus on the best performing method.

Results: graph-sc is a new method leveraging a graph autoencoder network to create embeddings for scRNA-seq cell data. While this work analyzes the performance of clustering the embeddings with various clustering algorithms, other downstream tasks can also be performed. A broad experimental study has been performed on both simulated and scRNA-seq datasets. The results indicate that although there is no consistently best method across all the analyzed datasets, graph-sc compares favorably to competing techniques across all types of datasets. Furthermore, the proposed method is stable across consecutive runs, robust to input down-sampling, generally insensitive to changes in the network architecture or training parameters and more computationally efficient than other competing methods based on neural networks. Modeling the data as a graph provides increased flexibility to define custom features characterizing the genes, the cells and their interactions. Moreover, external data (e.g. gene network) can easily be integrated into the graph and used seamlessly under the same optimization task.

Availability and implementation: https://github.com/ciortanmadalina/graph-sc.

Supplementary information: Supplementary data are available at Bioinformatics online.
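To make the idea of modeling expression data as a graph more concrete, here is a minimal sketch in plain Python. It rests on assumptions that the abstract does not confirm: a bipartite cell-gene graph with one edge per non-zero count, edge weights equal to the fraction of a cell's total counts, a toy count matrix, and hand-picked gene features. All function names are hypothetical. The single weighted aggregation step is only a stand-in for message passing and is not the trained graph autoencoder used by graph-sc; the linked repository has the actual implementation.

```python
# Illustrative sketch only: names, weighting scheme and data are assumptions,
# not taken from the graph-sc code base.

def build_cell_gene_graph(counts):
    """Return weighted edges (cell, gene, weight) for every non-zero count."""
    edges = []
    for cell, row in enumerate(counts):
        total = sum(row)
        if total == 0:
            continue  # a cell with no counts contributes no edges
        for gene, value in enumerate(row):
            if value > 0:
                # Assumed weighting: share of the cell's total counts.
                edges.append((cell, gene, value / total))
    return edges


def aggregate_cell_embeddings(edges, gene_features, n_cells):
    """One aggregation step: each cell sums its neighbouring gene features,
    scaled by the edge weight."""
    dim = len(gene_features[0])
    embeddings = [[0.0] * dim for _ in range(n_cells)]
    for cell, gene, weight in edges:
        for k in range(dim):
            embeddings[cell][k] += weight * gene_features[gene][k]
    return embeddings


# Toy count matrix: 4 cells x 3 genes (made-up values).
counts = [
    [4, 0, 0],
    [3, 1, 0],
    [0, 0, 5],
    [0, 1, 4],
]
# Hypothetical 2-dimensional gene features.
gene_features = [[1.0, 0.0], [0.5, 0.5], [0.0, 1.0]]

edges = build_cell_gene_graph(counts)
embeddings = aggregate_cell_embeddings(edges, gene_features, len(counts))
for cell, vector in enumerate(embeddings):
    print(cell, vector)
```

In this toy example, cells dominated by the first gene land near one corner of the embedding space and cells dominated by the third gene near the other, so any standard clustering algorithm could separate them. In the method described in the abstract, the embeddings are instead learned by a graph autoencoder.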