本体论
计算机科学
注释
词汇
集合(抽象数据类型)
术语
情报检索
基因本体论
电池类型
受控词汇
细胞
计算生物学
自然语言处理
人工智能
生物
基因
遗传学
认识论
哲学
基因表达
程序设计语言
语言学
作者
Sheng Wang,Angela Oliveira Pisco,Jim Karkanias,Russ B. Altman
摘要
Abstract Single cell technologies have rapidly generated an unprecedented amount of data that enables us to understand biological systems at single-cell resolution. However, joint analysis of datasets generated by independent labs remains challenging due to a lack of consistent terminology to describe cell types. Here, we present OnClass, an algorithm and accompanying software for automatically classifying cells into cell types part of the controlled vocabulary that forms the Cell Ontology. A key advantage of OnClass is its capability to classify cells into cell types not present in the training data because it uses the Cell Ontology graph to infer cell type relationships. Furthermore, OnClass can be used to identify marker genes for all the cell ontology categories, independently of whether the cells types are present or absent in the training data, suggesting that OnClass can be used not only as an annotation tool for single cell datasets but also as an algorithm to identify marker genes specific to each term of the Cell Ontology, offering the possibility of refining the Cell Ontology using a data-centric approach.
科研通智能强力驱动
Strongly Powered by AbleSci AI