注释
领域(数学分析)
生物
公共领域
蛋白质结构域
一致性(知识库)
计算生物学
数据库
蛋白质结构数据库
序列比对
保守序列
序列(生物学)
计算机科学
情报检索
序列数据库
生物信息学
遗传学
肽序列
基因
人工智能
数学
神学
哲学
数学分析
作者
Aron Marchler‐Bauer,Myra K. Derbyshire,Noreen R. Gonzales,Shennan Lu,Farideh Chitsaz,Lewis Y. Geer,Renata C. Geer,Jane He,Marc Gwadz,David I. Hurwitz,Christopher J. Lanczycki,Fu Lu,Gabriele H. Marchler,James S. Song,Narmada Thanki,Zhouxi Wang,Roxanne A. Yamashita,Dachuan Zhang,Chanjuan Zheng,Stephen H. Bryant
摘要
NCBI's CDD, the Conserved Domain Database, enters its 15(th) year as a public resource for the annotation of proteins with the location of conserved domain footprints. Going forward, we strive to improve the coverage and consistency of domain annotation provided by CDD. We maintain a live search system as well as an archive of pre-computed domain annotation for sequences tracked in NCBI's Entrez protein database, which can be retrieved for single sequences or in bulk. We also maintain import procedures so that CDD contains domain models and domain definitions provided by several collections available in the public domain, as well as those produced by an in-house curation effort. The curation effort aims at increasing coverage and providing finer-grained classifications of common protein domains, for which a wealth of functional and structural data has become available. CDD curation generates alignment models of representative sequence fragments, which are in agreement with domain boundaries as observed in protein 3D structure, and which model the structurally conserved cores of domain families as well as annotate conserved features. CDD can be accessed at http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml.
科研通智能强力驱动
Strongly Powered by AbleSci AI