序列(生物学)
匹配(统计)
计算生物学
蛋白质测序
序列数据库
基因组
计算机科学
序列比对
DNA测序
生物
数据挖掘
遗传学
肽序列
DNA
基因
数学
统计
作者
Gastón H. Gonnet,Mark A. Cohen,Steven A. Benner
出处
期刊:Science
[American Association for the Advancement of Science]
日期:1992-06-05
卷期号:256 (5062): 1443-1445
被引量:841
标识
DOI:10.1126/science.1604319
摘要
The entire protein sequence database has been exhaustively matched. Definitive mutation matrices and models for scoring gaps were obtained from the matching and used to organize the sequence database as sets of evolutionarily connected components. The methods developed are general and can be used to manage sequence data generated by major genome sequencing projects. The alignments made possible by the exhaustive matching are the starting point for successful de novo prediction of the folded structures of proteins, for reconstructing sequences of ancient proteins and metabolisms in ancient organisms, and for obtaining new perspectives in structural biochemistry.
科研通智能强力驱动
Strongly Powered by AbleSci AI