生物
遗传学
单倍型
编码区
连续性
基因组学
进化生物学
基因
计算生物学
等位基因
基因组
生态学
作者
Yang Gao,Xiaofei Yang,Hao Chen,Xinjiang Tan,Zhaoqing Yang,Lian Deng,Baonan Wang,Shuang Kong,Songyang Li,Yuhang Cui,Chang Lei,Yimin Wang,Yuwen Pan,Sen Ma,Hao Sun,Xiaohan Zhao,Yingbing Shi,Ziyi Yang,Dong‐Dong Wu,Shaoyuan Wu
出处
期刊:Nature
[Nature Portfolio]
日期:2023-06-14
卷期号:619 (7968): 112-121
被引量:84
标识
DOI:10.1038/s41586-023-06173-7
摘要
Human genomics is witnessing an ongoing paradigm shift from a single reference sequence to a pangenome form, but populations of Asian ancestry are underrepresented. Here we present data from the first phase of the Chinese Pangenome Consortium, including a collection of 116 high-quality and haplotype-phased de novo assemblies based on 58 core samples representing 36 minority Chinese ethnic groups. With an average 30.65× high-fidelity long-read sequence coverage, an average contiguity N50 of more than 35.63 megabases and an average total size of 3.01 gigabases, the CPC core assemblies add 189 million base pairs of euchromatic polymorphic sequences and 1,367 protein-coding gene duplications to GRCh38. We identified 15.9 million small variants and 78,072 structural variants, of which 5.9 million small variants and 34,223 structural variants were not reported in a recently released pangenome reference
科研通智能强力驱动
Strongly Powered by AbleSci AI