生物
遗传学
基因组
基因
染色体
Y染色体
人类基因组
绿色果蝇
DNA测序
常染色质
X染色体
计算生物学
果蝇属(亚属)
异染色质
作者
Antonio Bernardo Carvalho,Andrew G. Clark
出处
期刊:Genome Research
[Cold Spring Harbor Laboratory Press]
日期:2013-08-06
卷期号:23 (11): 1894-1907
被引量:105
标识
DOI:10.1101/gr.156034.113
摘要
Notwithstanding their biological importance, Y chromosomes remain poorly known in most species. A major obstacle to their study is the identification of Y chromosome sequences; due to its high content of repetitive DNA, in most genome projects, the Y chromosome sequence is fragmented into a large number of small, unmapped scaffolds. Identification of Y-linked genes among these fragments has yielded important insights about the origin and evolution of Y chromosomes, but the process is labor intensive, restricting studies to a small number of species. Apart from these fragmentary assemblies, in a few mammalian species, the euchromatic sequence of the Y is essentially complete, owing to painstaking BAC mapping and sequencing. Here we use female short-read sequencing and k -mer comparison to identify Y-linked sequences in two very different genomes, Drosophila virilis and human. Using this method, essentially all D. virilis scaffolds were unambiguously classified as Y-linked or not Y-linked. We found 800 new scaffolds (totaling 8.5 Mbp), and four new genes in the Y chromosome of D. virilis , including JYalpha , a gene involved in hybrid male sterility. Our results also strongly support the preponderance of gene gains over gene losses in the evolution of the Drosophila Y. In the intensively studied human genome, used here as a positive control, we recovered all previously known genes or gene families, plus a small amount (283 kb) of new, unfinished sequence. Hence, this method works in large and complex genomes and can be applied to any species with sex chromosomes.
科研通智能强力驱动
Strongly Powered by AbleSci AI