Ensembl公司
生物
基因组
基因
牛基因组
基因组计划
参考文献
遗传学
计算生物学
基因注释
转录组
参考基因组
基因预测
从头转录组组装
假基因
顺序装配
基因组学
RNA序列
癌症基因组测序
染色质
染色质免疫沉淀
蛋白质组学
嘉雅宠物
蛋白质基因组学
DNA测序
人类基因组
蛋白质组
发起人
功能基因组学
基因组浏览器
比较基因组学
基因组文库
序列分析
作者
Hamid Beiki,Brenda M. Murdoch,Carissa A. Park,Chandlar Kern,Denise Kontechy,Gabrielle M. Becker,Gonzalo Rincón,Honglin Jiang,Huaijun Zhou,Jacob W Thorne,James E. Koltes,Jennifer J. Michal,Kimberly M Davenport,Monique Rijnkels,Pablo J. Ross,Rui Hu,Sarah Corum,Stephanie McKay,Timothy P. L. Smith,Wansheng Liu
出处
期刊:GigaScience
[Oxford University Press]
日期:2024-01-01
卷期号:13
被引量:6
标识
DOI:10.1093/gigascience/giae019
摘要
Abstract Background The accurate identification of the functional elements in the bovine genome is a fundamental requirement for high-quality analysis of data informing both genome biology and genomic selection. Functional annotation of the bovine genome was performed to identify a more complete catalog of transcript isoforms across bovine tissues. Results A total of 160,820 unique transcripts (50% protein coding) representing 34,882 unique genes (60% protein coding) were identified across tissues. Among them, 118,563 transcripts (73% of the total) were structurally validated by independent datasets (PacBio isoform sequencing data, Oxford Nanopore Technologies sequencing data, de novo assembled transcripts from RNA sequencing data) and comparison with Ensembl and NCBI gene sets. In addition, all transcripts were supported by extensive data from different technologies such as whole transcriptome termini site sequencing, RNA Annotation and Mapping of Promoters for the Analysis of Gene Expression, chromatin immunoprecipitation sequencing, and assay for transposase-accessible chromatin using sequencing. A large proportion of identified transcripts (69%) were unannotated, of which 86% were produced by annotated genes and 14% by unannotated genes. A median of two 5′ untranslated regions were expressed per gene. Around 50% of protein-coding genes in each tissue were bifunctional and transcribed both coding and noncoding isoforms. Furthermore, we identified 3,744 genes that functioned as noncoding genes in fetal tissues but as protein-coding genes in adult tissues. Our new bovine genome annotation extended more than 11,000 annotated gene borders compared to Ensembl or NCBI annotations. The resulting bovine transcriptome was integrated with publicly available quantitative trait loci data to study tissue–tissue interconnection involved in different traits and construct the first bovine trait similarity network. Conclusions These validated results show significant improvement over current bovine genome annotations.
科研通智能强力驱动
Strongly Powered by AbleSci AI