First Steps in the Analysis of Prokaryotic Pan-Genomes

基因组 生物 基因组计划 基因 GenBank公司 遗传学 细菌基因组大小 比较基因组学 计算生物学 基因组学
作者
Sávio Souza Costa,Luís Carlos Guimarães,Artur M. S. Silva,Siomar C. Soares,Rafael Azevedo Baraúna
出处
期刊:Bioinformatics and Biology Insights [SAGE Publishing]
卷期号:14: 117793222093806-117793222093806 被引量:46
标识
DOI:10.1177/1177932220938064
摘要

Pan-genome is defined as the set of orthologous and unique genes of a specific group of organisms. The pan-genome is composed by the core genome, accessory genome, and species- or strain-specific genes. The pan-genome is considered open or closed based on the alpha value of the Heap law. In an open pan-genome, the number of gene families will continuously increase with the addition of new genomes to the analysis, while in a closed pan-genome, the number of gene families will not increase considerably. The first step of a pan-genome analysis is the homogenization of genome annotation. The same software should be used to annotate genomes, such as GeneMark or RAST. Subsequently, several software are used to calculate the pan-genome such as BPGA, GET_HOMOLOGUES, PGAP, among others. This review presents all these initial steps for those who want to perform a pan-genome analysis, explaining key concepts of the area. Furthermore, we present the pan-genomic analysis of 9 bacterial species. These are the species with the highest number of genomes deposited in GenBank. We also show the influence of the identity and coverage parameters on the prediction of orthologous and paralogous genes. Finally, we cite the perspectives of several research areas where pan-genome analysis can be used to answer important issues.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
wyz653完成签到,获得积分10
1秒前
3秒前
北墨完成签到,获得积分10
4秒前
5秒前
Owen应助皮戾采纳,获得10
5秒前
8秒前
14秒前
共享精神应助思琪采纳,获得10
14秒前
浏阳河发布了新的文献求助10
18秒前
花痴的易真完成签到,获得积分10
21秒前
思琪完成签到,获得积分10
26秒前
周鑫硕完成签到,获得积分10
26秒前
26秒前
kiki完成签到 ,获得积分10
27秒前
思琪发布了新的文献求助10
31秒前
baibai完成签到 ,获得积分10
32秒前
ahh完成签到 ,获得积分10
33秒前
英俊的铭应助你好采纳,获得10
34秒前
浮游应助yang采纳,获得10
34秒前
Biggest完成签到,获得积分10
37秒前
yang应助LQ采纳,获得10
42秒前
科研一坤年完成签到,获得积分10
43秒前
43秒前
Ssyong完成签到 ,获得积分10
48秒前
乘风破浪完成签到,获得积分10
48秒前
析木完成签到,获得积分10
49秒前
Sea_U应助科研通管家采纳,获得10
51秒前
51秒前
Lucas应助科研通管家采纳,获得10
51秒前
yznfly应助科研通管家采纳,获得30
52秒前
SciGPT应助科研通管家采纳,获得10
52秒前
52秒前
52秒前
52秒前
NexusExplorer应助科研通管家采纳,获得30
52秒前
52秒前
52秒前
52秒前
酷波er应助科研通管家采纳,获得10
52秒前
52秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Rapid Review of Electrodiagnostic and Neuromuscular Medicine: A Must-Have Reference for Neurologists and Physiatrists 800
求中国石油大学(北京)图书馆的硕士论文,作者董晨,十年前搞太赫兹的 500
Vertebrate Palaeontology, 5th Edition 500
Narrative Method and Narrative form in Masaccio's Tribute Money 500
Aircraft Engine Design, Third Edition 500
Neonatal and Pediatric ECMO Simulation Scenarios 500
热门求助领域 (近24小时)
化学 医学 生物 材料科学 工程类 有机化学 内科学 生物化学 物理 计算机科学 纳米技术 遗传学 基因 复合材料 化学工程 物理化学 病理 催化作用 免疫学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 4767845
求助须知:如何正确求助?哪些是违规求助? 4104756
关于积分的说明 12697579
捐赠科研通 3822648
什么是DOI,文献DOI怎么找? 2109709
邀请新用户注册赠送积分活动 1134219
关于科研通互助平台的介绍 1015283