生物
系统发育树
基因组
系统基因组学
遗传学
生物信息学
全基因组测序
比较基因组学
系统发育学
计算生物学
序列分析
基因组学
多位点序列分型
进化生物学
基因
基因型
克莱德
作者
Matías Javier Garavaglia,Andrés Muzlera,Claudio Valverde
标识
DOI:10.1016/j.ympev.2022.107663
摘要
In the field of prokaryotic taxonomy, there has been a recent transition towards phylogenomics as the gold standard approach. However, genome-based phylogenetics is still restrictive for its cost when managing large amounts of isolates. Fast, cheap, and taxonomically competent alternatives, like multilocus sequence analysis (MLSA) are thus recommendable. Nevertheless, the criteria for selecting the conserved genes for MLSA have not been explicit for different bacterial taxa, including the broadly diverse Pseudomonas genus. Here, we have carried out an unbiased and rational workflow to select internal sequence regions of Pseudomonas core genes (CG) for a MLSA with the best phylogenetic power, and with a resolution comparable to the genome-based ANI approach. A computational workflow was established to inspect 126 complete genomes of representatives from over 60 Pseudomonas species and subspecies, in order to identify the most informative CG internal regions and determine which combinations in sets of three partial CG sequences have comparable phylogenetic resolution to that of the current ANI standard. We found that the rpoD346-1196-pepN1711-2571-gltX86-909 concatenated sequences were the best performing in terms of phylogenetic robustness and resulted highly sensitive and specific when contrasted with ANI. The rpoD-pepN-gltX MLSA was validated in silico and in vitro. Altogether, the results presented here supports the proposal of the rpoD-pepN-gltX MLSA as a fast, affordable, and robust phylogenetic tool for members of the Pseudomonas genus.
科研通智能强力驱动
Strongly Powered by AbleSci AI