成对比较
系统发育树
计算机科学
熵(时间箭头)
度量(数据仓库)
多序列比对
基因组
系统发育学
数据挖掘
算法
序列比对
人工智能
生物
遗传学
基因
物理
量子力学
肽序列
作者
Yao‐Qun Wu,Zu‐Guo Yu,Runbin Tang,Guosheng Han,Vo Anh
标识
DOI:10.3389/fgene.2021.766496
摘要
Alignment methods have faced disadvantages in sequence comparison and phylogeny reconstruction due to their high computational costs in handling time and space complexity. On the other hand, alignment-free methods incur low computational costs and have recently gained popularity in the field of bioinformatics. Here we propose a new alignment-free method for phylogenetic tree reconstruction based on whole genome sequences. A key component is a measure called information-entropy position-weighted k-mer relative measure (IEPWRMkmer), which combines the position-weighted measure of k -mers proposed by our group and the information entropy of frequency of k -mers. The Manhattan distance is used to calculate the pairwise distance between species. Finally, we use the Neighbor-Joining method to construct the phylogenetic tree. To evaluate the performance of this method, we perform phylogenetic analysis on two datasets used by other researchers. The results demonstrate that the IEPWRMkmer method is efficient and reliable. The source codes of our method are provided at https://github.com/ wuyaoqun37/IEPWRMkmer.
科研通智能强力驱动
Strongly Powered by AbleSci AI