计算机科学
基因组
寄主(生物学)
数据集成
计算生物学
生物
任务(项目管理)
数据挖掘
基因
遗传学
管理
经济
作者
Xi Zeng,Linghao Zhao,Chenhang Shen,Yi Zhou,Guoliang Li,Wing‐Kin Sung
出处
期刊:Bioinformatics
[Oxford University Press]
日期:2021-01-13
卷期号:37 (13): 1821-1827
被引量:6
标识
DOI:10.1093/bioinformatics/btab031
摘要
Virus integration in the host genome is frequently reported to be closely associated with many human diseases, and the detection of virus integration is a critically challenging task. However, most existing tools show limited specificity and sensitivity. Therefore, the objective of this study is to develop a method for accurate detection of virus integration into host genomes.Herein, we report a novel method termed HIVID2 that is a significant upgrade of HIVID. HIVID2 performs a paired-end combination (PE-combination) for potentially integrated reads. The resulting sequences are then remapped onto the reference genomes, and both split and discordant chimeric reads are used to identify accurate integration breakpoints with high confidence. HIVID2 represents a great improvement in specificity and sensitivity, and predicts breakpoints closer to the real integrations, compared with existing methods. The advantage of our method was demonstrated using both simulated and real datasets. HIVID2 uncovered novel integration breakpoints in well-known cervical cancer-related genes, including FHIT and LRP1B, which was verified using protein expression data. In addition, HIVID2 allows the user to decide whether to automatically perform advanced analysis using the identified virus integrations. By analyzing the simulated data and real data tests, we demonstrated that HIVID2 is not only more accurate than HIVID but also better than other existing programs with respect to both sensitivity and specificity. We believe that HIVID2 will help in enhancing future research associated with virus integration.HIVID2 can be accessed at https://github.com/zengxi-hada/HIVID2/.Supplementary data are available at Bioinformatics online.
科研通智能强力驱动
Strongly Powered by AbleSci AI