计算机科学
软件
公制(单位)
片段(逻辑)
编码(集合论)
源代码
样品(材料)
取样偏差
算法
数据挖掘
样本量测定
统计
数学
化学
色谱法
经济
操作系统
集合(抽象数据类型)
程序设计语言
运营管理
作者
Chowdhury Rafeed Rahman,Zhong Wee Poh,Anders J. Skanderup,Limsoon Wong
标识
DOI:10.1093/bioinformatics/btaf293
摘要
Abstract Motivation Cell-free DNA (cfDNA) analysis has wide-ranging clinical applications due to its non-invasive nature. However, cfDNA fragmentomics and copy number analysis can be complicated by GC bias. There is a lack of GC correction software based on rigorous cfDNA GC bias analysis. Furthermore, there is no standardized metric for comparing GC bias correction methods across large sample sets, nor a rigorous experiment setup to demonstrate their effectiveness on cfDNA data at various coverage levels. Results We present GCfix, a method for robust GC bias correction in cfDNA data across diverse coverages. Developed following an in-depth analysis of cfDNA GC bias at the region and fragment length levels, GCfix is both fast and accurate. It works on all reference genomes and generates correction factors, tagged BAM files, and corrected coverage tracks. We also introduce two orthogonal performance metrics for (1) comparing the fragment count density distribution of GC content between expected and corrected samples, and (2) evaluating coverage profile improvement post-correction. GCfix outperforms existing cfDNA GC bias correction methods on these metrics. Availability and Implementation GCfix software and code for reproducing the figures are publicly accessible on GitHub: https://github.com/Rafeed-bot/GCfix_Software. Supplementary Information All Supplementary figures and data are available online through Bioinformatics.
科研通智能强力驱动
Strongly Powered by AbleSci AI