剧目
计算生物学
聚类分析
DNA测序
生物
计算机科学
抗体库
抗体
遗传学
基因
人工智能
声学
物理
作者
João Henrique Diniz Brandão Gervásio,Alice Ferreira,Liza Felicori
标识
DOI:10.1016/j.jim.2023.113576
摘要
The next-generation sequencing technologies have transformed our understanding of immunoglobulin (Ig) profiles in various immune states. Clonotyping, which groups Ig sequences into B cell clones, is crucial in investigating the diversity of repertoires and changes in antigen exposure. Despite its importance, there is no widely accepted method for clonotyping, and existing methods are computationally intensive for large sequencing datasets. To address this challenge, we introduce YClon, a fast and efficient approach for clonotyping Ig repertoire data. YClon uses a hierarchical clustering approach, similar to other methods, to group Ig sequences into B cell clones in a highly sensitive and specific manner. Notably, our approach outperforms other methods by being more than 30 to 5000 times faster in processing the repertoires analyzed. Astonishingly, YClon can effortlessly handle up to 2 million Ig sequences on a standard laptop computer. This enables in-depth analysis of large and numerous antibody repertoires. YClon was implemented in Python3 and is freely available on GitHub.
科研通智能强力驱动
Strongly Powered by AbleSci AI