T细胞受体
序列(生物学)
计算生物学
计算机科学
肽
化学
生物
遗传学
T细胞
生物化学
免疫系统
作者
Alessandro Montemurro,Viktoria Schuster,Helle Rus Povlsen,Amalie Kai Bentzen,Vanessa Jurtz,William D. Chronister,Austin Crinklaw,Sine Reker Hadrup,Ole Winther,Bjoern Peters,Leon Eyrich Jessen,Morten Nielsen
标识
DOI:10.1038/s42003-021-02610-3
摘要
Prediction of T-cell receptor (TCR) interactions with MHC-peptide complexes remains highly challenging. This challenge is primarily due to three dominant factors: data accuracy, data scarceness, and problem complexity. Here, we showcase that "shallow" convolutional neural network (CNN) architectures are adequate to deal with the problem complexity imposed by the length variations of TCRs. We demonstrate that current public bulk CDR3 beta-pMHC binding data overall is of low quality and that the development of accurate prediction models is contingent on paired alpha/beta TCR sequence data corresponding to at least 150 distinct pairs for each investigated pMHC. In comparison, models trained on CDR3 alpha or CDR3 beta data alone demonstrated a variable and pMHC specific relative performance drop. Together these findings support that T-cell specificity is predictable given the availability of accurate and sufficient paired TCR sequence data. NetTCR-2.0 is publicly available at https://services.healthtech.dtu.dk/service.php?NetTCR-2.0..
科研通智能强力驱动
Strongly Powered by AbleSci AI