计算机科学
RNA序列
计算生物学
数据科学
可扩展性
核糖核酸
RNA剪接
生物学数据
转录组
生物
生物信息学
基因
遗传学
数据库
基因表达
作者
Dhrithi Deshpande,Karishma Chhugani,Yutong Chang,Aaron Karlsberg,Caitlin Loeffler,Jinyang Zhang,Agata Muszyńska,Viorel Munteanu,Harry Taegyun Yang,Jeremy Rotman,Laura Tao,Brunilda Balliu,Elizabeth Tseng,Eleazar Eskin,Fangqing Zhao,Pejman Mohammadi,Paweł P. Łabaj,Serghei Mangul
标识
DOI:10.3389/fgene.2023.997383
摘要
RNA sequencing (RNA-seq) has become an exemplary technology in modern biology and clinical science. Its immense popularity is due in large part to the continuous efforts of the bioinformatics community to develop accurate and scalable computational tools to analyze the enormous amounts of transcriptomic data that it produces. RNA-seq analysis enables genes and their corresponding transcripts to be probed for a variety of purposes, such as detecting novel exons or whole transcripts, assessing expression of genes and alternative transcripts, and studying alternative splicing structure. It can be a challenge, however, to obtain meaningful biological signals from raw RNA-seq data because of the enormous scale of the data as well as the inherent limitations of different sequencing technologies, such as amplification bias or biases of library preparation . The need to overcome these technical challenges has pushed the rapid development of novel computational tools, which have evolved and diversified in accordance with technological advancements, leading to the current myriad of RNA-seq tools. These tools, combined with the diverse computational skill sets of biomedical researchers, help to unlock the full potential of RNA-seq. The purpose of this review is to explain basic concepts in the computational analysis of RNA-seq data and define discipline-specific jargon.
科研通智能强力驱动
Strongly Powered by AbleSci AI