管道(软件)
转录组
RNA序列
参考基因组
顺序装配
计算生物学
脚本语言
Python(编程语言)
Ensembl公司
预处理器
生物
基因组
计算机科学
从头转录组组装
原始数据
数据挖掘
基因
基因表达
遗传学
基因组学
人工智能
操作系统
程序设计语言
作者
David J. Burks,Rajeev K. Azad
标识
DOI:10.1007/978-1-0716-1822-6_5
摘要
In this chapter, we describe methods for analyzing RNA-Seq data, presented as a flow along a pipeline beginning with raw data from a sequencer and ending with an output of differentially expressed genes and their functional characterization. The first section covers de novo transcriptome assembly for organisms lacking reference genomes or for those interested in probing against the background of organism-specific transcriptomes assembled from RNA-Seq data. Section 2 covers both gene- and transcript-level quantifications, leading to the third and final section on differential expression analysis between two or more conditions. The pipeline starts with raw sequence reads, followed by quality assessment and preprocessing of the input data to ensure a robust estimate of the transcripts and their differential regulation. The preprocessed data can be inputted into the de novo transcriptome flow to assemble transcripts, functionally annotated using tools such as InterProScan or Blast2Go and then forwarded to differential expression analysis flow, or directly inputted into the differential expression analysis flow if a reference genome is available. An online repository containing sample data has also been made available, as well as custom Python scripts to modify the output of the programs within the pipeline for various downstream analyses.
科研通智能强力驱动
Strongly Powered by AbleSci AI