ABSTRACT
RNA-Seq techniques generate hundreds of millions of short RNA reads using next-generation sequencing (NGS). These RNA reads can be mapped to reference genomes to investigate changes of gene expression but improved procedures for mining large RNA-Seq datasets to extract valuable biological knowledge are needed. RNAMiner -- a multi-level bioinformatics protocol and pipeline -- has been developed for such datasets. It includes five steps: mapping RNA-Seq reads to a reference genome, calculating gene expression values, identifying differentially expressed genes, predicting gene functions, and constructing gene regulatory networks. To demonstrate its utility, we applied RNAMiner to datasets generated from Human, Mouse, Arabidopsis thaliana, and Drosophila melanogaster cells, and successfully identified differentially expressed genes, clustered them into cohesive functional groups, and constructed novel gene regulatory networks. The RNAMiner web service is available at http://calla.rnet.missouri.edu/rnaminer/index.html.
Index Terms
- From gigabyte to kilobyte: a bioinformatics protocol for mining large RNA-Seq transcriptomics data
Recommendations
CEDER: Accurate Detection of Differentially Expressed Genes by Combining Significance of Exons Using RNA-Seq
RNA-Seq is widely used in transcriptome studies, and the detection of differentially expressed genes (DEGs) between two classes of individuals, e.g., cases versus controls, using RNA-Seq is of fundamental importance. Many statistical methods for DEG ...
Differential splicing analysis based on isoforms expression with NBSplice
Graphical abstractDisplay Omitted
Highlights- Isoforms expression analysis enables the detection of differentially spliced genes.
AbstractAlternative splicing alterations have been widely related to several human diseases revealing the importance of their study for the success of translational medicine. Differential splicing (DS) occurrence has been mainly analyzed ...
Microarray vs. RNA-Seq: a comparison for active subnetwork discovery
BCB '12: Proceedings of the ACM Conference on Bioinformatics, Computational Biology and BiomedicineWhile microarrays have been successfully used by the researchers to analyze gene expression levels, cutting edge high throughput sequencing technologies now made it possible to go one step further. Recent studies show that absolute expression levels are ...
Comments