Accurate detection of differential RNA processing

Nucleic Acids Res. 2013 May 1;41(10):5189-98. doi: 10.1093/nar/gkt211. Epub 2013 Apr 12.

Abstract

Deep transcriptome sequencing (RNA-Seq) has become a vital tool for studying the state of cells in the context of varying environments, genotypes and other factors. RNA-Seq profiling data enable identification of novel isoforms, quantification of known isoforms and detection of changes in transcriptional or RNA-processing activity. Existing approaches to detect differential isoform abundance between samples either require a complete isoform annotation or fall short in providing statistically robust and calibrated significance estimates. Here, we propose a suite of statistical tests to address these open needs: a parametric test that uses known isoform annotations to detect changes in relative isoform abundance and a non-parametric test that detects differential read coverages and can be applied when isoform annotations are not available. Both methods account for the discrete nature of read counts and the inherent biological variability. We demonstrate that these tests compare favorably to previous methods, both in terms of accuracy and statistical calibrations. We use these techniques to analyze RNA-Seq libraries from Arabidopsis thaliana and Drosophila melanogaster. The identified differential RNA processing events were consistent with RT-qPCR measurements and previous studies. The proposed toolkit is available from http://bioweb.me/rdiff and enables in-depth analyses of transcriptomes, with or without available isoform annotation.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Algorithms
  • Animals
  • Arabidopsis / genetics
  • Arabidopsis / metabolism
  • Data Interpretation, Statistical
  • Drosophila melanogaster / genetics
  • Drosophila melanogaster / metabolism
  • Gene Expression Profiling
  • Molecular Sequence Annotation
  • RNA Isoforms / metabolism
  • RNA Processing, Post-Transcriptional*
  • Reverse Transcriptase Polymerase Chain Reaction

Substances

  • RNA Isoforms