RT Journal Article SR Electronic T1 RNA-seq 2G: online analysis of differential gene expression with comprehesive options of statistical methods JF bioRxiv FD Cold Spring Harbor Laboratory SP 122747 DO 10.1101/122747 A1 Zhe Zhang A1 Yuanchao Zhang A1 Perry Evans A1 Asif Chinwalla A1 Deanne Taylor YR 2017 UL http://biorxiv.org/content/early/2017/03/31/122747.abstract AB RNA-seq has become the most prevalent technology for measuring genome-wide gene expression, but the best practices for processing and analysing RNA-seq data are still an open question. Many statistical methods have been developed to identify genes differentially expressed between sample groups from RNA-seq data. These methods differ by their data distribution assumptions, choice of statistical test, and computational resource requirements. Over 25 methods of differential expression detection were validated and made available through a user-friendly web portal, RNA-seq 2G. All methods are suitable for analysing differential gene expression between two groups of samples. They commonly use a read count matrix derived from RNA-seq data as input and statistically compare groups for each gene. The web portal uses a Shiny app front-end and is hosted by a cloud-based server provided by Amazon Web Service. The comparison of methods showed that the data distribution assumption is the major determinant of differences between methods. Most methods are more likely to find that longer genes are differentially expressed, which substantially impacts downstream gene set-level analysis. Combining results from multiple methods can potentially diminish this bias. RNA-seq 2G makes the analysis of RNA-seq data more accessible and efficient, and is freely available at http://rnaseq2g.awsomics.org.