TY - JOUR T1 - aRNApipe: A balanced, efficient and distributed pipeline for processing RNA-seq data in high performance computing environments JF - bioRxiv DO - 10.1101/060277 SP - 060277 AU - Arnald Alonso AU - Brittany N. Lasseigne AU - Kelly Williams AU - Josh Nielsen AU - Ryne C. Ramaker AU - Andrew A. Hardigan AU - Bobbi Johnston AU - Brian S. Roberts AU - Sara J. Cooper AU - Sara Marsal AU - Richard M. Myers Y1 - 2017/01/01 UR - http://biorxiv.org/content/early/2017/01/20/060277.abstract N2 - Summary The wide range of RNA-seq applications and their high computational needs require the development of pipelines orchestrating the entire workflow and optimizing usage of available computational resources. We present aRNApipe, a project-oriented pipeline for processing of RNA-seq data in high performance cluster environments. aRNApipe is highly modular and can be easily migrated to any high performance computing (HPC) environment. The current applications included in aRNApipe combine the essential RNA-seq primary analyses, including quality control metrics, transcript alignment, count generation, transcript fusion identification, alternative splicing, and sequence variant calling. aRNApipe is project-oriented and dynamic so users can easily update analyses to include or exclude samples or enable additional processing modules. Workflow parameters are easily set using a single configuration file that provides centralized tracking of all analytical processes. Finally, aRNApipe incorporates interactive web reports for sample tracking and a tool for managing the genome assemblies available to perform an analysis.Availability and documentation https://github.com/HudsonAlpha/aRNAPipe; DOI:10.5281/zenodo.202950Contact rmyers{at}hudsonalpha.orgSupplementary information Supplementary data are available at Bioinformatics online. ER -