PT - JOURNAL ARTICLE
AU - Vivian, John
AU - Rao, Arjun
AU - Nothaft, Frank Austin
AU - Ketchum, Christopher
AU - Armstrong, Joel
AU - Novak, Adam
AU - Pfeil, Jacob
AU - Narkizian, Jake
AU - Deran, Alden D.
AU - Musselman-Brown, Audrey
AU - Schmidt, Hannes
AU - Amstutz, Peter
AU - Craft, Brian
AU - Goldman, Mary
AU - Rosenbloom, Kate
AU - Cline, Melissa
AU - O’Connor, Brian
AU - Hanna, Megan
AU - Birger, Chet
AU - Kent, W. James
AU - Patterson, David A.
AU - Joseph, Anthony D.
AU - Zhu, Jingchun
AU - Zaranek, Sasha
AU - Getz, Gad
AU - Haussler, David
AU - Paten, Benedict
TI - Rapid and efficient analysis of 20,000 RNA-seq samples with Toil
AID - 10.1101/062497
DP - 2016 Jan 01
TA - bioRxiv
PG - 062497
4099 - http://biorxiv.org/content/early/2016/07/07/062497.short
4100 - http://biorxiv.org/content/early/2016/07/07/062497.full
AB - Toil is portable, open-source workflow software that supports contemporary workflow definition languages and can be used to securely and reproducibly run scientific workflows efficiently at large-scale. To demonstrate Toil, we processed over 20,000 RNA-seq samples to create a consistent meta-analysis of five datasets free of computational batch effects that we make freely available. Nearly all the samples were analysed in under four days using a commercial cloud cluster of 32,000 preemptable cores.