Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

RNA-seq mixology: designing realistic control experiments to compare protocols and analysis methods

Aliaksei Z. Holik, Charity W. Law, Ruijie Liu, Zeya Wang, Wenyi Wang, Jaeil Ahn, Marie-Liesse Asselin-Labat, Gordon K. Smyth, Matthew E. Ritchie
doi: https://doi.org/10.1101/063008
Aliaksei Z. Holik
1ACRF Stem Cells and Cancer Division, The Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, Victoria 3052, Australia.
6Department of Medical Biology, The University of Melbourne, Parkville, Victoria 3010, Australia.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Charity W. Law
2Molecular Medicine Division, The Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, Victoria 3052, Australia.
6Department of Medical Biology, The University of Melbourne, Parkville, Victoria 3010, Australia.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ruijie Liu
2Molecular Medicine Division, The Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, Victoria 3052, Australia.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Zeya Wang
3Statistics Department, George R. Brown School of Engineering, Rice University, 6100 Main St, Duncan Hall 2124, Houston, Texas 77005, USA.
4Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Blvd, Houston, Texas 77030, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Wenyi Wang
4Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Blvd, Houston, Texas 77030, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jaeil Ahn
5Department of Biostatistics, Bioinformatics, and Biomathematics, Georgetown University School of Medicine, Washington, DC 20057, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Marie-Liesse Asselin-Labat
1ACRF Stem Cells and Cancer Division, The Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, Victoria 3052, Australia.
6Department of Medical Biology, The University of Melbourne, Parkville, Victoria 3010, Australia.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Gordon K. Smyth
7Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, Victoria 3052, Australia.
8School of Mathematics and Statistics, The University of Melbourne, Parkville, Victoria 3010, Australia.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Matthew E. Ritchie
2Molecular Medicine Division, The Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, Victoria 3052, Australia.
6Department of Medical Biology, The University of Melbourne, Parkville, Victoria 3010, Australia.
8School of Mathematics and Statistics, The University of Melbourne, Parkville, Victoria 3010, Australia.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: mritchie@wehi.edu.au
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

Carefully designed control experiments provide a gold standard for benchmarking different genomics research tools. A shortcoming of many gene expression control studies is that replication involves profiling the same reference RNA sample multiple times. This leads to low, pure technical noise that is atypical of regular studies. To achieve a more realistic noise structure, we generated a RNA-sequencing mixture experiment using two cell lines of the same cancer type. Variability was added by extracting RNA from independent cell cultures and degrading particular samples. The systematic gene expression changes induced by this design allowed benchmarking of different library preparation kits (standard poly-A versus total RNA with Ribozero depletion) and analysis pipelines. Data generated using the total RNA kit had more signal for introns and various RNA classes (ncRNA, snRNA, snoRNA) and less variability after degradation. For differential expression analysis, voom with quality weights marginally outperformed other popular methods, while for differential splicing, DEXSeq was simultaneously the most sensitive and the most inconsistent method. For sample deconvolution analysis, DeMix outperformed IsoPure convincingly. Our RNA-sequencing dataset provides a valuable resource for benchmarking different protocols and data pre-processing workflows. The extra noise mimics routine lab experiments more closely, ensuring any conclusions are widely applicable.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted July 19, 2016.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
RNA-seq mixology: designing realistic control experiments to compare protocols and analysis methods
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
RNA-seq mixology: designing realistic control experiments to compare protocols and analysis methods
Aliaksei Z. Holik, Charity W. Law, Ruijie Liu, Zeya Wang, Wenyi Wang, Jaeil Ahn, Marie-Liesse Asselin-Labat, Gordon K. Smyth, Matthew E. Ritchie
bioRxiv 063008; doi: https://doi.org/10.1101/063008
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
RNA-seq mixology: designing realistic control experiments to compare protocols and analysis methods
Aliaksei Z. Holik, Charity W. Law, Ruijie Liu, Zeya Wang, Wenyi Wang, Jaeil Ahn, Marie-Liesse Asselin-Labat, Gordon K. Smyth, Matthew E. Ritchie
bioRxiv 063008; doi: https://doi.org/10.1101/063008

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genomics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4658)
  • Biochemistry (10313)
  • Bioengineering (7636)
  • Bioinformatics (26241)
  • Biophysics (13481)
  • Cancer Biology (10650)
  • Cell Biology (15361)
  • Clinical Trials (138)
  • Developmental Biology (8464)
  • Ecology (12776)
  • Epidemiology (2067)
  • Evolutionary Biology (16794)
  • Genetics (11373)
  • Genomics (15431)
  • Immunology (10580)
  • Microbiology (25087)
  • Molecular Biology (10172)
  • Neuroscience (54233)
  • Paleontology (398)
  • Pathology (1660)
  • Pharmacology and Toxicology (2884)
  • Physiology (4326)
  • Plant Biology (9213)
  • Scientific Communication and Education (1582)
  • Synthetic Biology (2545)
  • Systems Biology (6761)
  • Zoology (1459)