Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

RNA-seq mixology: designing realistic control experiments to compare protocols and analysis methods

Aliaksei Z. Holik, Charity W. Law, Ruijie Liu, Zeya Wang, Wenyi Wang, Jaeil Ahn, Marie-Liesse Asselin-Labat, Gordon K Smyth, Matthew E Ritchie
doi: https://doi.org/10.1101/063008
Aliaksei Z. Holik
1ACRF Stem Cells and Cancer Division, The Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, Victoria 3052, Australia.
6Department of Medical Biology, The University of Melbourne, Parkville, Victoria 3010, Australia.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Charity W. Law
2Molecular Medicine Division, The Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, Victoria 3052, Australia.
6Department of Medical Biology, The University of Melbourne, Parkville, Victoria 3010, Australia.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ruijie Liu
2Molecular Medicine Division, The Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, Victoria 3052, Australia.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Zeya Wang
3Statistics Department, George R. Brown School of Engineering, Rice University, 6100 Main St, Duncan Hall 2124, Houston, Texas 77005, USA.
4Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Blvd, Houston, Texas 77030, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Wenyi Wang
4Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Blvd, Houston, Texas 77030, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jaeil Ahn
5Department of Biostatistics, Bioinformatics, and Biomathematics, Georgetown University School of Medicine, Washington, DC 20057, USA.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Marie-Liesse Asselin-Labat
1ACRF Stem Cells and Cancer Division, The Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, Victoria 3052, Australia.
6Department of Medical Biology, The University of Melbourne, Parkville, Victoria 3010, Australia.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Gordon K Smyth
7Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, Victoria 3052, Australia.
8School of Mathematics and Statistics, The University of Melbourne, Parkville, Victoria 3010, Australia.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Matthew E Ritchie
2Molecular Medicine Division, The Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, Victoria 3052, Australia.
6Department of Medical Biology, The University of Melbourne, Parkville, Victoria 3010, Australia.
8School of Mathematics and Statistics, The University of Melbourne, Parkville, Victoria 3010, Australia.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: mritchie@wehi.edu.au
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

Background Carefully designed control experiments provide a gold standard for benchmarking new platforms, protocols and pipelines in genomics research. RNA profiling control studies frequently use the mixture design, which takes two distinct samples and combines them in known proportions to induce predictable expression changes for every gene. Current mixture experiments have low noise and simulate relatively large expression changes by comparing RNA from different tissues, making them atypical of regular experiments.

Results To generate a more realistic RNA-sequencing control data set, two cell lines of the same cancer type were mixed in various proportions. Noise was added by independently preparing, mixing and degrading a subset of the samples. The systematic gene-expression changes induced by this design were used to benchmark different library preparation kits (standard poly-A versus total RNA with Ribozero depletion) and analysis pipelines for differential gene expression, differential splicing and deconvolution analysis. More signal for introns and various RNA classes (ncRNA, snRNA, snoRNA) and less variability after degradation was observed using the total RNA kit. For differential expression analysis, voom with quality weights marginally outperformed other popular methods, while for differential splicing, the DEXSeq method was found to be the most sensitive but also the most inconsistent. For sample deconvolution analysis, DeMix outperformed IsoPure convincingly.

Conclusions RNA-sequencing control experiments such as this provide a valuable resource for benchmarking different sequencing protocols and data pre-processing workflows. We have demonstrated that with a few extra steps, data with noise characteristics much more similar to regular RNA-sequencing experiments can be obtained.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted July 09, 2016.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
RNA-seq mixology: designing realistic control experiments to compare protocols and analysis methods
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
RNA-seq mixology: designing realistic control experiments to compare protocols and analysis methods
Aliaksei Z. Holik, Charity W. Law, Ruijie Liu, Zeya Wang, Wenyi Wang, Jaeil Ahn, Marie-Liesse Asselin-Labat, Gordon K Smyth, Matthew E Ritchie
bioRxiv 063008; doi: https://doi.org/10.1101/063008
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
RNA-seq mixology: designing realistic control experiments to compare protocols and analysis methods
Aliaksei Z. Holik, Charity W. Law, Ruijie Liu, Zeya Wang, Wenyi Wang, Jaeil Ahn, Marie-Liesse Asselin-Labat, Gordon K Smyth, Matthew E Ritchie
bioRxiv 063008; doi: https://doi.org/10.1101/063008

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genomics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4115)
  • Biochemistry (8818)
  • Bioengineering (6522)
  • Bioinformatics (23466)
  • Biophysics (11792)
  • Cancer Biology (9212)
  • Cell Biology (13326)
  • Clinical Trials (138)
  • Developmental Biology (7439)
  • Ecology (11413)
  • Epidemiology (2066)
  • Evolutionary Biology (15155)
  • Genetics (10439)
  • Genomics (14045)
  • Immunology (9173)
  • Microbiology (22159)
  • Molecular Biology (8814)
  • Neuroscience (47581)
  • Paleontology (350)
  • Pathology (1429)
  • Pharmacology and Toxicology (2492)
  • Physiology (3731)
  • Plant Biology (8082)
  • Scientific Communication and Education (1437)
  • Synthetic Biology (2221)
  • Systems Biology (6039)
  • Zoology (1253)