Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

A novel approach to identifying marker genes and estimating the cellular composition of whole blood from gene expression profiles

View ORCID ProfileCasey P. Shannon, Robert Balshaw, Virginia Chen, Zsuzsanna Hollander, Mustafa Toma, Bruce M. McManus, J. Mark FitzGerald, Don D. Sin, Raymond T. Ng, Scott J. Tebbutt
doi: https://doi.org/10.1101/038794
Casey P. Shannon
1PROOF Centre of Excell ence, Vancouver, BC, Canada
7Centre for Heart Lung Innovation, University of British Columbia, Vancouver, BC, Canada;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Casey P. Shannon
  • For correspondence: casey.shannon@hli.ubc.ca
Robert Balshaw
1PROOF Centre of Excell ence, Vancouver, BC, Canada
2BC Centre for Disease Control, Vancouver, BC, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Virginia Chen
1PROOF Centre of Excell ence, Vancouver, BC, Canada
7Centre for Heart Lung Innovation, University of British Columbia, Vancouver, BC, Canada;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Zsuzsanna Hollander
1PROOF Centre of Excell ence, Vancouver, BC, Canada
7Centre for Heart Lung Innovation, University of British Columbia, Vancouver, BC, Canada;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Mustafa Toma
3Division of Cardiology, University of British Columbia, Vancouver, BC, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Bruce M. McManus
1PROOF Centre of Excell ence, Vancouver, BC, Canada
4Department of Pathology and Laboratory Medicine, University of British Columbia, Vancouver, BC, Canada
7Centre for Heart Lung Innovation, University of British Columbia, Vancouver, BC, Canada;
8Institute for Heart and Lung Health, Vancouver, BC, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
J. Mark FitzGerald
6Department of Medicine, Division of Respiratory Medicine, University of British Columbia, Vancouver, BC, Canada
8Institute for Heart and Lung Health, Vancouver, BC, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Don D. Sin
6Department of Medicine, Division of Respiratory Medicine, University of British Columbia, Vancouver, BC, Canada
7Centre for Heart Lung Innovation, University of British Columbia, Vancouver, BC, Canada;
8Institute for Heart and Lung Health, Vancouver, BC, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Raymond T. Ng
1PROOF Centre of Excell ence, Vancouver, BC, Canada
5Department of Computer Science, University of British Columbia, Vancouver, BC, Canada
7Centre for Heart Lung Innovation, University of British Columbia, Vancouver, BC, Canada;
8Institute for Heart and Lung Health, Vancouver, BC, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Scott J. Tebbutt
1PROOF Centre of Excell ence, Vancouver, BC, Canada
6Department of Medicine, Division of Respiratory Medicine, University of British Columbia, Vancouver, BC, Canada
7Centre for Heart Lung Innovation, University of British Columbia, Vancouver, BC, Canada;
8Institute for Heart and Lung Health, Vancouver, BC, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Preview PDF
Loading

Abstract

Measuring genome-wide changes in transcript abundance in circulating peripheral whole blood cells is a useful way to study disease pathobiology and may help elucidate biomarkers and molecular mechanisms of disease. The sensitivity and interpretability of analyses carried out in this complex tissue, however, are significantly affected by its dynamic heterogeneity. It is therefore desirable to quantify this heterogeneity, either to account for it or to better model interactions that may be present between the abundance of certain transcripts, some cell types and the indication under study. Accurate enumeration of the many component cell types that make up peripheral whole blood can be costly, however, and may further complicate the sample collection process. Many approaches have been developed to infer the composition of a sample from high-dimensional transcriptomic and, more recently, epigenetic data. These approaches rely on the availability of isolated expression profiles for the cell types to be enumerated. These profiles are platform-specific, suitable datasets are rare, and generating them is expensive. No such dataset exists on the Affymetrix Gene ST platform. We present a freely-available, and open source, multi-response Gaussian model capable of accurately predicting the composition of peripheral whole blood samples from Affymetrix Gene ST expression profiles. This model outperforms other current methods when applied to Gene ST data and could potentially be used to enrich the >10,000 Affymetrix Gene ST blood gene expression profiles currently available on GEO.

  • We introduce a model that accurately predicts the composition of blood from Affymetrix Gene ST gene expression profiles.

  • This model outperforms existing methods when applied to Affymetrix Gene ST expression profiles from blood.

Key Points

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted February 03, 2016.
Download PDF
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
A novel approach to identifying marker genes and estimating the cellular composition of whole blood from gene expression profiles
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
A novel approach to identifying marker genes and estimating the cellular composition of whole blood from gene expression profiles
Casey P. Shannon, Robert Balshaw, Virginia Chen, Zsuzsanna Hollander, Mustafa Toma, Bruce M. McManus, J. Mark FitzGerald, Don D. Sin, Raymond T. Ng, Scott J. Tebbutt
bioRxiv 038794; doi: https://doi.org/10.1101/038794
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
A novel approach to identifying marker genes and estimating the cellular composition of whole blood from gene expression profiles
Casey P. Shannon, Robert Balshaw, Virginia Chen, Zsuzsanna Hollander, Mustafa Toma, Bruce M. McManus, J. Mark FitzGerald, Don D. Sin, Raymond T. Ng, Scott J. Tebbutt
bioRxiv 038794; doi: https://doi.org/10.1101/038794

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One
Subject Areas
All Articles
  • Animal Behavior and Cognition (4222)
  • Biochemistry (9095)
  • Bioengineering (6733)
  • Bioinformatics (23916)
  • Biophysics (12066)
  • Cancer Biology (9484)
  • Cell Biology (13720)
  • Clinical Trials (138)
  • Developmental Biology (7614)
  • Ecology (11644)
  • Epidemiology (2066)
  • Evolutionary Biology (15459)
  • Genetics (10610)
  • Genomics (14281)
  • Immunology (9447)
  • Microbiology (22749)
  • Molecular Biology (9056)
  • Neuroscience (48811)
  • Paleontology (354)
  • Pathology (1478)
  • Pharmacology and Toxicology (2558)
  • Physiology (3817)
  • Plant Biology (8299)
  • Scientific Communication and Education (1466)
  • Synthetic Biology (2285)
  • Systems Biology (6163)
  • Zoology (1295)