Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Longitudinal differential abundance analysis of microbial marker-gene surveys using smoothing splines

View ORCID ProfileJoseph N. Paulson, Hisham Talukder, View ORCID ProfileHéctor Corrada Bravo
doi: https://doi.org/10.1101/099457
Joseph N. Paulson
1Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, USA
2Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Joseph N. Paulson
Hisham Talukder
3Applied Mathematics & Statistics, and Scientific Computation graduate program, University of Maryland, College Park, Maryland, USA
4Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Héctor Corrada Bravo
3Applied Mathematics & Statistics, and Scientific Computation graduate program, University of Maryland, College Park, Maryland, USA
4Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland, USA
5Computer Science Department, University of Maryland, College Park, Maryland, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Héctor Corrada Bravo
  • For correspondence: hcorrada@umiacs.umd.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

Background High-throughput targeted sequencing of the 16S ribosomal RNA marker gene is often used to profile and characterize the taxonomic composition of microbial communities. This type of big high-through sequencing data is rapidly being applied to various infectious diseases like diarrhea. While many studies are limited to single “snapshots” of these communities, there is increasing recognition that longitudinal profiling of these communities are required to understand community dynamics and the complex relationships between dynamics and phenotypes of interest. Statistical methods that determine microbial features that are differentially expressed are required as an initial step to characterizing phenotypic associations with community dynamics in big data and infectious diseases.

Results We present a novel method for longitudinal marker-gene surveys based on smoothing splines that allows discovery and inference of time periods where specific microbial features are differentially abundant. We applied our method to three 16S marker-gene surveys, including, groups of gnotobiotic mice on two diets, patients challenged with ETEC (H10407), and a vaginal microbiome of healthy women. Employing our methodology we recover known bacterial differences and highlight a few extra species providing insight into when specific changes occurred. Additionally, in the cohort challenged with ETEC we recover proposed probiotic bacteria Bacteroides xylanisolvens, Collinsella aerofaciens, and Faecalibacterium prausnitzii associatons with healthy individuals.

Conclusions The method presented is, to our knowledge, the first flexible method of its kind implemented as a software capable of detecting time periods of differential abundance for microbial features species between two or more sample groups of interest. Our method is available within the metagenomeSeq open-source software for analysis of metagenomic package available through the Bioconductor project and is termed metaSplines.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted January 10, 2017.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Longitudinal differential abundance analysis of microbial marker-gene surveys using smoothing splines
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Longitudinal differential abundance analysis of microbial marker-gene surveys using smoothing splines
Joseph N. Paulson, Hisham Talukder, Héctor Corrada Bravo
bioRxiv 099457; doi: https://doi.org/10.1101/099457
Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
Citation Tools
Longitudinal differential abundance analysis of microbial marker-gene surveys using smoothing splines
Joseph N. Paulson, Hisham Talukder, Héctor Corrada Bravo
bioRxiv 099457; doi: https://doi.org/10.1101/099457

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genomics
Subject Areas
All Articles
  • Animal Behavior and Cognition (2440)
  • Biochemistry (4803)
  • Bioengineering (3340)
  • Bioinformatics (14724)
  • Biophysics (6658)
  • Cancer Biology (5188)
  • Cell Biology (7455)
  • Clinical Trials (138)
  • Developmental Biology (4378)
  • Ecology (6904)
  • Epidemiology (2057)
  • Evolutionary Biology (9943)
  • Genetics (7357)
  • Genomics (9550)
  • Immunology (4583)
  • Microbiology (12730)
  • Molecular Biology (4960)
  • Neuroscience (28422)
  • Paleontology (199)
  • Pathology (810)
  • Pharmacology and Toxicology (1400)
  • Physiology (2031)
  • Plant Biology (4521)
  • Scientific Communication and Education (980)
  • Synthetic Biology (1305)
  • Systems Biology (3922)
  • Zoology (731)