Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
Confirmatory Results

Fast functional annotation of metagenomic shotgun data by DNA alignment to a microbial gene catalog

Stuart M. Brown, Yuhan Hao, Hao Chen, Bobby P. Laungani, Thahmina A. Ali, Changsu Dong, Carlos Lijeron, Baekdoo Kim, Konstantinos Krampis, Zhiheng Pei
doi: https://doi.org/10.1101/120402
Stuart M. Brown
1NYU School of Medicine
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yuhan Hao
1NYU School of Medicine
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hao Chen
1NYU School of Medicine
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Bobby P. Laungani
2Hunter College
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Thahmina A. Ali
2Hunter College
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Changsu Dong
2Hunter College
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Carlos Lijeron
2Hunter College
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Baekdoo Kim
2Hunter College
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Konstantinos Krampis
2Hunter College
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Zhiheng Pei
1NYU School of Medicine
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

Background Metagenomic shotgun sequencing is becoming increasingly popular to study microbes associated with the human body and in environmental samples. A key goal of shotgun metagenomic sequencing is to identify gene functions and metabolic pathways that differ between samples or conditions. However, current methods to identify function in the large number of reads in a high-throughput sequence data file rely on the computationally intensive and low stringency approach of mapping each read to a generic database of proteins or reference microbial genomes.

Results We have developed an alternative analysis approach for shotgun metagenomic sequence data utilizing Bowtie2 DNA-DNA alignment of the reads to a database of well annotated genes compiled from human microbiome data. This method is rapid, and provides high stringency matches (>90% DNA sequence identity) of shotgun metagenomics reads to genes with annotated functions. We demonstrate the use of this method with synthetic data, Human Microbiome Project shotgun metagenomic data sets, and data from a study of liver disease. Differentially abundant KEGG gene functions can be detected in these experiments.

Conclusions Functional annotation of metagenomic shotgun sequence reads can be accomplished by rapid DNA-DNA matching to a custom database of microbial sequences using the Bowtie2 sequence alignment tool. This method can be used for a variety of microbiome studies and allows functional analysis which is otherwise computationally demanding. This rapid annotation method is freely available as a Galaxy workflow within a Docker image.

  • Abbreviations

    KEGG
    Kyoto Encyclopedia of Genes and Genomes
    MGS
    Metagenomics Shotgun sequencing
    HMP
    Human Microbiome Project
    IGC
    Integrated Gene Catalog of the human gut microbiome
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
    Back to top
    PreviousNext
    Posted March 25, 2017.
    Download PDF

    Supplementary Material

    Email

    Thank you for your interest in spreading the word about bioRxiv.

    NOTE: Your email address is requested solely to identify you as the sender of this article.

    Enter multiple addresses on separate lines or separate them with commas.
    Fast functional annotation of metagenomic shotgun data by DNA alignment to a microbial gene catalog
    (Your Name) has forwarded a page to you from bioRxiv
    (Your Name) thought you would like to see this page from the bioRxiv website.
    CAPTCHA
    This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
    Share
    Fast functional annotation of metagenomic shotgun data by DNA alignment to a microbial gene catalog
    Stuart M. Brown, Yuhan Hao, Hao Chen, Bobby P. Laungani, Thahmina A. Ali, Changsu Dong, Carlos Lijeron, Baekdoo Kim, Konstantinos Krampis, Zhiheng Pei
    bioRxiv 120402; doi: https://doi.org/10.1101/120402
    Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
    Citation Tools
    Fast functional annotation of metagenomic shotgun data by DNA alignment to a microbial gene catalog
    Stuart M. Brown, Yuhan Hao, Hao Chen, Bobby P. Laungani, Thahmina A. Ali, Changsu Dong, Carlos Lijeron, Baekdoo Kim, Konstantinos Krampis, Zhiheng Pei
    bioRxiv 120402; doi: https://doi.org/10.1101/120402

    Citation Manager Formats

    • BibTeX
    • Bookends
    • EasyBib
    • EndNote (tagged)
    • EndNote 8 (xml)
    • Medlars
    • Mendeley
    • Papers
    • RefWorks Tagged
    • Ref Manager
    • RIS
    • Zotero
    • Tweet Widget
    • Facebook Like
    • Google Plus One

    Subject Area

    • Bioinformatics
    Subject Areas
    All Articles
    • Animal Behavior and Cognition (3689)
    • Biochemistry (7789)
    • Bioengineering (5674)
    • Bioinformatics (21282)
    • Biophysics (10576)
    • Cancer Biology (8173)
    • Cell Biology (11937)
    • Clinical Trials (138)
    • Developmental Biology (6762)
    • Ecology (10401)
    • Epidemiology (2065)
    • Evolutionary Biology (13863)
    • Genetics (9708)
    • Genomics (13070)
    • Immunology (8139)
    • Microbiology (19983)
    • Molecular Biology (7842)
    • Neuroscience (43053)
    • Paleontology (319)
    • Pathology (1279)
    • Pharmacology and Toxicology (2258)
    • Physiology (3351)
    • Plant Biology (7232)
    • Scientific Communication and Education (1312)
    • Synthetic Biology (2004)
    • Systems Biology (5537)
    • Zoology (1128)