Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Resistome SNP Calling via Read Colored de Bruijn Graphs

Bahar Alipanahi, Martin D. Muggli, Musa Jundi, Noelle Noyes, Christina Boucher
doi: https://doi.org/10.1101/156174
Bahar Alipanahi
University of Florida;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Martin D. Muggli
Colorado State University
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Musa Jundi
University of Florida;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Noelle Noyes
Colorado State University
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Christina Boucher
University of Florida;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: cboucher@cise.ufl.edu
  • Abstract
  • Info/History
  • Metrics
  • Preview PDF
Loading

Abstract

Motivation: The resistome, which refers to all of the antimicrobial resistance (AMR) genes in pathogenic and non-pathogenic bacteria, is frequently studied using shotgun metagenomic data [14, 47]. Unfortunately, few existing methods are able to identify single nucleotide polymorphisms (SNPs) within metagenomic data, and to the best of our knowledge, no methods exist to detect SNPs within AMR genes within the resistome. The ability to identify SNPs in AMR genes across the resistome would represent a significant advance in understanding the dissemination and evolution of AMR, as SNP identification would enable "fingerprinting" of the resistome, which could then be used to track AMR dynamics across various settings and/or time periods. Results: We present LueVari, a reference-free SNP caller based on the read colored de Bruijn graph, an extension of the traditional de Bruijn graph that allows repeated regions longer than the k-mer length and shorter than the read length to be identified unambiguously. We demonstrate LueVari was the only method that had reliable sensitivity (between 73% and 98%) as the performance of competing methods varied widely. Furthermore, we show LueVari constructs sequences containing the variation which span 93% of the gene in datasets with lower coverage (15X), and 100% of the gene in datasets with higher coverage (30X). Availability: Code and datasets are publicly available at https://github.com/baharpan/cosmo/tree/LueVari.

Copyright 
The copyright holder for this preprint is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
  • Posted April 18, 2018.

Download PDF

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Resistome SNP Calling via Read Colored de Bruijn Graphs
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
Share
Resistome SNP Calling via Read Colored de Bruijn Graphs
Bahar Alipanahi, Martin D. Muggli, Musa Jundi, Noelle Noyes, Christina Boucher
bioRxiv 156174; doi: https://doi.org/10.1101/156174
del.icio.us logo Digg logo Reddit logo Technorati logo Twitter logo CiteULike logo Connotea logo Facebook logo Google logo Mendeley logo
Citation Tools
Resistome SNP Calling via Read Colored de Bruijn Graphs
Bahar Alipanahi, Martin D. Muggli, Musa Jundi, Noelle Noyes, Christina Boucher
bioRxiv 156174; doi: https://doi.org/10.1101/156174

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (619)
  • Biochemistry (857)
  • Bioengineering (515)
  • Bioinformatics (4754)
  • Biophysics (1499)
  • Cancer Biology (1028)
  • Cell Biology (1445)
  • Clinical Trials (52)
  • Developmental Biology (973)
  • Ecology (1628)
  • Epidemiology (808)
  • Evolutionary Biology (3687)
  • Genetics (2509)
  • Genomics (3260)
  • Immunology (601)
  • Microbiology (2408)
  • Molecular Biology (888)
  • Neuroscience (6471)
  • Paleontology (42)
  • Pathology (124)
  • Pharmacology and Toxicology (220)
  • Physiology (286)
  • Plant Biology (890)
  • Scientific Communication and Education (247)
  • Synthetic Biology (383)
  • Systems Biology (1321)
  • Zoology (162)