bioRxiv
New Results

Minos: variant adjudication and joint genotyping of cohorts of bacterial genomes

M. Hunt, B. Letcher, K.M. Malone, G. Nguyen, M.B. Hall, R.M. Colquhoun, L. Lima, M.C. Schatz, S. Ramakrishnan, CRyPTIC consortium, Z. Iqbal
doi: https://doi.org/10.1101/2021.09.15.460475
M. Hunt
1European Bioinformatics Institute, Cambridge, UK
2Nuffield Department of Medicine, University of Oxford, Oxford, UK
B. Letcher
1European Bioinformatics Institute, Cambridge, UK
K.M. Malone
1European Bioinformatics Institute, Cambridge, UK
G. Nguyen
1European Bioinformatics Institute, Cambridge, UK
M.B. Hall
1European Bioinformatics Institute, Cambridge, UK
R.M. Colquhoun
3Institute of Evolutionary Biology, Ashworth Laboratories, University of Edinburgh, UK
L. Lima
1European Bioinformatics Institute, Cambridge, UK
M.C. Schatz
4Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
S. Ramakrishnan
4Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
Z. Iqbal
1European Bioinformatics Institute, Cambridge, UK
  • For correspondence: zi@ebi.ac.uk

Abstract

Short-read variant calling for bacterial genomics is a mature field, and there are many widely used software tools. Different underlying approaches (e.g., pileup, local or global assembly, paired-read use, haplotype use) lend each tool different strengths, especially when considering non-SNP (single nucleotide polymorphism) variation or potentially distant reference genomes. It would therefore be valuable to integrate the results from multiple variant callers, using a robust statistical approach to “adjudicate” at loci where callers disagree. To this end, we present a tool, Minos, for variant adjudication by mapping reads to a genome graph of variant calls. Minos allows users to combine output from multiple variant callers without loss of precision. Minos also addresses a second problem, joint genotyping of SNPs and indels in bacterial cohorts, which can also be framed as an adjudication problem.

We benchmark on 62 samples from 3 species (Mycobacterium tuberculosis, Staphylococcus aureus, Klebsiella pneumoniae) and an outbreak of 385 M. tuberculosis samples. Finally, we joint genotype a large M. tuberculosis cohort (N≈15k) for which the rifampicin phenotype is known. We build a map of non-synonymous variants in the RRDR (rifampicin resistance determining region) of the rpoB gene and extend current knowledge relating RRDR SNPs to heterogeneity in rifampicin resistance levels. We replicate this finding in a second M. tuberculosis cohort (N≈13k).

Minos is released under the MIT license, available at https://github.com/iqbal-lab-org/minos.

Competing Interest Statement

E.R. is employed by Public Health England and holds an honorary contract with Imperial College London. I.F.L. is Director of the Scottish Mycobacteria Reference Laboratory. S.N. receives funding from German Center for Infection Research, Excellenz Cluster Precision Medicine in Chronic Inflammation, Leibniz Science Campus Evolutionary Medicine of the LUNG (EvoLUNG)tion EXC 2167. P.S. is a consultant at Genoscreen. T.R. is funded by NIH and DoD and receives salary support from the non-profit organization FIND. T.R. is a co-founder, board member and shareholder of Verus Diagnostics Inc, a company that was founded with the intent of developing diagnostic assays. Verus Diagnostics was not involved in any way with data collection, analysis or publication of the results. T.R. has not received any financial support from Verus Diagnostics. UCSD Conflict of Interest office has reviewed and approved T.R.'s role in Verus Diagnostics Inc. T.R. is a co-inventor of a provisional patent for a TB diagnostic assay (provisional patent #: 63/048.989). T.R. is a co-inventor on a patent associated with the processing of TB sequencing data (European Patent Application No. 14840432.0 & USSN 14/912,918). T.R. has agreed to "donate all present and future interest in and rights to royalties from this patent" to UCSD to ensure that he does not receive any financial benefits from this patent. S.S. is working and holding ESOPs at HaystackAnalytics Pvt. Ltd. (Product: Using whole genome sequencing for drug susceptibility testing for Mycobacterium tuberculosis). G.F.G. is listed as an inventor on patent applications for RBD-dimer-based CoV vaccines. The patents for RBD-dimers as protein subunit vaccines for SARS-CoV-2 have been licensed to Anhui Zhifei Longcom Biopharmaceutical Co. Ltd, China.

Footnotes

  • ∗ Please see later section “CRyPTIC consortium” for details.

  • Citations updated from "in prep" to new preprints on biorxiv.

  • https://figshare.com/projects/Minos-biorxiv-202109/122707

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Posted September 27, 2021.
bioRxiv 2021.09.15.460475; doi: https://doi.org/10.1101/2021.09.15.460475
Subject Area

  • Bioinformatics