Schmutzi: estimation of contamination and endogenous mitochondrial consensus calling for ancient DNA

Genome Biol. 2015 Oct 12:16:224. doi: 10.1186/s13059-015-0776-0.

Abstract

Ancient DNA is typically highly degraded with appreciable cytosine deamination, and contamination with present-day DNA often complicates the identification of endogenous molecules. Together, these factors impede accurate assembly of the endogenous ancient mitochondrial genome. We present schmutzi, an iterative approach to jointly estimate present-day human contamination in ancient human DNA datasets and reconstruct the endogenous mitochondrial genome. By using sequence deamination patterns and fragment length distributions, schmutzi accurately reconstructs the endogenous mitochondrial genome sequence even when contamination exceeds 50 %. Given sufficient coverage, schmutzi also produces reliable estimates of contamination across a range of contamination rates.

Availability: https://bioinf.eva.mpg.de/schmutzi/ license:GPLv3.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Base Sequence
  • Consensus Sequence*
  • DNA Contamination*
  • DNA, Mitochondrial / chemistry*
  • Genome, Mitochondrial*
  • Humans
  • Neanderthals / genetics
  • Sequence Analysis, DNA / methods*
  • Software*

Substances

  • DNA, Mitochondrial