Toward almost closed genomes with GapFiller

Genome Biol. 2012 Jun 25;13(6):R56. doi: 10.1186/gb-2012-13-6-r56.

Abstract

De novo assembly is a commonly used application of next-generation sequencing experiments. The ultimate goal is to puzzle millions of reads into one complete genome, although draft assemblies usually result in a number of gapped scaffold sequences. In this paper we propose an automated strategy, called GapFiller, to reliably close gaps within scaffolds using paired reads. The method shows good results on both bacterial and eukaryotic datasets, allowing only few errors. As a consequence, the amount of additional wetlab work needed to close a genome is drastically reduced. The software is available at http://www.baseclear.com/bioinformatics-tools/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Chromosomes, Human, Pair 14 / genetics
  • Computational Biology / methods
  • Escherichia coli / genetics
  • Genome, Bacterial*
  • Genome, Fungal*
  • Genome, Human*
  • Genomic Library
  • Genomics / methods*
  • Humans
  • INDEL Mutation
  • Internet
  • Reproducibility of Results
  • Saccharomyces cerevisiae / genetics
  • Software*