Contact genomics: scaffolding and phasing (meta)genomes using chromosome 3D physical signatures

FEBS Lett. 2015 Oct 7;589(20 Pt A):2966-74. doi: 10.1016/j.febslet.2015.04.034. Epub 2015 Apr 29.

Abstract

High-throughput DNA sequencing technologies are fuelling an accelerating trend to assemble de novo or resequence the genomes of numerous species as well as to complete unfinished assemblies. While current DNA sequencing technologies remain limited to reading stretches of a few hundreds or thousands of base pairs, experimental and computational methods are continuously improving with the goal of assembling entire genomes from large numbers of short DNA sequences. However, the algorithms that piece together DNA strands face important limitations due, notably, to the presence of repeated sequences or of multiple haplotypes within one genome, thus leaving many assemblies incomplete. Recently, the realization that the physical contacts experienced by a portion of a DNA molecule could be used as a robust and quantitative assay to determine its genomic position has led to the emerging field of contact genomics, which promises to revolutionize current genome assembly approaches by exploiting the flexible polymer properties of chromosomes. Here we review the current applications of contact genomics to genome scaffolding, haplotyping and metagenomic assembly, then outline the future developments we envision.

Keywords: 3C; Contact genomics; Genome assembly; Haplotype phasing; Hi-C; Metagenomics; Scaffolding.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Animals
  • Chromosomes / physiology
  • Chromosomes / ultrastructure*
  • Cluster Analysis
  • Epistasis, Genetic
  • Gene Regulatory Networks
  • Genomics
  • Haplotypes
  • Humans
  • Metagenome*
  • Sequence Analysis, DNA