Normalization of a chromosomal contact map

BMC Genomics. 2012 Aug 30:13:436. doi: 10.1186/1471-2164-13-436.

Abstract

Background: Chromatin organization has been increasingly studied in relation with its important influence on DNA-related metabolic processes such as replication or regulation of gene expression. Since its original design ten years ago, capture of chromosome conformation (3C) has become an essential tool to investigate the overall conformation of chromosomes. It relies on the capture of long-range trans and cis interactions of chromosomal segments whose relative proportions in the final bank reflect their frequencies of interactions, hence their spatial proximity in a population of cells. The recent coupling of 3C with deep sequencing approaches now allows the generation of high resolution genome-wide chromosomal contact maps. Different protocols have been used to generate such maps in various organisms. This includes mammals, drosophila and yeast. The massive amount of raw data generated by the genomic 3C has to be carefully processed to alleviate the various biases and byproducts generated by the experiments. Our study aims at proposing a simple normalization procedure to minimize the influence of these unwanted but inevitable events on the final results.

Results: Careful analysis of the raw data generated previously for budding yeast S. cerevisiae led to the identification of three main biases affecting the final datasets, including a previously unknown bias resulting from the circularization of DNA molecules. We then developed a simple normalization procedure to process the data and allow the generation of a normalized, highly contrasted, chromosomal contact map for S. cerevisiae. The same method was then extended to the first human genome contact map. Using the normalized data, we revisited the preferential interactions originally described between subsets of discrete chromosomal features. Notably, the detection of preferential interactions between tRNA in yeast and CTCF, PolII binding sites in human can vary with the normalization procedure used.

Conclusions: We quantitatively reanalyzed the genomic 3C data obtained for S. cerevisiae, identified some of the biases inherent to the technique and proposed a simple normalization procedure to analyse them. Such an approach can be easily generalized for genomic 3C experiments in other organisms. More experiments and analysis will be necessary to reach optimal resolution and accuracies of the maps generated through these approaches. Working with cell population presenting highest levels of homogeneity will prove useful in this regards.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Binding Sites
  • CCCTC-Binding Factor
  • Chromosome Mapping / methods*
  • Chromosomes, Fungal / genetics*
  • Chromosomes, Human / genetics*
  • DNA Polymerase III / genetics
  • DNA Polymerase III / metabolism
  • DNA, Circular
  • Genome*
  • Humans
  • RNA, Transfer / genetics
  • RNA, Transfer / metabolism
  • ROC Curve
  • Repressor Proteins / genetics
  • Repressor Proteins / metabolism
  • Saccharomyces cerevisiae / genetics*

Substances

  • CCCTC-Binding Factor
  • CTCF protein, human
  • DNA, Circular
  • Repressor Proteins
  • RNA, Transfer
  • DNA Polymerase III