Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Rich chromatin structure prediction from Hi-C data

Laraib Iqbal Malik, Rob Patro
doi: https://doi.org/10.1101/032953
Laraib Iqbal Malik
1Department of Computer Science, Stony Brook University, Stony Brook, NY 11794 USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Rob Patro
1Department of Computer Science, Stony Brook University, Stony Brook, NY 11794 USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: rob.patro@cs.stonybrook.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Preview PDF
Loading

ABSTRACT

Recent studies involving the 3-dimensional conformation of chromatin have revealed the important role it has to play in different processes within the cell. These studies have also led to the discovery of densely interacting segments of the chromosome, called topologically associating domains. The accurate identification of these domains from Hi-C interaction data is an interesting and important computational problem for which numerous methods have been proposed. Unfortunately, most existing algorithms designed to identify these domains assume that they are non-overlapping whereas there is substantial evidence to believe a nested structure exists. We present an efficient methodology to predict hierarchical chromatin domains using chromatin conformation capture data. Our method predicts domains at different resolutions and uses these to construct a hierarchy that is based on intrinsic properties of the chromatin data. The hierarchy consists of a set of non-overlapping domains, that maximize intra-domain interaction frequencies, at each level. We show that our predicted structure is highly enriched for CTCF and various other chromatin markers. We also show that large-scale domains, at multiple resolutions within our hierarchy, are conserved across cell types and species. Our software, Matryoshka, is written in C++11 and licensed under GPL v3; it is available at https://github.com/COMBINE-lab/matryoshka.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license.
Back to top
PreviousNext
Posted November 26, 2015.
Download PDF
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Rich chromatin structure prediction from Hi-C data
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Rich chromatin structure prediction from Hi-C data
Laraib Iqbal Malik, Rob Patro
bioRxiv 032953; doi: https://doi.org/10.1101/032953
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Rich chromatin structure prediction from Hi-C data
Laraib Iqbal Malik, Rob Patro
bioRxiv 032953; doi: https://doi.org/10.1101/032953

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One
Subject Areas
All Articles
  • Animal Behavior and Cognition (4246)
  • Biochemistry (9174)
  • Bioengineering (6806)
  • Bioinformatics (24065)
  • Biophysics (12160)
  • Cancer Biology (9566)
  • Cell Biology (13828)
  • Clinical Trials (138)
  • Developmental Biology (7660)
  • Ecology (11738)
  • Epidemiology (2066)
  • Evolutionary Biology (15544)
  • Genetics (10672)
  • Genomics (14363)
  • Immunology (9515)
  • Microbiology (22913)
  • Molecular Biology (9135)
  • Neuroscience (49161)
  • Paleontology (358)
  • Pathology (1487)
  • Pharmacology and Toxicology (2584)
  • Physiology (3851)
  • Plant Biology (8351)
  • Scientific Communication and Education (1473)
  • Synthetic Biology (2301)
  • Systems Biology (6206)
  • Zoology (1303)