PT - JOURNAL ARTICLE AU - Hu Jin AU - Alex I. Finnegan AU - Jun S. Song TI - A unified computational framework for modeling genome-wide nucleosome landscape AID - 10.1101/202580 DP - 2017 Jan 01 TA - bioRxiv PG - 202580 4099 - http://biorxiv.org/content/early/2017/10/18/202580.short 4100 - http://biorxiv.org/content/early/2017/10/18/202580.full AB - Nucleosomes form the fundamental building blocks of eukaryotic chromatin, and previous attempts to understand the principles governing their genome-wide distribution have spurred much interest and debate in biology. In particular, the precise role of DNA sequence in shaping local chromatin structure has been controversial. This paper rigorously quantifies of the contribution of hitherto-debated sequence features – including G+C content, 10.5-bp periodicity, and poly(dA:dT) tracts – to three distinct aspects of genome-wide nucleosome landscape: occupancy, translational positioning and rotational positioning. Our computational framework simultaneously learns nucleosome number and nucleosome-positioning energy from genome-wide nucleosome maps. In contrast to other previous studies, our model can predict both in-vitro and in-vivo nucleosome maps in S. cerevisiae. We find that although G+C content is the primary determinant of MNase-derived nucleosome occupancy, MNase digestion biases may substantially influence this GC dependence. By contrast, poly(dA:dT) tracts are seen to deter nucleosome formation, regardless of the experimental method used. We further show that the 10.5-bp nucleotide periodicity facilitates rotational but not translational positioning. Applying our method to in-vivo nucleosome maps demonstrates that, for a subset of genes, the regularly-spaced nucleosome arrays observed around transcription start sites can be partially recapitulated by DNA sequence alone. Finally, in-vivo nucleosome occupancy derived from MNase-seq experiments around transcription termination sites can be mostly explained by the genomic sequence. Implications of these results and potential extensions of the proposed computational framework are discussed