TY - JOUR T1 - Genomic positional conservation identifies topological anchor point (tap)RNAs linked to developmental loci JF - bioRxiv DO - 10.1101/051052 SP - 051052 AU - Paulo P. Amaral AU - Tommaso Leonardi AU - Namshik Han AU - Emmanuelle Viré AU - Dennis Gascoigne AU - Raúl Arias-Carrasco AU - Magdalena Büscher AU - Anda Zhang AU - Stefano Pluchino AU - Vinicius Maracaja-Coutinho AU - Helder I. Nakaya AU - Martin Hemberg AU - Ramin Shiekhattar AU - Anton J. Enright AU - Tony Kouzarides Y1 - 2016/01/01 UR - http://biorxiv.org/content/early/2016/04/29/051052.abstract N2 - The mammalian genome is transcribed into large numbers of long noncoding RNAs (lncRNAs), but the definition of functional lncRNA groups has proven difficult, partly due to their low sequence conservation and lack of identified shared properties. Here we consider positional conservation across mammalian genomes as an indicator of functional commonality. We identify 665 conserved lncRNA promoters in mouse and human genomes that are preserved in genomic position relative to orthologous coding genes. The identified ‘positionally conserved’ lncRNA genes are primarily associated with developmental transcription factor loci with which they are co-expressed in a tissue-specific manner. Strikingly, over half of all positionally conserved RNAs in this set are linked distinct to chromatin organization structures, overlapping the binding sites for the CTCF chromatin organizer and located at chromatin loop anchor points and borders of topologically associating domains (TADs). These topological anchor point (tap)RNAs possess conserved sequence domains that are enriched in potential recognition motifs for Zinc Finger proteins. Characterization of these non-coding RNAs and their associated coding genes shows that they are functionally connected: they regulate each other’s expression and influence the metastatic phenotype of cancer cells in vitro in a similar fashion. Thus, interrogation of positionally conserved lncRNAs identifies a new subset of tapRNAs with shared functional properties. These results provide a large dataset of lncRNAs that conform to the “extended gene” model, in which conserved developmental genes are genomically and functionally linked to regulatory lncRNA loci across mammalian evolution. ER -