Endothelial, epithelial, and fibroblast cells exhibit specific splicing programs independently of their tissue of origin

  1. Didier Auboeuf1,2,3,4,7
  1. 1Inserm UMR-S1052, Centre de Recherche en Cancérologie de Lyon, 69008 Lyon, France;
  2. 2CNRS UMR5286, Centre de Recherche en Cancérologie de Lyon, 69008 Lyon, France;
  3. 3Université de Lyon, 69007 Lyon, France;
  4. 4Centre Léon Bérard, 69008 Lyon, France;
  5. 5UMR CNRS 5558, Laboratoire de Biométrie et Biologie Evolutive, INRIA Bamboo, Université Claude Bernard, Villeurbanne 69100, France
    1. 6 These authors contributed equally to this work.

    Abstract

    Alternative splicing is the main mechanism of increasing the proteome diversity coded by a limited number of genes. It is well established that different tissues or organs express different splicing variants. However, organs are composed of common major cell types, including fibroblasts, epithelial, and endothelial cells. By analyzing large-scale data sets generated by The ENCODE Project Consortium and after extensive RT-PCR validation, we demonstrate that each of the three major cell types expresses a specific splicing program independently of its organ origin. Furthermore, by analyzing splicing factor expression across samples, publicly available splicing factor binding site data sets (CLIP-seq), and exon array data sets after splicing factor depletion, we identified several splicing factors, including ESRP1 and 2, MBNL1, NOVA1, PTBP1, and RBFOX2, that contribute to establishing these cell type–specific splicing programs. All of the analyzed data sets are freely available in a user-friendly web interface named FasterDB, which describes all known splicing variants of human and mouse genes and their splicing patterns across several dozens of normal and cancer cells as well as across tissues. Information regarding splicing factors that potentially contribute to individual exon regulation is also provided via a dedicated CLIP-seq and exon array data visualization interface. To the best of our knowledge, FasterDB is the first database integrating such a variety of large-scale data sets to enable functional genomics analyses at exon-level resolution.

    Footnotes

    • Received July 3, 2013.
    • Accepted December 2, 2013.

    This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 3.0 Unported), as described at http://creativecommons.org/licenses/by-nc/3.0/.

    | Table of Contents

    Preprint Server