Defining cell-type specificity at the transcriptional level in human disease

  1. Matthias Kretzler1,2,10
  1. 1Division of Nephrology, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA;
  2. 2Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, USA;
  3. 3Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey 08544, USA;
  4. 4Department of Computer Science, Princeton University, Princeton, New Jersey 08540, USA;
  5. 5Department of Genetics, The Geisel School of Medicine at Dartmouth, Hanover, New Hampshire 03755, USA;
  6. 6Department of Pathology, University of Michigan, Ann Arbor, Michigan 48109, USA;
  7. 7Laboratorio di Ricerca Nefrologica, Fondazione IRCCS Ca' Granda Ospedale Maggiore Policlinico, 20122 Milano, Italy;
  8. 8Division of Nephrology, University of Zurich, 8057 Zurich, Switzerland
    1. 9 These authors contributed equally to this work.

    Abstract

    Cell-lineage–specific transcripts are essential for differentiated tissue function, implicated in hereditary organ failure, and mediate acquired chronic diseases. However, experimental identification of cell-lineage–specific genes in a genome-scale manner is infeasible for most solid human tissues. We developed the first genome-scale method to identify genes with cell-lineage–specific expression, even in lineages not separable by experimental microdissection. Our machine-learning–based approach leverages high-throughput data from tissue homogenates in a novel iterative statistical framework. We applied this method to chronic kidney disease and identified transcripts specific to podocytes, key cells in the glomerular filter responsible for hereditary and most acquired glomerular kidney disease. In a systematic evaluation of our predictions by immunohistochemistry, our in silico approach was significantly more accurate (65% accuracy in human) than predictions based on direct measurement of in vivo fluorescence-tagged murine podocytes (23%). Our method identified genes implicated as causal in hereditary glomerular disease and involved in molecular pathways of acquired and chronic renal diseases. Furthermore, based on expression analysis of human kidney disease biopsies, we demonstrated that expression of the podocyte genes identified by our approach is significantly related to the degree of renal impairment in patients. Our approach is broadly applicable to define lineage specificity in both cell physiology and human disease contexts. We provide a user-friendly website that enables researchers to apply this method to any cell-lineage or tissue of interest. Identified cell-lineage–specific transcripts are expected to play essential tissue-specific roles in organogenesis and disease and can provide starting points for the development of organ-specific diagnostics and therapies.

    Footnotes

    • 10 Corresponding authors

      E-mail ogt{at}genomics.princeton.edu

      E-mail kretzler{at}umich.edu

    • [Supplemental material is available for this article.]

    • Article published online before print. Article, supplemental material, and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.155697.113.

    • Received February 17, 2013.
    • Accepted August 14, 2013.

    This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 3.0 Unported), as described at http://creativecommons.org/licenses/by-nc/3.0/.

    | Table of Contents

    Preprint Server