Elsevier

Methods in Enzymology

Volume 470, 2010, Pages 281-315
Methods in Enzymology

Chapter 12 - High-Quality Binary Interactome Mapping

https://doi.org/10.1016/S0076-6879(10)70012-4Get rights and content

Abstract

Physical interactions mediated by proteins are critical for most cellular functions and altogether form a complex macromolecular “interactome” network. Systematic mapping of protein–protein, protein–DNA, protein–RNA, and protein–metabolite interactions at the scale of the whole proteome can advance understanding of interactome networks with applications ranging from single protein functional characterization to discoveries on local and global systems properties. Since the early efforts at mapping protein–protein interactome networks a decade ago, the field has progressed rapidly giving rise to a growing number of interactome maps produced using high-throughput implementations of either binary protein–protein interaction assays or co-complex protein association methods. Although high-throughput methods are often thought to necessarily produce lower quality information than low-throughput experiments, we have recently demonstrated that proteome-scale interactome datasets can be produced with equal or superior quality than that observed in literature-curated datasets derived from large numbers of small-scale experiments. In addition to performing all experimental steps thoroughly and including all necessary controls and quality standards, careful verification of all interacting pairs and validation tests using independent, orthogonal assays are crucial to ensure the release of interactome maps of the highest possible quality. This chapter describes a high-quality, high-throughput binary protein–protein interactome mapping pipeline that includes these features.

Introduction

Interactions mediated by proteins and the complex “interactome” networks resulting from these interactions are essential for biological systems. Mapping protein–protein, protein–DNA, protein–RNA, and protein–metabolite interactions that form “interactome” networks is a major goal of functional genomics, proteomics, and systems biology (Vidal, 2005). Information obtained from large-scale efforts to identify protein interaction partners yields crucial biological insights throughout a range of applications. At the single protein level, interactome maps have helped assign functions to both uncharacterized and well-studied gene products (Oliver, 2000). At the systems level, interactome maps have enabled investigations of how regulatory circuits and global cellular network properties relate to biological functions (Han et al., 2004, Jeong et al., 2001, Milo et al., 2002, Yu et al., 2008).

The two major high-throughput strategies used so far to delineate protein–protein interactome networks are: (i) binary protein–protein interaction assays, which detect direct pairwise interactions, and (ii) affinity purification followed by mass spectrometry (AP–MS) approaches, which detect biochemically stable, copurifying protein complexes containing both direct and indirect protein associations. Classically, binary interaction assays have been based on the yeast two-hybrid (Y2H) system developed 20 years ago (Fields and Song, 1989), and which has been improved over time to increase efficiency and quality (Durfee et al., 1993, Gyuris et al., 1993, Vidal et al., 1996). Of late, alternative approaches have been developed to detect binary interactions, such as protein arrays, protein complementation assays, and the split ubiquitin method (Miller et al., 2005, Tarassov et al., 2008, Zhu et al., 2001).

Until recently high-throughput methods were regarded as more likely to produce lower quality information than low-throughput experiments. It has now been shown that highly reliable interactome datasets can be obtained at the scale of the whole proteome (Braun et al., 2009, Cusick et al., 2009, Simonis et al., 2009, Venkatesan et al., 2009) provided that all experimental steps are thorough and all necessary controls and quality standards are included. Lastly, careful verification of all candidate interactions and experimental validation using independent interaction assays are necessary to ensure the release of interactome maps of the highest possible quality.

Even when highly reliable, interactome maps should be considered as network models of interactions that can happen between all proteins encoded by the genome of an organism of interest. As such, they correspond to static representations of collapsed time-, space-, and condition-dependent interactions that dynamically regulate the behavior and developmental fate of diverse tissues. Thus, interactome maps should be used as static scaffold-like information from which the dynamic features of biologically relevant interactions, that is, those that do happen in vivo, can be modeled by integrating additional functional information such as transcriptional and phenotypic profiling data (Ge et al., 2001, Ge et al., 2003, Gunsalus et al., 2005, Vidal, 2001). Ultimately, novel potentially insightful interactions need to be evaluated for their biological significance using genetic experiments, where specific cis-acting interaction-defective alleles (IDAs) of one or both proteins or trans-acting disruptors are tested functionally (Endoh et al., 2002, Vidal & Endoh, 1999, Vidal et al., 1996).

Section snippets

High-Quality Binary Interactome Mapping

The quality of any dataset can be affected by a high rate of “false positives” and need to be addressed in two fundamentally different contexts. One relates to avoidable experimental errors leading to wrong information, and the other relates to as yet undiscovered fundamental properties of proteins (Fig. 12.1). Our binary interactome mapping strategy is designed to differentiate between these two classes of issues designated “technical” and “biological” false positives.

  • Technical false positives

Assembly of DB-X and AD-Y expression plasmids

The first step towards binary interactome mapping is the generation of expression plasmids. For high-throughput experiments it is preferable to use sequence independent recombinational subcloning technologies such as Gateway cloning (Walhout et al., 2000). Large resources containing thousands of distinct ORFs in Gateway entry vectors are available for a few organisms (Lamesch et al., 2004, Lamesch et al., 2007, Reboul et al., 2003, Rual et al., 2004). These ORFs can be transferred into

Validation Using Orthogonal Binary Interaction Assays

Complementary assays are essential to assess the precision of a dataset against PRS and RRS (see 2.2.1.). The following complementary assays can be used to determine the precision of a dataset by testing a random sample, and as part of an interaction assay tool-kit for confidence scoring of individual interactions (Braun et al., 2009). We describe the yellow fluorescent protein (YFP) based protein complementation assay (Nyfeler et al., 2005) and the sandwich ELISA-like well-NAPPA protein

Conclusion

Information on interactome networks constitutes a critical element of systems biology. We have spelled out a general approach to high-quality interactome mapping in which a reliable high-throughput assay is used as a primary screening platform. Subsequently, alternative validation assays are used to demonstrate data quality in a way unprejudiced by preconceived ideas and biases about what protein interactions are supposed to look like. To produce high-quality data, appropriate controls need to

Acknowledgments

We thank past and current members of the Vidal Lab and the Center for Cancer Systems Biology (CCSB) for their help and constructive discussions over the course of developing our binary interaction mapping strategies, framework, and protocols. This work was supported by National Human Genome Research Institute grants R01-HG001715 awarded to M.V. and D.E.H and P50-HG004233 awarded to M.V., grant DBI-0703905 from the National Science Foundation to M. V. and D. E. H., and by Institute Sponsored

References (43)

  • A.J. Walhout et al.

    High-throughput yeast two-hybrid assays for large-scale protein interaction mapping

    Methods

    (2001)
  • J.S. Bader et al.

    Gaining confidence in high-throughput protein interaction networks

    Nat. Biotechnol.

    (2004)
  • P. Braun et al.

    An experimentally derived confidence score for binary protein–protein interactions

    Nat. Methods

    (2009)
  • S.R. Collins et al.

    Functional dissection of protein complexes involved in yeast chromosome biology using a genetic interaction map

    Nature

    (2007)
  • M.E. Cusick et al.

    Literature-curated protein interaction datasets

    Nat. Methods

    (2009)
  • A. De Nicolo et al.

    Multimodal assessment of protein functional deficiency supports pathogenicity of BRCA1 p.V1688del

    Cancer Res.

    (2009)
  • M. Dreze et al.

    ‘Edgetic’ perturbation of a C. elegans BCL-2 ortholog

    Nat. Methods

    (2009)
  • T. Durfee et al.

    The retinoblastoma protein associates with the protein phosphatase type 1 catalytic subunit

    Genes Dev.

    (1993)
  • S. Fields et al.

    A novel genetic system to detect protein–protein interactions

    Nature

    (1989)
  • H. Ge et al.

    Correlation between transcriptome and interactome mapping data from Saccharomyces cerevisiae

    Nat. Genet.

    (2001)
  • K.C. Gunsalus et al.

    Predictive models of molecular machines involved in Caenorhabditis elegans early embryogenesis

    Nature

    (2005)
  • Cited by (113)

    • Protein interactome mapping in Caenorhabditis elegans

      2019, Current Opinion in Systems Biology
    • Mapping the Polarity Interactome

      2018, Journal of Molecular Biology
    • Network Analysis of UBE3A/E6AP-Associated Proteins Provides Connections to Several Distinct Cellular Processes

      2018, Journal of Molecular Biology
      Citation Excerpt :

      pQZ212 is a modified version of pVV212, a high-copy 2-micron (2μ) vector, engineered to contain the LEU2 and CAN1 selection markers. Competent yeast cells of the Y8930c strain (MATα leu2-3,112 trp1-901 his3Δ200 ura3-52 gal4Δ gal80Δ GAL2::ADE2 GAL1::HIS3@LYS2 GAL7::lacZ@MET2 cyh2R) were transformed with the cloned DB-ORF constructs [37]. Yeast cells were plated on selective synthetic complete (SC) solid medium lacking leucine (SC-Leu) to select for transformants.

    View all citing articles on Scopus
    View full text