Chapter 12 - High-Quality Binary Interactome Mapping
Introduction
Interactions mediated by proteins and the complex “interactome” networks resulting from these interactions are essential for biological systems. Mapping protein–protein, protein–DNA, protein–RNA, and protein–metabolite interactions that form “interactome” networks is a major goal of functional genomics, proteomics, and systems biology (Vidal, 2005). Information obtained from large-scale efforts to identify protein interaction partners yields crucial biological insights throughout a range of applications. At the single protein level, interactome maps have helped assign functions to both uncharacterized and well-studied gene products (Oliver, 2000). At the systems level, interactome maps have enabled investigations of how regulatory circuits and global cellular network properties relate to biological functions (Han et al., 2004, Jeong et al., 2001, Milo et al., 2002, Yu et al., 2008).
The two major high-throughput strategies used so far to delineate protein–protein interactome networks are: (i) binary protein–protein interaction assays, which detect direct pairwise interactions, and (ii) affinity purification followed by mass spectrometry (AP–MS) approaches, which detect biochemically stable, copurifying protein complexes containing both direct and indirect protein associations. Classically, binary interaction assays have been based on the yeast two-hybrid (Y2H) system developed 20 years ago (Fields and Song, 1989), and which has been improved over time to increase efficiency and quality (Durfee et al., 1993, Gyuris et al., 1993, Vidal et al., 1996). Of late, alternative approaches have been developed to detect binary interactions, such as protein arrays, protein complementation assays, and the split ubiquitin method (Miller et al., 2005, Tarassov et al., 2008, Zhu et al., 2001).
Until recently high-throughput methods were regarded as more likely to produce lower quality information than low-throughput experiments. It has now been shown that highly reliable interactome datasets can be obtained at the scale of the whole proteome (Braun et al., 2009, Cusick et al., 2009, Simonis et al., 2009, Venkatesan et al., 2009) provided that all experimental steps are thorough and all necessary controls and quality standards are included. Lastly, careful verification of all candidate interactions and experimental validation using independent interaction assays are necessary to ensure the release of interactome maps of the highest possible quality.
Even when highly reliable, interactome maps should be considered as network models of interactions that can happen between all proteins encoded by the genome of an organism of interest. As such, they correspond to static representations of collapsed time-, space-, and condition-dependent interactions that dynamically regulate the behavior and developmental fate of diverse tissues. Thus, interactome maps should be used as static scaffold-like information from which the dynamic features of biologically relevant interactions, that is, those that do happen in vivo, can be modeled by integrating additional functional information such as transcriptional and phenotypic profiling data (Ge et al., 2001, Ge et al., 2003, Gunsalus et al., 2005, Vidal, 2001). Ultimately, novel potentially insightful interactions need to be evaluated for their biological significance using genetic experiments, where specific cis-acting interaction-defective alleles (IDAs) of one or both proteins or trans-acting disruptors are tested functionally (Endoh et al., 2002, Vidal & Endoh, 1999, Vidal et al., 1996).
Section snippets
High-Quality Binary Interactome Mapping
The quality of any dataset can be affected by a high rate of “false positives” and need to be addressed in two fundamentally different contexts. One relates to avoidable experimental errors leading to wrong information, and the other relates to as yet undiscovered fundamental properties of proteins (Fig. 12.1). Our binary interactome mapping strategy is designed to differentiate between these two classes of issues designated “technical” and “biological” false positives.
- •
Technical false positives
Assembly of DB-X and AD-Y expression plasmids
The first step towards binary interactome mapping is the generation of expression plasmids. For high-throughput experiments it is preferable to use sequence independent recombinational subcloning technologies such as Gateway cloning (Walhout et al., 2000). Large resources containing thousands of distinct ORFs in Gateway entry vectors are available for a few organisms (Lamesch et al., 2004, Lamesch et al., 2007, Reboul et al., 2003, Rual et al., 2004). These ORFs can be transferred into
Validation Using Orthogonal Binary Interaction Assays
Complementary assays are essential to assess the precision of a dataset against PRS and RRS (see 2.2.1.). The following complementary assays can be used to determine the precision of a dataset by testing a random sample, and as part of an interaction assay tool-kit for confidence scoring of individual interactions (Braun et al., 2009). We describe the yellow fluorescent protein (YFP) based protein complementation assay (Nyfeler et al., 2005) and the sandwich ELISA-like well-NAPPA protein
Conclusion
Information on interactome networks constitutes a critical element of systems biology. We have spelled out a general approach to high-quality interactome mapping in which a reliable high-throughput assay is used as a primary screening platform. Subsequently, alternative validation assays are used to demonstrate data quality in a way unprejudiced by preconceived ideas and biases about what protein interactions are supposed to look like. To produce high-quality data, appropriate controls need to
Acknowledgments
We thank past and current members of the Vidal Lab and the Center for Cancer Systems Biology (CCSB) for their help and constructive discussions over the course of developing our binary interaction mapping strategies, framework, and protocols. This work was supported by National Human Genome Research Institute grants R01-HG001715 awarded to M.V. and D.E.H and P50-HG004233 awarded to M.V., grant DBI-0703905 from the National Science Foundation to M. V. and D. E. H., and by Institute Sponsored
References (43)
- et al.
Towards a comprehensive atlas of the physical interactome of Saccharomyces cerevisiae
Mol. Cell Proteomics
(2007) - et al.
Protein interactions: Two methods for assessment of the reliability of high throughput observations
Mol. Cell Proteomics
(2002) - et al.
Integrated version of reverse two-hybrid system for the postproteomic era
Methods Enzymol.
(2002) - et al.
Integrating ‘omic’ information: A bridge between genomics and systems biology
Trends Genet.
(2003) - et al.
Cdi1, a human G1 and S phase protein phosphatase that associates with Cdk2
Cell
(1993) - et al.
hORFeome v3.1: A resource of human open reading frames representing over 10, 000 human genes
Genomics
(2007) A biological atlas of functional maps
Cell
(2001)Interactome modeling
FEBS Lett.
(2005)- et al.
Prospects for drug screening using the reverse two-hybrid system
Trends Biotechnol.
(1999) - et al.
Increasing specificity in high-throughput yeast two-hybrid screens
Methods
(2004)
High-throughput yeast two-hybrid assays for large-scale protein interaction mapping
Methods
Gaining confidence in high-throughput protein interaction networks
Nat. Biotechnol.
An experimentally derived confidence score for binary protein–protein interactions
Nat. Methods
Functional dissection of protein complexes involved in yeast chromosome biology using a genetic interaction map
Nature
Literature-curated protein interaction datasets
Nat. Methods
Multimodal assessment of protein functional deficiency supports pathogenicity of BRCA1 p.V1688del
Cancer Res.
‘Edgetic’ perturbation of a C. elegans BCL-2 ortholog
Nat. Methods
The retinoblastoma protein associates with the protein phosphatase type 1 catalytic subunit
Genes Dev.
A novel genetic system to detect protein–protein interactions
Nature
Correlation between transcriptome and interactome mapping data from Saccharomyces cerevisiae
Nat. Genet.
Predictive models of molecular machines involved in Caenorhabditis elegans early embryogenesis
Nature
Cited by (113)
Structure-based drug design of multi-targeting inhibitors of human pathogen fungi
2023, Journal of Molecular StructureProtein interactome mapping in Caenorhabditis elegans
2019, Current Opinion in Systems BiologyMapping the Polarity Interactome
2018, Journal of Molecular BiologyNetwork Analysis of UBE3A/E6AP-Associated Proteins Provides Connections to Several Distinct Cellular Processes
2018, Journal of Molecular BiologyCitation Excerpt :pQZ212 is a modified version of pVV212, a high-copy 2-micron (2μ) vector, engineered to contain the LEU2 and CAN1 selection markers. Competent yeast cells of the Y8930c strain (MATα leu2-3,112 trp1-901 his3Δ200 ura3-52 gal4Δ gal80Δ GAL2::ADE2 GAL1::HIS3@LYS2 GAL7::lacZ@MET2 cyh2R) were transformed with the cloned DB-ORF constructs [37]. Yeast cells were plated on selective synthetic complete (SC) solid medium lacking leucine (SC-Leu) to select for transformants.
Next-Generation Yeast Two-Hybrid Screening to Discover Protein–Protein Interactions
2023, Methods in Molecular Biology