Abstract
Protein-protein interactions (PPIs) regulate almost all cellular processes, and engineered PPIs have cell and gene therapy applications. Identifying all disruptive variants in human PPIs or characterizing all possible interactions in panels of rationally designed proteins requires experimental workflows that can scale to thousands of interactions. We here introduce massively parallel protein-protein interaction measurement by sequencing (MP3-seq), an easy-to-use and highly scalable yeast-two-hybrid (Y2H) approach to address this need. In MP3-seq, DNA barcodes are associated with specific protein pairs; enrichment of these barcodes during yeast selection can be read by sequencing to provide a direct measure of interaction strength. We show that MP3-seq is quantitative over several orders of magnitude of interaction strengths and can scale to measure over 100,000 interactions at once. We apply MP3-seq to characterize interactions between several families of rationally designed heterodimers and develop a greedy algorithm to reduce large-scale MP3-seq screens to identify sets of potentially orthogonal heterodimers. Finally, we use MP3-Seq to delve into and verify the elements that confer specificity to interactions between coil-hairpin-coil binders with designed hydrogen-bond networks.
Introduction
Synthetic protein binders that can mediate interactions between other proteins or cells have the potential to revolutionize fields from cell therapy (Cho et al., 2018) to synthetic biology (Chen et al., 2020; Chen and Elowitz, 2021; Gao et al., 2018; Groves et al., 2016) and material science (Ben-Sasson et al., 2021; Gonen et al., 2015; Ljubetič et al., 2017b). Early work in synthetic biology often relied on natural interaction domains such as SH3 and PDZ (Kuriyan and Cowburn, 1997). However, such components provide a poor starting point for rationally designing large-scale assemblies due to crosstalk. Consequently, synthetic protein circuits’ scale and complexity still pale compared to biological PPI networks. In order to scale up synthetic protein-based circuits, we need large-scale libraries of modular interaction domains.
Fulfilling this need through rational heterodimer design has yielded promising results, but creating large, totally orthogonal sets of dimers remains challenging. Early rational design work focused on coiled-coil dimers (1×1s): while most PPIs involve backbone shape complementarity, 1×1 binding is primarily determined by complementary electrostatic and hydrophobic interactions at specific heptad positions. Because the biophysical rules guiding these interactions are comparably well understood and can be captured by predictive models (Fong et al., 2004; Potapov et al., 2015), orthogonal sets of up to six 1×1 heterodimers have been generated (Lebar et al., 2020; Thompson et al., 2012). Recently, a set of orthogonal 1×1s (consisting primarily of homodimers) was identified in a high-throughput screen (Boldridge et al., 2020). However, the restricted geometry required for 1×1 interactions leads to some degree of interaction promiscuity which fundamentally hampers designing larger orthogonal sets. The Baker lab expanded the coiled-coil toolbox by introducing helical bundle heterodimers, with protomers consisting of two coils connected by a hairpin loop, where interactions are determined by designed hydrogen bond networks between the bundles (2×2s) (Boyken et al., 2016; Chen et al., 2019). This multi-helix bundle approach increases the alphabet of possible coil interactions and could pave the way to generating larger orthogonal sets. However, reliably minimizing off-target interactions (e.g., due to proteins associating in an unexpected register or orientation) remains challenging as these are not captured in the typical biophysical design objective to produce 2×2s. An effective alternative is to prepare diverse libraries of de novo binders designed to have strong on-target interactions and then experimentally measure an all-by-all matrix of interactions. Afterward, an orthogonal set can be extracted (Brodnik et al., 2019).
Mass spectrometry and protein-array based methods can measure PPIs but require laborious protein purification steps (Rao et al., 2014; Smits and Vermeulen, 2016). Yeast-or phage-display methods use next-generation sequencing technologies to increase throughput, but are limited to “several-versus-many” screening (Rouet et al., 2018). Alpha-seq, a recent high-throughput method that takes advantage of the yeast mating pathway, overcomes this limitation and allows library-on-library screening (Younger et al., 2017). However, throughput is still limited; moreover, not all proteins of interest will correctly fold when displayed on a surface. Yeast two-hybrid (Y2H) methods are a powerful alternative to surface display for characterizing PPIs (Fields and Song, 1989). In Y2H, one protein of interest is fused to a DNA binding domain (DBD) and the second is fused to a transcriptional activation domain (AD). If the proteins interact, a functional transcription factor is reconstituted and drives the expression of a growth-essential enzyme. Early Y2H approaches tested small numbers of PPIs using plate-based selection, but lab automation and pooling strategies (Luck et al., 2020; Weile et al., 2017) have enabled proteome-scale screens. To further address Y2H scaling issues, high-throughput Y2H (HT-Y2H) and enzyme complementation methods have been developed leveraging next-generation sequencing to read out interaction strength (Diss and Lehner, 2018; Erffelinck et al., 2018; Jin et al., 2007; Rajagopala and Uetz, 2009; Trigg et al., 2017; Weimann et al., 2013; Yachie et al., 2016; F. Yang et al., 2018; J.-S. Yang et al., 2018; Yu et al., 2011). Still, most of these methods require library construction in E. coli before interaction screening in yeast or rely on yeast mating and thus necessitate two separate protein libraries to transform MATα and MATa yeast. High-throughput bacterial two-hybrid assays have been developed, which provide a non-eukaryotic alternative for screening PPIs (Boldridge et al., 2020). Concomitantly with the development of experimental methods, custom workflows for analyzing HT-Y2H data were developed (Banerjee et al., 2021; Velásquez-Zapata et al., 2021).
Here, we introduced MP3-seq, a massively parallel Y2H workflow for measuring PPIs using sequencing. In MP3-seq, the identity of each protein is encoded in a DNA barcode, and the relative barcode-barcode pair abundance before and after a selection experiment serves as a proxy for interaction strength. Plasmids encoding the protein pairs of interest and their associated barcodes and all other elements required for Y2H experiments are assembled in yeast through homologous recombination. Thus, the experimental workflow bypasses the need for plasmid cloning in E. coli or yeast mating. We first validate MP3-seq using well-characterized coiled-coil heterodimer interactions and synthetic binders for different members of the human BCL2 protein family. Then, we apply MP3-seq to characterize interactions between rationally designed 2×2 and 1×2 heterodimers and demonstrate that it scales to measuring over 100,000 PPIs in a single experiment. We identify successful designs and use a greedy algorithm to findy potentially orthogonal subsets. Finally, we use MP3-seq to delve into the elements expected to confer 2×2 interaction specificity by screening variations of a successful 2×2 pair.
Results
MP3-seq workflow
In MP3-seq, all molecular components required for measuring interactions between two specific proteins are encoded on a single plasmid (Figure 1A). A plasmid library is constructed directly through homologous recombination in yeast to measure all possible interactions for a set of proteins. To do so, we transform haploid MATa-type yeast with a mixture of DNA fragments. One of these fragments is a backbone carrying a centromere sequence, the selection marker, a DBD, and an AD. On average, the centromere sequence ensures that only one pair of proteins is expressed in each cell (Scanlon et al., 2009). We use a Cys(2)His(2) zinc-finger domain of the mouse transcription factor Zif268 (McIsaac et al., 2013) as the DBD and its corresponding promoter to drive the expression of the growth essential enzyme his3. The herpes simplex virus-derived protein domain VP16 is used as the AD. Additional fragments each contain one of the proteins of interest and its associated unique barcode separated by a terminator sequence. In the experiments described here, these fragments are ordered as a single oligonucleotide but for longer proteins a barcode can be added to a protein of interest by PCR. Due to their short length and distinct sequences, we used Tsynth23 and Tsynth27 as terminators in most of our experiments (Curran et al., 2015).
A. Barcoded DNA fragments encoding proteins to be screened, a fragment encoding two terminators, and a backbone fragment bearing a centromere signal, a DBD, and an AD are transformed into yeast cells via electroporation. Homology of fragment 5’ and 3’ ends allows plasmid assembly in yeast. The centromere sequence ensures the expression of one pair of hybrid proteins per cell on average. Upon transformation, some cells are stored, and some are subjected to growth selection in media lacking histidine. If the hybrid proteins interact, the DBD and AD form a transcription factor that drives HIS3 expression. Therefore, interacting pairs should be enriched in the population post-selection. Cells are lysed, and barcode-containing fragments are amplified and sequenced. B. Enrichment can be calculated for each PPI i using library size normalized read counts with a pseudo count of the minimum detected value per condition. C. After enrichment calculation, each replicate is screened for autoactivators and corrected with Autotune. Replicate pre- and post-selection barcode counts are merged directly with DESeq2 or split into pseudoreplicates and then merged.
After transformation, we perform selection in media without Tryptophan (Trp) to ensure plasmid maintenance. A portion of the cells is frozen, while the rest undergo selection in media lacking histidine (His). Plasmid DNA is extracted from pre and post-His selection cells, and the barcode-containing regions are amplified (Figure 1A, right). The barcode-barcode amplicons are sequenced, and their relative enrichment can be calculated from barcode counts to serve as a proxy for interaction strength (Figure 1B). Some proteins’ expression or folding may be impacted by fusion with either the AD or DBD; therefore, we test all proteins fused to both the AD and DBD
MP3-Seq analysis pipeline
As a first step in the analysis workflow we calculate the enrichment between the pre and post His selection stages of MP3-Seq normalized by library size (Figure 1B). Next, we detect autoactivators, and optionally replace their His selection values with values extracted from the functioning orientation (see Autotune in Methods); autoactivation is an error mode in Y2H experiments wherein high enrichment values are observed for a protein of interest for all interaction partners, suggesting that the protein non-specifically activates expression of the selection marker (Shivhare et al., 2021). Typically, this behavior is observed fused to either the AD or the DBD but not in both orientations. Note that if an autoactivator is found, its homodimer is not recoverable with Autotune as there is no second His measurement set to use for correction.
Following autoactivator removal, enrichment values can be averaged together to obtain interaction strength values. Alternatively, to correct for variation in the read count distribution between experimental replicates resulting from different sequencing depths, selection times, and other experimental factors, we calculate log2 fold changes (LFC) using the DESeq2 package (Love et al., 2014). DESeq2 calculates differential enrichment across multiple replicates and provides Hochberg-adjusted Wald test p-values, identifying PPIs with LFCs significantly different from an LFC of zero.
In some cases, it is desirable to combine MP3-seq measurements for both fusion orderings (e.g., P1-DBD + P2-AD and P2-DBD + P1-AD -> P1-P2). For example, comparing MP3-seq LFCs to Kds collected for the pair of interest (P1-P2). The simplest method for merging these measurements is applying an operation to both, such as averaging or taking the max of the values. Here, we use an alternative approach and treat each fusion ordering as an independent set of measurements, turning them into two pseudoreplicates. These pseudoreplicates are then combined with DESeq2 to calculate LFCs (Figure 1B). We refer to these values as pseudoreplicate LFCs (P-LFCs).
MP3-seq benchmarking with orthogonal coiled-coil dimers
To validate MP3-seq, we screened 144 pairwise interactions between six orthogonal 1×1s from Lebar et al. (Lebar et al., 2020) (Figure 2A). A pool of 24 DNA fragments where each of the 12 proteins was fused to either the AD or DBD was ordered. Then, all interaction pairs were assembled in a pooled experiment, and interaction strengths were quantified with MP3-seq. We confirmed that selections were occurring as expected by checking pre-His selection barcode counts remained steady over time (Figure S1A), that applying different levels of 3-Amino-1,2,4 triazole (3-AT), a competitive inhibitor of His3, affected post-His selection counts (Figure S1B-C), and that counts correlated more for post-His than pre-His counts between replicates (Figure S1E). LFCs for all orientations were calculated from three experimental replicates which confirmed that the interactions occur almost exclusively between intended partners (i.e., P1:P2, P3:P4, etc., Figure 2B). Homodimer plasmid assembly is less efficient in MP3-Seq and generally results in lower input barcode counts for these PPIs, but coverage was sufficient for inclusion in the analysis. This phenomenon is likely due to increased sequence homology, which interferes with the correct ordering of DNA fragments during plasmid assembly. For a more quantitative analysis, we correlated MP3-seq LFCs with luciferase expression assay interaction scores in HEK293T cells from (Lebar et al., 2020) and found good agreement (R2=0.83, Figure 2C). Figure 2D represents P-LFC MP3-Seq data for these PPIs as a graph, where each protein is a vertex and edges are significant measured interactions weighted by P-LFC.
A. P-series 1×1s from (Lebar et al., 2020) with complementary heptads designed to interact shown in like colors. B. MP3-Seq LFC of the P-series coil interactions. All values were calculated from 3 experimental replicates except for those labeled “2,” where only 2 were available. Yellow outlines denote on-target interactions. C. Correlation of the on- and off-target P-series MP3-seq LFCs with fold activation fluorescence values from (Lebar et al., 2020) D. Filtering P-series Jerala interactions and E. P, PA, and N series designed coil interactions (Plaper et al., 2021) to include only those with padj ≤ 0.05. Line weights correspond to MP3-Seq P-LFCs. F. A designed 1×1 and its truncations from (Thomas et al., 2013) G. MP3-Seq enrichment for the AN and BN coils and their truncations. H. Correlation of MP3-Seq P-LFCs with Kd values from (Thomas et al., 2013) for the AN and BN truncations.
In a separate all-by-all experiment with 28 proteins, we screened the P-series 1×1s and two sets of 1×1s (N1,N2,N5-N8 and P5A-P8A) derived from the P-series but modified in AN attempt to increase the thermodynamic stability of their on-target interactions (Plaper et al., 2021). N1 exhibited strong auto-activation properties when fused to the DBD and these data were infilled with AD fusion measurements using Autotune. In this experiment, we expect related coils to have similar interaction patterns (e.g. P5A can bind to P6A, N6, and P6) which agrees with the interaction graph created from significant MP3-Seq P-LFCs in Figure 2E.
To assess if MP3-seq values provide quantitative information about interaction strength, we measured interactions between a set of varying-length 1×1s designed to span a wide range of Kds (Thomas et al., 2013) (Figure 2F). We performed AN all-by-all measurement; these interactions are not expected to be orthogonal as (Thomas et al., 2013) achieved weaker interactions by truncating the two parent four-heptad binders (A N4, B N4) by a half or full heptad (Figure 2G). The MP3-seq P-LFCs correlated very well with previously measured dissociation constants over approximately three decades (r2=0.94, Figure 2H). We note that A N4 was a weak auto-activator when fused to the DBD, and these data were infilled using Autotune for the P-LFC values in Figure 2H.
MP3-seq benchmarking with BCL2 family binders
To validate MP3-seq outside of 1×1 interactions, we tested a set of proteins previously characterized by biolayer interferometry (Berger et al., 2016) and Alpha-seq (Younger et al., 2017) comprised of six homologous proteins from the BCL2 family (Bcl-2, Bcl-xL, Bcl-w, Mcl-1, Bfl-1, Bcl-B) and nine de novo designed inhibitors of those homologs.
A crystal structure of one of the synthetic binders bound to its target is shown in Figure 3A, while Figure 3B shows the domain organization of the six human proteins. All inhibitors were designed to interact with the BH3 domain binding pocket of the homologs. To better compare with the HT-Y2H method Alpha-seq, a truncated version of Mcl-1 was used. AN MP3-seq screen of all BCL2 homologs against all inhibitors is shown in Figure 3C. AN older version of MP3-seq was used for the first two replicates of this experiment. Unlike the current version, barcodes were inserted into the 3’UTRs of the binders upstream of the terminators and could thus differentially impact mRNA stability and protein levels. However, the inter-replicate correlations suggest that the data is consistent between versions (Figure S2A).
A) Colored crystal structure of Bcl-2 and its designed BH3 binding inhibitor from (Berger et al., 2016) (PDB: 5JSN). Binder in blue, Bcl-2 in gray with different domains in the binding region highlighted in color. (B) BH3 binding domain annotations of the six human Bcl homologs measured by (Berger et al., 2016; Younger et al., 2017) with the experimental truncation used by Younger and in this work shown. C) MP3-Seq P-LFCs for inhibitor and Bcl-2 interactions. Yellow boxes highlight intended on-target interactions D) Correlations with Kd measurements from biolayer interferometry. E) MP3-Seq P-LFC (F) and Alpha-Seq distributions for interactions that failed Kd measurement due to instrument detection limits.
Only binders beginning with alpha correspond to the final, specific designs of (Berger et al., 2016), while the others are intermediate or failed designs. This can be seen in Figure 3C, with a divide between the largely orthogonal rightmost four columns of the heatmap of MP3-seq P-LFCs and the left, less specific columns.
Our data agree well with dissociation constant measurements obtained from biolayer interferometry by (Berger et al., 2016) (R2=0.54, Figure 3D). We also exhibit good agreement with Alpha-Seq percent survival for both their low-throughput pairwise assay and high-throughput batched assay (r2: 0.45, r2:0.69, Figure S3). Some variation between our results and those published earlier might be due to the different interaction measurement settings. In our case, interactions are measured with proteins expressed in yeast, while biolayer interferometry uses purified proteins, and Alpha-seq displays proteins on the yeast surface.
Some of the BCL2 inhibitors failed to produce Kd values when measured with biolayer interferometry, likely because the interactions were weak and below detection limits. We examined the failed (n=11) versus successfully detected interactions (n=43) and found that the mean MP3-seq value of PPIs for which biolayer interferometry succeeded was significantly greater than for the subset for which biolayer interferometry failed (independent t-test, H0: µsuccess > µfail) (Figure 3E). Similarly, pairwise Alpha-Seq also had the mean of successful interactions significantly greater than that of the failures. However, batched Alpha-Seq, the high throughput version of the assay, did not exhibit this (Figure 3F). Together, these results show that MP3-seq can work with globular proteins in addition to coils and that MP3-seq results agree with those obtained by Alpha-seq and biolayer interferometry. The significant difference between the successes and failures provides evidence that MP3-seq excels at separating very weak PPIs from others at high throughput.
Large-scale assay of designed heterodimers
To explore the feasibility of using MP3-seq for large-scale screening, we prepared a library of designed heterodimers (DHDs) following (Chen et al., 2019). On-target pairs were designed as three-or four-helix bundles with buried hydrogen bond networks (HB-Nets) connecting all helices and then split into a helix-turn-helix hairpin and a single helix (1×2s) or 2×2s. Initially, we chose 100 designs for testing (DHD1, Figure 4A), later adding 52 more designs in a second round of experiments (DHD2, Figure 4A). As controls, we selected nine previously tested 2×2s (Chen et al., 2019). We also included several binders derived from these controls through modification of the hairpin loops or truncation of the helices for a total of 35 pairs (DHD0, Figure 4A). Finally, we added AN additional series derived from a common parent 2×2 through truncating heptads (mALB, Figure 4A). For each on-target pair, one protomer is designated “A” and the other “B”. Interactions between and within these groups were tested in MP3-seq experiments of varying sizes. We performed two replicates of AN all-by-all screen including DHD0, DHD, DHD2, and mALb proteins as well as the BCL2 homologs and cognate binders described above, resulting in a matrix of 337×337 =113,569 interactions. Additionally, we performed three replicates of a screen using only DHD1 with a subset of DHD0, and one screen with DHD0, DHD2, and mALB. The older MP3-seq was used for most experiments in this section. Still, the experiment highlights the large experimental scales achievable with MP3-seq and the inter-replicate correlations suggest that the data is internally consistent (Figure S2 B-C).
A. Summary of the synthetic binder types. B. Successful designs for the three main sets. Gray bars are possible pairs for the set, and colored bars indicate the number of successes. C. Top 5 successful designs from each set by P-LFC. Yellow boxes show on-target interactions. D. From left to right, pre-DUET positive, padj ≤ 0.01 P-LFC PPI network for the DHD1 and DHD0 subset, and the network at DUET iterations 50, 75, and 113. E. Pre-DUET network for the DHD0, DHD2, and mALb sets, and the network at DUET iterations 10, 15, and 37 (final). F. Pre-DUET network for all designs and the network at DUET iterations 50, 75, and 114. G. Orthogonality gaps of the DUET final networks without significance filtering. Left and right dashed lines show the MP3-seq orthogonality gaps for the BCL-2 and inhibitor interactions and the P-series interactions, respectively. Yellow squares correspond to half of the starting networks remaining. H. DUET final networks reduced to half their final iteration size.
We applied autotune corrections and symmetrized the read counts to calculate P-LFCs for all possible pairs. On-target interactions, where the A and B protomers were designed to interact, were evaluated for success. In this case, we defined a ‘success’ as any PPI with a padj ≤ 0.01 and a P-LFC ≥ 4. The number of successes out of the total possible on-target PPIs in each design set can be seen in Figure 4B. Every design set had AN approximately 20% success rate in the largest screen. This is consistent with the 22% success rate for a-Helical bundles (Ljubetič et al., 2017a). The top 5 P-LFC on-target interactions for each design category from the are shown in Figure 4C. Note that the successes, though strong in the intended interaction, are not necessarily orthogonal with other designs, even when examining only the PPIs of the A and B protomers of the top five designs.
Finding orthogonal subsets
For all four sets of designs (DHD1-3, mALB), most measured interactions were off-target interactions between protomers screened in the same experiment (e.g. A1 and B2 of DHD1, instead of A1 and B1). For example, in the all-by-all screen with all of DHD1-3 and mALB, only 33 of 914 total PPIs with padj ≤ 0.01 and P-LFC ≥ 4 were on-target.
Given the number of strong but unintended interactions, we asked whether we could extract potentially useful orthogonal PPI subsets from the data. To this end, we used significant (alpha = 0.01), positive P-LFC values to construct a weighted undirected graph, as in Figure 2D-E. Our goal can then be rephrased as finding a graph of degree one vertices (1-regular graph) without self-edges. To do this, we developed a simple scoring function that rewards graphs based on existing orthogonal edges or those over a desired orthogonality cutoff and punishes graphs for non-orthogonal edges. This scoring function was used in a greedy graph reduction method, Deleting Undirected Edges Thoughtfully (DUET), which removes a vertex and its associated edges each iteration until a 1-regular graph remains (see Methods). We applied DUET to different data sets. For DHD1 and the DHD0 subset, we went from 2,001 edges between 202 vertices to 18 DUET pairs (Figure 4D), for the DHD0/DHD2/mALb data from 279 edges between 85 vertices to 11 DUET pairs (Figure 4E), and for all designs (DHD0-2,mALb) from 1,562 edges between 270 vertices to 36 DUET pairs (Figure 4F). Of these, 2, 2, and 4 DUET pairs were on-target for DHD0/DHD1, DHD0/DHD2/mALb, and all designs, respectively.
The DUET final results are only orthogonal if all non-highly significant interactions are considered truly non-interacting. As this may not be the case, we used all P-LFC >0 between DUET pair protomers regardless of significance for a more conservative analysis. First, we removed interactions with protomoers for which the DUET pair P-LFC was lower than the highest non-DUET pair P-LFC. Then, we reduced the remaining DUET pairs by removing whichever pair had the largest non-DUET P-LFC one by one (Figure 4F). When reduced to half the starting DUET pairs (squares in Figure 4F), the orthogonal sets shown in Figure 4G are left, which all have gaps between those of the P-series in Figure 2 and the BCL2-inhibitor pairs in Figure 3.
Elucidating the rules of specific helix-loop-helix binding
High-throughput, high-quality data can be used to probe novel design rules to improve protein design. Here, we demonstrate how MP3-Seq can be used to understand how hydrogen bond networks and binder length can confer binding specificity in a designed pair by generating and testing variants of a target design. Our target pair, mALb8, is a high-affinity 2×2 heterodimer with three HB-Nets between its A and B protomers and was included in the mALb set introduced earlier (Figure 5A).
A. The mALb8 helix-loop-helix heterodimer with HB-Nets 1 and 2 shown. B. mALb8 and the two shorter variants (T1; one helical turn shorter and T2; two helical turns shorter). C. LFC of the truncated mALb8 interactions. Gray X squares indicate insufficient reads. We observed a markedly non linear response to the length of the binder. T2 variants which are 13 residues shorter than the original do not bind. D. AN example of HB-Net removal using the small and large hydrophobic replacement protocols for the first HB-Net E. MP3-seq log2 enrichment for the mALb8 with removed HB-Nets. S and L designate the replacement protocol used, while R1, R2, or R12 denote if the first, second, or both HBNets were replaced. Gray X indicates insufficient reads. The yellow boxed regions show that two HB-Net mismatches are enough to specify orthogonality. Hydrophobic residues alone cannot confer specificity.
First, two mALb protomer truncations were designed, removing one or two turns from the end of each alpha helix (Figure 5B). Truncation 1 (T1) versions are one turn shorter than the original monomers, while Truncation 2 (T2) versions are two full turns shorter (see Figure S4). The original binders and the truncations were screened in AN all-by-all screen with MP3-Seq and LFCs were calculated from two replicates. It can be seen from Figure 5B that truncating the binders by two turns (T2) eliminates binding activity. The relationship between length and binding affinity is highly non-linear, since binding disappears once the minimum length is reached. We can use this data to infer the shortest possible length (3.5 heptades) for 2×2 binders containing two buried HB-Nets. This pattern is similar to that seen for the AN and BN single helix truncation experiments in Figure 1D-E.
To probe the specificity conferred by the buried HB-Nets in the 2×2 binders designed with the approach from (Chen et al., 2019), we have removed either the first, the second, or both buried HB-Nets from the mALb8 dimer. This was done by replacing HB-Net amino acids in both A and B protomers simultaneously with hydrophobic residues that can not form hydrogen bonds. Two Rosetta-design replacement protocols were used ;one using smaller and one using larger hydrophobic residues for replacement.. The first network’s replacements in A and B can be seen in Figure 5D, while all replaced designs can be found in Figure S5. We observe that the designs with removed HB-Nets (R variants) largely do not homodimerize, as interactions between various A and B protomers are low. These results show that, at minimum, two hydrogen bond network mismatches are needed to confer specificity and orthogonality (Figure 5E, boxed areas). For example, the binding of AR1 and BR2 would bury half of the hydrogen bond network on protomer A and half of the hydrogen bond network on protomer B. The binding of AR12 and BR2 would result in only one HB-Net mismatch on chain B. The binding data suggest that pairs with one HB-NetT mismatch can still exhibit partial binding affinity, for example BR12(S) weakly binds to both AR1 and AR2. Additionally, hydrophobic residue size alone is not sufficient to confer orthogonality. For example, both BR1(S) and BR1(L) bind to both AR1(L) and AR1(S).
Discussion
Here, we introduce MP3-seq, AN easy-to-use HT-Y2H method that can measure pairwise PPIs in a single yeast strain. In MP3-seq, the identity of each protein is encoded in a DNA barcode, and the relative barcode-barcode pair abundance before and after a selection experiment serves as a proxy for interaction strength. We also developed a data analysis workflow based on DESeq2 that makes it easy to merge replicates, remove autoactivators, and identify statistically significant interactions. In future work, the MP3-seq workflow could be further generalized by adding autoactivator screening or adjusting the selection scheme to eliminate autoactivators (Shivhare et al., 2021). MP3-seq could also be combined with protein stability and expression assays to confirm that proteins tested in interaction screens are folded correctly and that a negative interaction measurement is not due to reduced protein levels. In particular, it is easy to imagine integrating MP3-seq with other growth selection-based workflows such as Stable-seq (Kim et al., 2013). We expect stability screening to be more important for human protein than for the synthetic binders tested here, which are generally designed to have very stable folds.
We validated MP3-seq using several sets of proteins for which interactions had been previously characterized. In particular, we used a family of human pro-apoptotic proteins BCL2 and their de novo designed inhibitors, and three different sets of coiled-coils peptides. We generally found quantitative agreement between our results and those reported previously. We then applied our method to characterize interactions in a pool of de novo coil-based heterodimers that contain buried hydrogen bond networks. We showed that our approach could scale to measuring more than 100,000 interactions in a single experiment. Our computational workflow enabled us to identify potentially orthogonal subsets, at various orthogonality gaps, within larger all-by-all screens.
Moreover, by screening interactions between protomers with truncations and modified hydrogen bond networks, we probed important design rules for 2×2 binders with two buried HB-nets. Firstly, we showed that the minimum length for strong binding is 3.5 heptads and that binding affinity drops sharply for shorter designs. Next, we showed that at least two hydrogen bond network mismatches are needed for orthogonality, thereby setting minimum requirements for future sets of orthogonal 2×2 binders with buried HB-nets.
We hope that the increased scale and streamlined workflow of MP3-seq will further accelerate the adoption of HT-Y2H methods and facilitate their use in additional applications. For example, we expect that high-throughput data collected with MP3-seq could be used to train predictive models of PPIs and that such models could be combined with generative algorithms to iteratively design heterodimers with desirable properties or with neural network interpretation to uncover determinants of binding specificity. Moreover, MP3-seq could be applied to characterize interactions between human proteins or to screen variants’ impact on PPIs at high throughput.
Methods
Experimental workflow Plasmids and fragments
All protein binder coding sequences were ordered as double-stranded gene fragments from Twist Biosciences. In the gene fragment, the binder’s sequence is preceded by the coding sequence for a peptide linker, either the poly-Glycine-Serine (G4S) linker or the Glycine-Glycine Poly-Serine (GGS4) linker for the DBD and AD fusion proteins, respectively. This sequence also serves as a primer handle for initial fragment amplification and a homology sequence during plasmid assembly in yeast. The binder’s sequence is followed by a stop codon and a synthetic terminator from (Curran et al., 2015). Tsynth23 and Tsynth27 are used to terminate the transcription of DBD and AD fusion proteins, respectively. The terminators are followed by 21nt PCR handles for barcode amplification and predetermined 20nt long DNA barcode sequences. Barcodes were followed by 22nt long insulation sequences. This insulation sequence serves as a homology sequence during plasmid assembly in yeast.
The plasmid vector contains 30nt end homologies with the G4S peptide linker and GGS4 peptide linkers in the binder fragments for assembly in yeast. The vector also contains the AD (VP16) and the DBD (zinc finger DNA-binding domain of mouse transcription factor Zif268). AN ColE1 origin and beta lactamase expression for antibiotic resistance are also included to enable cloning in E. coli for troubleshooting. The vector uses a CEN/ARS sequence for yeast replication, expressing the plasmid genes on a stable, independent mini chromosome. TRP1 expression is included for tryptophan selection.
Competent Yeast Cell Preparation
The yeast transformation protocol from (Benatuil et al., 2010) was adapted for this workflow. S. cerevisiae cells (EBY100) were grown overnight to stationary phase (OD600 around 3.0) in YPD media (10 g/L yeast nitrogen base, 20 g/L Peptone, and 20 g/L D-(+)-Glucose) on a platform shaker at 225 rpm and 30 °C. The following morning, we inoculated 100 mL of YPD media with the overnight culture at AN initiate 0.3 OD600. The inoculated cells were grown on a platform shaker at 30 °C and 225 rpm until OD600 was approximately 1.6 (or for about 5 hours). Yeast cells were collected by centrifugation at 3000 rpm for 3 minutes and washed twice with 50 mL ice-cold water and once with 50 mL of ice-cold electroporation buffer (1 M Sorbitol / 1 mM CaCl2). Cells were resuspended in 20 mL 0.1 M LiAc/10 mM DTT in a culture flask and incubated for 30 mins at 30 °C and 225 rpm. We conditioned the cells by collecting them with centrifugation, washing once with 50 mL ice-cold electroporation buffer, and resuspending in 100 to 200 μL electroporation buffer to reach a final volume of 1 mL. This corresponds to approximately 1.6 × 109 cells/mL and is sufficient for two electroporation reactions of 400 μL each. The cell culture and preparation can be proportionally scaled up if more electrocompetent cells are needed to make larger or more libraries.
Library Transformation
Prepared electrocompetent yeast cells were kept on ice until electroporation. We prepared the DNA by combining 4 μg of digested vector backbone and 12 μg of each DNA insert for each 400 μL electroporation reaction. The DNA mixture was reduced via precipitation and resuspension as necessary to reach a volume of less than 50 μL. We combined 400 μL of electrocompetent cells and the DNA mixture and transferred it to a pre-chilled BioRad GenePulser cuvette (0.2 cm electrode gap), then put it on ice for 5 minutes. Cells were electroporated at 2.5 kV and 25 μF. Typical time constant ranges ranged from 3.0 to 4.5 milliseconds. Electroporated cells were transferred from each cuvette into 8 mL of 1:1 mix of 1 M sorbitol: YPD media in a culture flask and incubated on a platform shaker at 225 rpm and 30°C for 1 hour. Cells were collected by centrifugation and resuspended in synthetic complete media lacking tryptophan amino acid, SC-TRP media, (20 g/L glucose, 6.7 g/L yeast nitrogen base without amino acids, 5.4 g/L Na2HPO4, 8.6 g/L NaH2PO4⋅H2O and 5 g/L casamino acids [CSM-TRP]). 250 mL SC-TRP media were used for every 400 μL transformation reaction. The optical density value OD600 of resuspended cells varied between 0.3 and 0.4 depending on the experiment. A small aliquot of the resulting cell suspension was diluted 10−3, 10−4, 10−5, and 10−6. We plated 100 μL of each dilution onto SC-TRP agar plates and incubated at 30 °C in a static incubator. Library size was determined from the colony counts on the agar plates after three days of incubation. The remaining transformed cells in SC-TRP media were grown in a baffled flask overnight at 225 rpm and 30 °C. The resulting transformed library was glycerol stocked the following day after about 20 hours of incubation at OD600 of 1.0 by combining 750μL of culture with 750μL of 50% v/v sterile glycerol in a cryovial. Glycerol stock was stored at -80°C.
Library Selection
After the 20-hour incubation period above, the passage process was started by diluting 20-hour transformed library cell culture in a clean autoclaved baffled flask with fresh 250 mL of SC-TRP media at a 1:400 ratio and incubating at 30 °C and 225 rpm. Alternatively, the transformed library can be inoculated from a glycerol stock at the same dilution and final volume. The cells were grown to approximately 1.0 OD600 value at which point the dilution procedure was repeated as needed with the same parameters until 36 hours had passed.. This passage process allows yeast cells to drop redundant plasmid copies and achieve AN average of a single plasmid per cell. At the end of the passage process, a fraction of cells was inoculated in a baffled flask with SC-HIS-TRP media (20 g/L glucose, 6.7 g/L yeast nitrogen base without amino acids, 5.4 g/L Na2HPO4, 8.6 g/L NaH2PO4⋅H2O and 5 g/L casamino acids [CSM-TRP-HIS]) at a 1:400 ratio to start selecting for interactions. The remaining SC-TRP cells were collected by centrifugation at 5000rpm (3000xg) for 5 minutes and resuspended in 200 μLof Zymo Research Solution 1 Digestion Buffer per 50 mL of cells and stored in -80°C freezer until plasmid extraction. At least 100-fold over library size number of cells were collected. The selection for interaction step usually lasted between 24 and 36 hours depending on the library and concluded when cells in the SC-HIS-TRP flask reached OD600 value of 1.0. At that point, cells were collected by centrifugation, resuspended in Solution 1 buffer, and frozen at -80°C.
Yeast Plasmid Extraction Protocol
Cells after TRP selection and HIS selection were collected by centrifugation as described in the library selection protocol. Cells were resuspended in 200 μL Zymo Research Solution 1 Digestion Buffer per 50 mL of cell culture. Resuspended cells were frozen at -80°C. We then thawed the cells and added 20 μL of Zymolyase per each 200 μL volume of cells. Cells were then incubated at 37°C for 4 hours, mixing once per hour by inverting the tubes ∼ 20 times each. Cells were again frozen at -80°C and thawed. 200 μL of Zymo Research Solution 2 Lysis Buffer was added to each sample, inverted to mix, and incubated at room temperature for 3-5 minutes. We then added 400 μL Zymo Research Solution 3 Neutralizing Buffer to each sample and inverted to mix. Lysed cell were centrifuged at max speed (about 17,000 xg) for 5 minutes. Supernatant was transferred to a new microcentrifuge tube and spun again at 17,000 xg for 5 minutes. Supernatant was transferred to Zymo-Spin I Columns in a collection tube and centrifuged at 13000 xg for 30 seconds. Flowthrough was discarded. Columns were then washed by adding 550 μL of Zymo Research DNA Wash Buffer onto the column and centrifuged for 1-2 minutes at 13,000 xg. Flowthrough was discarded and wash was repeated. Empty column was spun for 1 min at 13,000 xg to dry. DNA was then eluted by adding 20 μL of molecular grade water to the column, allowing it to incubate for 5 minutes at room temperature, and spinning for 1 min at 13000 xg. Eluate was then reloaded onto the column and the spin was repeated.
Quantitative PCR (qPCR) and Next Generation Sequencing Preparation
Using plasmid as the template DNA, the first round of qPCR amplified the barcode region and added a 20nt long unique molecular identifier (UMI) and Illumina Read1 and Read2 primer sites. KAPA HiFi HotStart ReadyMix and suggested reagent and PCR cycle parameters were used. The annealing temperature was 67°C. The reaction was allowed to proceed until the qPCR curve reached AN inflection point, and the resulting fragments were purified using KAPA Pure Beads. The resulting product was used as the template for the second stage of qPCR to add sample indexes and P5/P7 flow cell adaptor sites. This reaction was allowed to proceed for 6-7 cycles and used the same parameters as the previous qPCR reaction. AN Agilent real-time PCR machine was used for all qPCR steps.
Analysis Workflow Barcode counting
Fastq files were obtained from Illumina NextSeq 550 or MiSeq sequencers using bcl2fastq with the default parameters. Cutadapt (Martin, 2011) was then used to trim out fastq barcode regions, and barcodes were clustered using Bartender (Zhao et al., 2018) with d=2.
Autoactivator screening
We defined autoactivators as proteins for which the trimmed interquartile mean of the library normalized enrichments is AN extreme positive outlier (interquartile range > 3) with respect to the other assayed proteins. Autoactivator screening was carried out for both possible domain fusions.
Autotune autoactivator correction
Following autoactivator detection, the non-autoactivating alternate fusion ordering data was used to infill the pre-His selection read count according to Equation 1 and the post-His barcode read count according to Equation 2.
Where FA values are for the fusion order values which has autoactivation, FNA are for the fusion order values which does not have autoactivation, and i is the PPI in question. The summation of the His counts for selection are done pre-update of all new barcode count corrections. Note that since there is only set of values for homodimers there can be corrections made.
For Autotune of homodimers, Equation 3 is used to infill post-His selection values for both pseudoreplicates, using the same starting pre-His selection barcode count.
Where F is the fusion order being used to construct the pseudoreplicate, and i is the homodimer PPI.
Undetected PPI filtering
For all experiments, if a PPI had a barcode count of 0 in the pre-His and post-His selection stages, it was considered undetected. Otherwise, if it was only undetected in one stage, the other was infilled using the corresponding minimum (missing pre-His reads were infilled using the minimum of detected pre-HIs reads, etc.). For any missing PPIs, the largest possible set of replicates were used during data processing for DESeq2 analysis.
DeSeq2 Settings
DESeqDataSet objects were prepared with raw pre and post-His read counts, without infilling. A contrast using the pre and post-His columns as the conditions was used, and ashr lfcShrink was used to get final Mp3-seq LFC values. For pseudoreplicate MP3-seq, pre and post-His conditions were used along with a fusion order batch effect for a contrast of ∼cond + batch for DESeqDataSet creation. P-LFCs were obtained by ashr lfcShrink.
DUET
A Pre-DUET weighted undirected graph G was created from MP3-seq values with padj ≤ 0.01 and P-LFC > 0, and vertices with degree 0 removed. Then, until |G|=0 or Δ(G)=1, all G\{v} were scored for with Equation 4, using $c=4$ and the highest scoring vertex removal was used to reduce the graph. The greedy reduction algorithm pseudocode can be seen in Figure 6.
DUET pseudocode

Orthogonality gap reduction
To reduce the DUET pairs, any P-LFC>0 interaction between the set of proteins involved in the DUET pairs was considered a real interaction. Non DUET-pair interactions were sorted by P-LFC, and the largest value interaction was selected to remove a DUET-pair. Whichever protein was involved with the smaller P-LFC DUET-pair and its DUET partner was removed. This was done one protein pair at a time until only one pair remained.
Computational protein design Design of helical bundles
Helical bundles were designed using methods developed in (Chen et al., 2019). Briefly, systematic sampling of Crick coiled-coil parameters was used to independently sample all helices of the heterodimers supercoiled around the same axis. The Rosetta HB-Net protocol (Boyken et al., 2016) was used to design buried HB-Nets in the core of the designs.
Hydrogen bond network removal
The design model of mALB8 was used as a starting point. The two hydrogen bond networks were manually annotated (R1 has residues 23+40+84+96, R2 has residues 16+43+103), see Figure S5. Two alternative protocols were used for each redesign. Either a FastRepack with a fixed static backbone, which results in replacement with smaller amino acid residues (T, A), or a FastDesign protocol, where the backbone was allowed to move, which resulted in replacement with larger amino acid residues (V, L, F).
The residues of the HB-net and all residues within a 6 A shell were redesigned, and a shell of 8 was repacked. The beta_nov2016 score function was used. The charge was fixed to -5 for each chain. Five sequences were made for each design. The sequences were filtered by score_per_res>-3.5, shape complementarity >0.7, sbuns<2, and ranked by ddg (the lowest ddg was chosen for the final design). Rosetta version 2021-07 (c04b624d89b45b55f573bbf2f6685ffbf2e4fcaf) was used for the calculations.
Structure visualization
Visualization of 3D structures
All structural images were generated using PyMOL version 2.5.2.
Structure prediction
For non-DHD proteins without PDB codes noted, structures for visualization purposes were generated with ESMFold (Lin et al., 2022).
Data and code availability
Barcode count data and scripts necessary to recreate the MP3-seq pipelines and analysis can be found at https://github.com/Seeliglab/MP3-DUET-AUTOTUNE
Supplementary Figures
A. Pre-His selection barcode counts hold steady over three samples taken from a P-series all by all screen at 24, 44, and 60 h. B. Two concentrations of 3-AT were tested during selection for two P-series all-by-all replicates. C. Higher concentration 3-AT post-His selection barcode counts are lower when normalized to library size compared to lower 3-AT selection, as is expected. D. Relative changes between the two 3-AT conditions for both replicates. E. Correlation of normalized barcode counts for pre and post-His selection conditions. Note the drastic increase in on-target P-series normalized barcode counts between conditions, and that overall, correlation is weaker pre-His than post-His selection.
A. Correlations of BCL-2 and inhibitor P-LFC for replicates and combined replicates. R3 was collected with the final assay version, while R1 and R2 were collected with the earlier. B. Correlations between replicates and combined replicates for the DHD0/DHD1 proteins. Subscripts denote size of the library for all the all-by-all screen. C. Correlations between replicates and combined replicates for the DHD0/DHD2/mALb proteins. Subscripts denote size of the library for all the all-by-all screen.
A. MP3-Seq P-LFC versus batched survival for only interactions which succeeded biolayer interferometry B. MP3-Seq P-LFC versus batched survival for all BCL-2 homolog and inhibitor interactions C. MP3-Seq P-LFC versus pairwise survival for only interactions which succeeded biolayer interferometry D. MP3-Seq P-LFC versus pairwise survival for all BCL-2 homolog and inhibitor interactions
Original mALb8 protomers at top of rows, followed by T1 and T2 variants
Original mALb8s are at tops of the rows. Names with x1 correspond to R1, x2 to R2, and x12 to R12. The small replacement protocol is rprtc and large replacement protocol is fdrtc.
Acknowledgments
This work was supported by NIH Award R01GM120379 and ONR Award N00014-16-1-3189 to G.S. A.L. would like to acknowledge funding of European Commission MSCA CC-LEGO 792305 and Slovenian research agency project CC-TRIGGER J1-4406.