Abstract
We present a method for high-throughput screening of protein variants where the signal is enhanced by micro-encapsulation of single cells into 20-30 μm agarose beads. Cells inside beads are propagated using standard agitation in liquid media and grow clonally into micro-colonies harboring several hundred bacteria. We have, as a proof-of-concept, analyzed random amino acid substitutions in the five C-terminal β-strands of the Green Fluorescent Protein (GFP). Starting from libraries of variants, each bead represents a clonal line of cells that can be separated by Fluorescence Activated Cell Sorting (FACS). Pools representing collections of individual variants with desired properties are subsequently analyzed by deep sequencing. Notably, the encapsulation approach described holds the potential for high-throughput analysis of systems where the fluorescence signal from a single cell is insufficient for detection. Fusion to GFP, or use of fluorogenic substrates, allows coupling protein levels or activity to sequence for a wide range of proteins. Here we analyzed more than 10,000 individual variants to gauge the effect of mutations on GFP-fluorescence. In the mutated region, we observed virtually all amino acid substitutions that are accessible by single nucleotide exchange. Lastly, we assessed the performance of biophysical protein stability predictors, FoldX and Rosetta, in predicting the outcome of the experiment. Both tools display good performance on average, suggesting that loss of thermodynamic stability is a key mechanism for the observed variation of the mutants. This, in turn, suggests that deep mutational scanning datasets may be used to more efficiently fine-tune such predictors, especially for mutations poorly covered by current biophysical data.