Abstract
Structures of two globular proteins were determined ab initio using microcrystal electron diffraction (MicroED) data that was collected on a direct electron detector in counting mode. Microcrystals were identified using a scanning electron microscope (SEM) and thinned with a focused ion-beam (FIB) to produce crystalline lamellae of ideal thickness. Continuous rotation data were collected using an ultra-low exposure rate on a Falcon 4 direct electron detector in electron-counting mode. For the first sample, triclinic lysozyme extending to 0.87 Å resolution, an ideal helical fragment of only three alanine residues provided initial phases. These phases were improved using density modification, allowing the entire atomic structure to be built automatically. A similar approach was successful on a second macromolecular sample, proteinase K, which is much larger and diffracted to a modest 1.5 Å resolution. These results demonstrate that macromolecules can be determined to sub-Ångström resolution by MicroED and that ab initio phasing can be successfully applied to counting data collected on a direct electron detector.
Main
The cryogenic electron microscopy (cryoEM) method microcrystal electron diffraction (MicroED) can be used to determine the atomic structures of inorganic materials, small organic molecules, natural products, soluble proteins, and membrane proteins from vanishingly small crystals1. The data are collected as a movie on a fast camera while the stage of the electron microscope is continuously rotating the crystal in a parallel electron beam2. Since the method is analogous to the rotation method used in X-ray crystallography3, data processing is conducted using standard X-ray crystallography software. For MicroED of small molecules and short peptides which typically diffract to atomic resolution, phases are commonly determined using ab initio direct methods4. Phases for electron diffraction can also be determined from images recorded on a transmission electron microscope (TEM)5–7, as demonstrated for two-dimensional membrane protein crystals. For MicroED data from three-dimensional macromolecular crystals, phases have thus far only been determined by molecular replacement1. Molecular replacement for MicroED data has been demonstrated with distant homologues8,9 or extracted fragments from homologues10, but there have been no reports of successful phasing in the absence of a previously known model. A major obstacle for phasing using idealized fragments in macromolecular MicroED data has been phase improvement10; to date, density modification algorithms have persistently failed to improve the maps for any MicroED data from macromolecules. Thus, even if idealized fragments had been placed accurately, phasing was intractable. Robust phasing by any means other than molecular replacement has remained elusive for macromolecular MicroED1.
MicroED data have been collected using a variety of camera systems and microscopes. While a charge-coupled device (CCD) can be used11, these cameras tend to be slow and relatively insensitive, which makes data collection by continuous rotation difficult. Cameras based on complementary metal-oxide-semiconductor (CMOS) technology allow faster readout and often have better signal-to-noise ratios1,12. The shorter dead time between frames allows CMOS cameras to operate in rolling-shutter mode, which facilitates continuous rotation experiments2. Hybrid pixel detectors (HPDs) have also been successful for collecting macromolecular MicroED data9,13. HPDs have a smaller array of large pixels that can complicate data collection when working with large unit cells such as proteins. In contrast, direct electron detectors offer distinct advantages and were pivotal for the “resolution revolution” in single-particle cryoEM14,15. In electron-counting mode, these low-noise cameras can resolve individual electron events. The increased accuracy of electron counting in combination with faster readout rates promises to deliver superior data quality for MicroED. Direct electron detectors have been used in prior MicroED investigations, but only in integrating (linear) mode16,17. It was believed that the number of electrons in the diffraction spots was too high to reasonably count given the low frame rate of the detector16.
Here, a structure of triclinic lysozyme at 0.87 Å using a Falcon 4 direct electron detector in electron counting mode is reported (Figure 1). The exposure rate, frame rate, and total exposure time of the MicroED experiment were optimized to allow data collection within the specifications of the camera (Materials and Methods, Supplementary Figure 1). Using this approach, far superior data was collected when compared with previous studies, and phases were determined ab initio using an idealized fragment of only 15 atoms. Initial phases from these atoms were improved by density modification18, resulting in a map showing individually resolved atoms for essentially the entire protein. A model was built automatically into this map using standard crystallographic software19 without user intervention. To test the robustness of this approach, data were collected from crystals of the serine protease proteinase K (Supplementary Figure 2). The data extended to 1.5 Å resolution with better statistics than previously reported structures of this protein using other detectors20,21. This structure was initially phased by automatically placing four ideal helices followed by successive rounds of density modification and backbone tracing. The backbone model was then completed by automated means. This study demonstrates electron-counting MicroED data collection to sub-Ångström resolution using a direct electron detector. Even at near-atomic resolutions, density modification could be successfully applied, leading to ab initio phasing of macromolecular MicroED data. These structures set a new benchmark for achievable quality of MicroED data and increase the scope of possible experiments targeting proteins with unknown structures.
Crystals of triclinic lysozyme were grown in batch (Materials and Methods). The crystals were visible under a light microscope and were initially about 10 μm in size. A slurry of these microcrystals were vitrified on EM grids. Grids were loaded into a dual-beam FIB/SEM where individual crystals that were at least 5 μm from a grid bar and at least 3 grid squares away from the edge of the grid were identified. The crystals were then coated in multiple layers of platinum for protection from the ion beam during the milling process. Each crystal was then milled into a thin lamella approximately 300 nm thick, or about the ideal thickness for MicroED data collection at 300 kV accelerating voltage based on prior mean free path measurements22 (Figure 1A, B, Materials & Methods).
The milled lamellae were then loaded under cryogenic conditions into a Titan Krios. Lamellae sites were identified by low-magnification montages, where each site appeared as a semi-transparent shape suspended above a strip of empty area. 18 out of the 20 milled lamellae were found in the TEM-two did not survive the cryotransfer step. For each lamella, the eucentric height was carefully adjusted. Each lamella was inspected initially at an intermediate magnification to verify that no contamination had built up or compromised the sample (Figure 1B). Each lamella was checked for diffraction and the camera length was adjusted appropriately depending on the attainable resolution before data collection.
MicroED data was collected in electron counting mode using a Falcon 4 direct electron detector (Materials and Methods, Supplementary Figure 1, Figure 1C, D). In counting mode, this camera allows for highly accurate detection of single electron events. However, the number of electrons that can accurately be counted in each pixel of each frame is limited. To assure accurate reporting of the intensities, the exposure rate must be kept very low. This strategy reduces errors caused by too many electrons hitting the same pixel within a readout cycle of the detector, but risks missing weak reflections in the background. These stringent camera requirements could be met by greatly reducing the exposure rate and compensating by increasing the total exposure time (Materials and Methods). This strategy prevents the strong reflections from overwhelming or damaging the detector while sampling the weak reflections at high resolution was achieved at sufficient frequency to recover their intensities (Supplementary Figure 1).
Multiple settings had to be adjusted to achieve a suitably low exposure rate for these experiments in electron counting mode. Importantly the camera’s dose protector, which automatically retracts the camera when the microscope enters diffraction mode, must be disabled (Materials and Methods). The smallest C2 aperture was used coupled with the highest spot size possible on our instrument, 50 μm and spot size 11, respectively. The instrument was kept in microprobe mode to avoid an approximately fivefold increase in exposure rate that occurs by deactivating the condenser mini lens. Since these experiments were conducted on a Titan Krios, the beam size could be changed while maintaining a near-perfect parallel illumination. The beam diameter was spread to 25 μm to reduce the exposure per unit area further. Together, these modifications reduced the total exposure per area by a factor of up to 10 × compared to prior experiments on this instrument23. Further, the data was collected over a 420 s exposure time at 250 frames per second, which was the longest possible on the detector software, coupled to a very slow rotation rate of the stage at either 0.15 or 0.2 °/s, corresponding to a total real space-wedge of approximately 60° or 80°, respectively (Figure 1C). In this way the exposure was spread in space and time, allowing accurate measurements of single electron events even in diffraction mode. As a result, these datasets had total exposures up to 4 × lower than prior investigations for similar macromolecules even though the recording time here was more than twice as long24. The total exposure per lamella was ∼0.64 e-/ Å2. The background noise and total flux on the camera were further minimized by using a 100 μm selected area aperture that corresponds to a region about 2 μm in diameter on the specimen. Under these conditions, essentially all pixel values fall within the linear range of the detector and all are well below the threshold for damaging the detector (Materials and Methods).
The movies from the Falcon 4 were sliced into 0.5 s segments, or 840 individual frames, each 2048×2048 16-bit pixels in size, spanning 0.075°–0.08° rotation. Images were converted to SMV format using the MicroED tools12 adapted for the Falcon 4/EPU-D metadata format. The total size of a compressed MicroED movie in counting mode for these exposures is typically 2.2 GiB, regardless of the number of frames (up to 14,455 in 420 s). The size of these movies reflects the total information content captured. The converted frames were processed using standard crystallographic software25,26. The space group for all 18 lysozyme MicroED datasets were found to be P 1, and the typical unit cell was determined to be (a, b, c,) (Å) = (26.42, 30.72, 33.01), (α, β, γ) (°) = (88.319, 109.095, 112.075) (Table 1). The data were merged to increase completeness. The subsequent merging steps identified two lysozyme lamellae that correlated poorly with the other 16 integrated datasets. These two datasets were discarded. A high-resolution cutoff at 0.87 Å was applied. This corresponds to where the CC1/2, the correlation coefficient between randomly chosen half-datasets28, was significant at the 0.1% level. The overall completeness of the final dataset was 87.5%, with all resolution shells below 1 Å resolution being > 95% complete (Table 1, Supplementary Table 1, and Supplementary Figure 3).
Ab initio phasing was performed by automatically placing idealized helical triple-alanine fragment followed by density modification (Figure 1E, Materials and Methods). A single helical fragment of just three alanine residues was sufficient to determine the entire lysozyme structure. After automatic placement in PHASER29, an interpretable map was produced following 144 rounds of dynamic density modification18, resulting in a map showing individually resolved atoms throughout the entire unit cell where all residues could be identified unambiguously even by eye (Figure 2, Supplementary Movie 1 and 2). The density-modified map (E, PHI) was similar to the final (2mFo-DFc, PHI) map after refinement. A complete model was automatically built into the density modified map given only the sequence19 and without consulting their known structures (Figure 2). For this structure, two C-terminal residues, and several solvent-exposed sidechains were either partially or absent in the map, even after the final refinement. They were also poorly resolved in X-ray investigations of triclinic lysozyme at similar resolutions30. The final models were refined31 using electron scattering factors.
The results from lysozyme using electron counted MicroED data were very promising. However, it was not entirely surprising that sub-Ångstrom data could be determined ab initio even with the larger size of lysozyme compared with small molecules or small peptides. To test if other electron counted MicroED data would perform similarly, this approach was tested again using a sample of the serine protease, proteinase K, which is much larger and more representative of the average size of a globular protein in the cell. The crystals were grown in batch and the sample preparation was very similar to the approach used for the triclinic lysozyme microcrystals. The crystals were identically milled and screened (Supplementary Figure 2). Four proteinase K crystals were found on a single grid that met the selection criteria. MicroED data was collected using an overall similar approach using electron counting (Materials and Methods). It was similarly found that essentially all pixel values fell within the linear range of the detector and are still well below the threshold for damaging the detector (Supplementary Figure 1, Materials and Methods).
The data were indexed, integrated, and scaled using the same software as used for the triclinic lysozyme data. The space group was determined to be P 4 2 2 with a unit cell of, (a = b, c) (Å) = (67.08, 106.78), (α = β = γ) (°) = (90). Two of the four proteinase K lamellae integrated with lower internal consistency and were therefore discarded. A high-resolution cutoff was applied using the same criteria at 1.5 Å, and the overall completeness of the final dataset was 98.8% (Supplementary Tables 2 & 3, Supplementary Figure 4). The overall statistics for these merged crystals were better than any prior investigations on the same microscope using a CMOS22.
Ab initio phasing of larger proteins below atomic resolution is more challenging. The same approach was used as for lysozyme for phasing, namely automatically placing idealized helices and then extending the solution using density modification. Four 14 residue-long ideal alanine fragments were automatically placed using PHASER29. This solution was extended using multiple rounds of chain tracing and density modification32, similarly to the procedure implemented by ARCIMBOLDO_LITE32,33. The correct space group was determined by initially placing the fragments in all possible space groups for this point group, and then attempting to trace the best solutions. The solution in space group P 43 21 2 successfully built nearly the entire protein backbone. In the final round, this procedure traced 253 of 278 alanine residues (Supplementary Table 2, Supplementary Figure 5), and the map showed clear side-chain densities. The model was completed automatically19 from the backbone trace given only the sequence. The structure was similarly refined31 using electron scattering factors.
Ab initio phasing was successful for both lysozyme and proteinase K following a similar approach and using electron counted data. The lysozyme structure was determined from an idealized alanine helical fragment of only 15 atoms, corresponding to less than 1.5% of the structure. The maps calculated from only this small fragment showed density only around the placed atoms and uninterpretable noise elsewhere (Figure 1E) —not even the correct sidechains for the three placed residues could be reasonably modeled at this stage. However, the entire protein structure along with solvent ions and water molecules were individually resolved after density modification (Figure 2). The proteinase K structure was determined using four idealized helices, each made of fourteen alanine residues, or 280 atoms (Materials and Methods, Supplementary Figure 5), corresponding to about 13% of the structure. This solution also required density modification after placing the initial fragments. Its lower resolution and larger size meant that this structure required multiple rounds of chain tracing and density modification but was also determined ab initio using electron counting data. This procedure was repeatedly attempted using prior non-counting datasets but without success. The improved data quality from the electron counted data was critical to ab initio phasing macromolecular crystals using MicroED data.
This study demonstrated that MicroED data can be collected using a Falcon 4 direct electron detector in counting mode. Previous attempts at collecting MicroED data using counting on a Falcon 3 direct electron detector failed because the Falcon 3 detector is slower (40Hz) and had a much smaller linear range of only about 1 electron per pixel per second16. Therefore, the Falcon 3 was previously used in integrating (or linear) mode where it operates similarly to other CMOS-based detectors. In the current study, the Falcon 4 direct electron detector was used, which is capable of recording images at 250Hz. Combining the faster frame rate with an ultra-low dose spread over a long exposure meant that single electron events could be counted and the recorded intensities accurately integrated.
The data presented here were collected from crystal lamellae that were milled to an ideal thickness to match the mean free path of the microscope that was used31. Milling crystals can either be used to make large crystals into smaller ones11,21,34,35 to remove material that embeds small crystals and prevents accessing them for MicroED investigation, such as membrane proteins grown in viscous bicelles23 or LCP36. Regardless of the reason, milling crystals into an ideal thickness is good practice and recommended for extracting the most accurate data22. The lysozyme crystals milled in the current study diffract to sub-Ångström resolutions even after being milled by a gallium ion beam. This suggests that milling crystals does not compromise the sample or result in lower resolution. Indeed, the viability of collecting sub-Ångström data from crystalline lamellae further solidifies ion-beam milling as the preferred approach for preparing macromolecular crystals for MicroED experiments, since the crystals and surrounding material can be shaped and thinned to ideal thicknesses that match the inelastic mean free path for any accelerating voltage22. Importantly, at a mere 300nm, these lamellae matched the mean free path of a 300kV electron minimizing the probability of multiple scattering events and likely contributing the greater accuracy in the data.
The results presented here open the door for future investigations of phasing macromolecular protein crystals at or beyond near-atomic resolutions using MicroED data. Attempting to apply direct methods as implemented in e.g. Shake N Bake37,38 or SHELX39,40 are exciting avenues for future investigations since not all proteins will have helices to place for the starting phases. Further developments in ab initio phasing of MicroED data could be used to both automate the process and extend the necessary resolution to cover more challenging structures, such as membrane proteins that do not routinely reach resolutions better than about 3 Å. Regardless, the ability to probe membrane protein structure by MicroED is advantageous as electrons can probe the charge properties in the sample41. The quality of MicroED data obtained by counting would likely be further improved with the addition of an energy filter since this addition made large differences in the quality previously obtained on integrating detectors17,41–43. Given the importance of phase improvement in traditional X-ray crystallography44, the successful application of density modification algorithms compatible with MicroED data will be of critical importance for solving structures without known homologs or very difficult structures at lower resolutions.
Materials and Methods
Materials
Hen egg-white lysozyme (Gallus gallus) and Proteinase K (Engyodontium album) were purchased from Sigma-Aldrich (St. Louis, MO) and used without further purification. Sodium acetate pH 4.5, MES-NaOH pH 6.5, calcium chloride, and sodium nitrate stock solutions were used directly from Hampton (Viejo, CA) crystallization kits and diluted using milli-Q water as needed. The magic triangle kit was purchased from Hampton (Viejo, CA).
Crystallization of triclinic lysozyme
Crystals were prepared similarly to the protocol originally detailed by Legrand et al.45, and then subsequently described by Heijna et al.46. Hen egg-white lysozyme was dissolved in 0.2 M NaNO3 0.05 M sodium acetate pH 4.5 to a concentration of 10 mg / mL. A total volume of approximately 0.5 mL was prepared in a cold room and vortexed at the maximum setting for approximately 1 minute immediately after mixing. The tube containing this mixture was left at 4° C overnight. The next morning the tube was observed to be nearly filled with an opaque, white suspension. The sample was removed from the cold room, sealed with parafilm, and left on a benchtop at room temperature (approximately 23° C) for one week. After this time, very large, clear crystals accumulated on the bottom of the tube, and the rest of the liquid appeared transparent. Small (1 μL) aliquots from the center of the liquid that appeared clear by eye were found to contain a slurry of small, irregularly shaped crystals when viewed under a light microscope.
Crystallization of proteinase K
Proteinase K was crystallized as described47. Protein powder was dissolved at a concentration of 40 mg / mL in 20 mM MES–NaOH pH 6.5. Crystals were formed by mixing a 1:1 ratio of protein solution and a precipitant solution composed of 0.5 M NaNO3, 0.1 M CaCl2, 0.1 M MES–NaOH pH 6.5 in a cold room at 4 °C. Microcrystals grew overnight.
Grid preparation
Quantifoil Cu 200 R 2/2 holey carbon TEM grids were glow-discharged for 30 s at 15 mA on the negative setting immediately before use. Grids were loaded into a Leica GP2 vitrification robot. The robot sample chamber was loaded with filter paper and set to 4 °C and 95% humidity for 1 h before use. 3 μL of protein crystals from the center of either the proteinase K or lysozyme tubes were applied to the carbon side of the glow-discharged grid and allowed to incubate for 30 s. Grids were then gently blotted from the back for 20 s. For lysozyme, the grids were then immediately plunged into super-cooled liquid ethane. For proteinase K, 3 μL of 0.25M I3C 0.5 M NaNO3, 0.1 M CaCl2, 0.1 M MES–NaOH pH 6.5 was added as described48. The proteinase K grids were blotted once more from the back for 20 s and then immediately plunged into liquid ethane. Vitrified grids were stored at liquid nitrogen temperature before further experiments.
Focused ion-beam and scanning electron microscopy
The vitrified grids were inserted into autogrid clips at liquid nitrogen temperature. After clipping, the grids were loaded into a cryo-transfer shuttle and inserted into a Thermo-Fisher Aquilos dual-beam FIB/SEM operating at liquid nitrogen temperatures. The samples were covered with first fine, then rough, coats of sputtered platinum immediately after loading. An additional layer of protective platinum was added using the gas injection system (GIS). GIS platinum coating was conducted in the mapping position at a working distance of 12 mm. In this way, an approximately 1 μm thick platinum layer was slowly deposited over 30 s and continually monitored using the electron beam.
Whole grid montages of each grid were taken at low magnification using the MAPS software (Thermo-Fisher). Crystals on the vitrified grids were identified in the FIB/SEM such that each crystal was not within 5 μm of a grid bar, not within three grid squares of the edge, and not within 25 μm of another selected crystal (Figure 1A, Supplementary Figure 2A). Twenty such crystals across two grids were prepared over two days for lysozyme and five proteinase K crystals on one grid were prepared in one day. Identified crystals were brought to the eucentric position and inspected in both the electron and ion beams. Milling was conducted as described12. Briefly, each crystal was roughly milled to approximately 3 μm thick and 5–10 μm wide using an ion-beam current of 300 pA and the standard rectangular milling patterns. Each crystal was then finely milled to a thickness of approximately 500 nm thick and 5–10 μm wide using an ion-beam current of 100 pA. Finally, each lamella was polished using an ion-beam current of 10 pA to a thickness of approximately 300 nm and width of 3–8 μm using a cleaning cross-section. This thickness was found to be optimum for experiments at this accelerating voltage22. Each of these steps is typically conducted in 1 – 5 min intervals using rectangular patterns with at most a 1 μm height dimension to reduce the influence of sample drift. The sample is imaged in the ion beam using a current of 1.5 pA between milling steps to realign as needed and to assess the quality and thickness of the finished lamella.
Setting up the Falcon 4 for diffraction experiments
The dose protector that prevents the operation of this detector while in diffraction mode was disabled by a service engineer before experiments. The Falcon 4 direct electron detector internally operates at 250 frames per second, but every 32nd frame window is used to reset the detector and does not result in useful data. Owing to bandwidth limitations, the camera furthermore accumulates at least seven raw frames before transmitting their summed image to the controlling computer, which means that the user can obtain no more than ∼35 images per second. For the data presented here, the frames were internally summed to correspond to 0.5 s wedges (119 raw frames; Supplementary Figure 1).
The damage threshold for this system is described by a deterioration of the detector quantum efficiency (DQE) due to radiation damage over time. Specifically, the DQE for each pixel will decrease by 10% after a total exposure of 1.5×109 electrons. Ultimately, the smaller C2 aperture of 50 μm was chosen, with a spot size of 11, and a beam diameter of 20 or 25 μm which corresponded to a total exposure of ∼0.64 – 1.0 e- Å-2. The typical counts on the detector were within the linear region and a typical pixel experienced a smaller exposure than typical single-particle movies (Supplementary Figure 1). None of our measurements fell below a DQE of 0.6.
MicroED data collection
Grids containing milled protein crystals were rotated such that the TEM rotation axis was 90° from the FIB/SEM’s milling axis, and then loaded into a cryogenically cooled Thermo-Fisher Titan Krios 3Gi transmission electron microscope operating at an accelerating voltage of 300 kV. Low magnification montages of each grid were collected at a magnification of 64 × and used to locate the milled lamellae on each grid. Each lamella was brought to its eucentric position before data collection. MicroED data were collected by continuously rotating the stage at a rate of approximately 0.15° / s or 0.2° / s for 420 s, covering a total rotation range of approximately 63° or 84°, respectively. This typically spanned the real space wedge corresponding to -31.5° to +31.5°, or -42° to +42°. Data were collected using a 50 μm C2 aperture, a spot size of 11, and a beam diameter of 20 or 25 μm. Under these conditions, the total exposure per dataset was approximately 1.0 e- Å-2 or 0.64 e- Å-2, respectively. Diffraction data were isolated from a small area from the middle of each lamella of approximately 2 μm in diameter at the specimen level using the 100 μm selected area aperture to remove unwanted background noise. All data were collected using twofold binning and internally summed such that each image records a 0.5 s exposure spanning approximately 0.075– 0.1° of rotation. In this way, each image stack contained 840 images, the last of which was discarded. A single sweep of continuous rotation MicroED data was collected from each lysozyme lamella. For proteinase K, two sweeps were collected—a high-resolution dataset at a nominal camera distance of 960 mm, and then a subsequent low-resolution dataset collected identically, but at the longest possible nominal camera distance of 4300 mm. The post-column magnification on this system is 1.81×. The low-resolution pass was conducted after the high-resolution pass covering a resolution range from approximately 60–6 Å.
MicroED data processing
Movies in MRC format were converted to SMV format using a parallelized version of the MicroED tools12 (https://cryoem.ucla.edu/downloads). Each dataset was indexed and integrated using XDS. All datasets were scaled using XSCALE27 and xscale_isocluster49. For both crystals, the space group was verified using POINTLESS 50. For lysozyme, the data were merged without scaling using AIMLESS 50, the subsequent intensities were converted to amplitudes in CTRUNCATE 26, normalized structure factors were calculated using ECALC 26, and a 5% fraction of the reflections assigned to a free set using FREERFLAG software packages distributed in the CCP4 program suite26.
Phasing
The lysozyme data could be phased using either FRAGON51, or PHASER29 followed by ACORN18,52. An initial phasing solution was achieved using example parameters from the ACORN documentation (http://legacy.ccp4.ac.uk/html/acorn.html#example9). Here, a small fragment of idealized alpha helix is used for molecular replacement, and then the best solution is subjected to density modification. In this way, the number of atoms was systematically lowered in the idealized helix from the initial 50 and were able to achieve a phasing solution with as few as 15 total atoms. Using fragments smaller than 20 atoms required placement using PHASER rather than the internal ACORN procedure. The resolution limits were also tested using the same 50 atom fragment, and the structure was solved using the same procedure up to 1.15 Å resolution. The lysozyme model could also be solved in FRAGON starting from a penta-alanine fragment, the smallest fragment allowed in the CCP4I2 interface53.
Additional tests were set up to explore the limits of our solution under different circumstances. First, a 10 alanine (50 atoms) helix was used for molecular replacement at resolutions worse than 0.87 Å. A single helix could be placed with high accuracy to at least 1.5 Å resolution. Using this solution as the initial phases for the ACORN density modification procedure using the data to 1.0 Å resulted in a map nearly identical to the one that had the helix placed using the entire resolution range. However, individual atoms were no longer resolved after density modification with data worse than 1.15 Å resolution. No resolution extension features were used in any runs of ACORN.
For proteinase K, ideal helices were placed using PHASER29. Four copies of the idealized 14 amino acid alanine helix were placed in space groups #89 - #96. SHELXE was run by using the merged intensities using the MTZ2HKL utility. Surprisingly, using the unmerged intensities directly from XSCALE did not result in a successful trace. Of the attempts of SHELXE from all the space groups, only #96 (P 43 21 2) resulted in a well-traced structure with clear sidechain density. From here, 1, 2, and 3 copies of 14 long alpha-helices were placed in space group #96. This was attempted again using 10 amino acid long helices, where only the search for 4 copies resulted in a convincing solution. SHELXE was run with or without the “-q” flag to first search for helix shapes during the chain tracing. The first solution from 4 helices in space group #96 used the SHELXE command line: “shelxe 1.pda –s0.4 –a30.” Here, the default 10 rounds of density modification are followed by standard chain tracing. This is repeated 30 times. Among these trials, only the four placed 14-long helices gave an obvious solution after chain tracing. However, the solution in space group #96 with 3 helices placed was also able to give a similar solution upon adding the –o option to trim away the low CC amino acids, -q to search for helical shapes, and increasing the chain tracing rounds to 50 or 100.
For both lysozyme and proteinase K after the last round of density modification, the protein was built automatically by BUCCANEER19. For lysozyme, the entire protein was built into the map produced by ACORN18 except for two terminal residues that were not resolved upon inspection of the map in Coot54. For proteinase K, the traced backbone from SHELXE was used as a starting fragment for BUCCANEER. In both cases, electron scattering factors were used for the maps. The built structures were refined in REFMAC31 using electron scattering factors calculated by the Mott-Bethe formula. Initial refinements used isotropic atomic displacement (B) factors for individual atoms and waters were added automatically. Refinement was always followed by manual curation of the model using Coot. For lysozyme, NO3 ions were found in multiple locations that were not adequately modeled by single water molecules. For proteinase K, two I3C molecules with low occupancies were identified after the structure was entirely built and placed manually in Coot using the fragment code, I3C. Hydrogen atoms were added to the lysozyme model in their riding positions for the final rounds of refinement. Once the model was completely built, the models were refined again using anisotropic atomic displacement parameters for all but the hydrogen atoms.
Figure and table legends
Supplementary Movie 1. A rotation movie showing the normalized structure factor maps after density modification over the final structure of lysozyme at 0.87 Å resolution. The movie is sliced to a width of 8 Å.
Supplementary Movie 2. A rotation movie showing the normalized structure factor maps after density modification over the final structure showing a region rich with hydrophobic packing interactions. Tryptophan, isoleucine, tryptophan sandwich are shown with several other tryptophan residues flanking this region. The movie is sliced to a width of 8 Å.
Figure preparation
Figures were prepared using ChimeraX and Pymol, then arranged in PowerPoint. Tables were arranged in Excel. Images were cropped around areas of interest, and the brightness and contrast were adjusted in FIJI55.
Software availability
Software tools that convert summed or unsummed MRC stacks to Super Marty View or TIFF are available at https://cryoem.ucla.edu/MicroED.
Funding information
This study was supported by the National Institutes of Health P41GM136508. The Gonen laboratory is supported by funds from the Howard Hughes Medical Institute.
Acknowledgments
The authors would like to thank Drs. Lingbo Yu and Bart Buijsse (Thermo-Fisher) for information and discussions about the Falcon 4 and counting mode. Coordinates for triclinic lysozyme and proteinase K structures have been deposited to the PDB, and maps have been deposited to the EMDB.