Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Protein-protein docking with large-scale backbone flexibility

View ORCID ProfileMateusz Kurcinski, View ORCID ProfileSebastian Kmiecik, View ORCID ProfileMateusz Zalewski, View ORCID ProfileAndrzej Kolinski
doi: https://doi.org/10.1101/2021.02.22.432196
Mateusz Kurcinski
Faculty of Chemistry, Biological and Chemical Research Centre, University of Warsaw, Warsaw, Poland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Mateusz Kurcinski
Sebastian Kmiecik
Faculty of Chemistry, Biological and Chemical Research Centre, University of Warsaw, Warsaw, Poland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Sebastian Kmiecik
  • For correspondence: sebastian.kmiecik@gmail.com
Mateusz Zalewski
Faculty of Chemistry, Biological and Chemical Research Centre, University of Warsaw, Warsaw, Poland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Mateusz Zalewski
Andrzej Kolinski
Faculty of Chemistry, Biological and Chemical Research Centre, University of Warsaw, Warsaw, Poland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Andrzej Kolinski
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

Structure prediction of protein-protein complexes is one of the most critical challenges in computational structural biology. It is often difficult to predict the complex structure, even for relatively rigid proteins. Modeling significant structural flexibility in protein docking remains an unsolved problem. This work demonstrates a protein-protein docking protocol with enhanced sampling that accounts for large-scale backbone flexibility. The docking protocol starts from unbound x-ray structures and is not using any binding site information. In docking, one protein partner undergoes multiple fold rearrangements, rotations, and translations during docking simulations, while the other protein exhibits small backbone fluctuations. Including significant backbone flexibility during the search for the binding site has been made possible using the CABS coarse-grained protein model and Replica Exchange Monte Carlo dynamics. In our simulations, we obtained acceptable quality models for the set of 12 protein-protein complexes, while for selected cases, models were close to high accuracy.

Introduction

Protein-protein interactions are fundamental in many biological processes. Their structural characterization is one of the biggest challenges of computational biology. A variety of docking methods are currently available for structure prediction of protein-protein complexes [1, 2]. They can be divided into free (global) and template-based docking. Free (global) docking methods are designed to generate many distinct binding configurations. Template-based methods restrict docking to a binding mode found in a structural template. As demonstrated in the blind docking challenge, Critical Assessment of PRediction of Interactions (CAPRI), template-based methods generate more accurate results but only if a good quality template exists [1–5]. In some cases lacking useful templates, free global docking can yield acceptable results. According to recent estimates, the best free docking methods find adequate models among the top 10 predictions for around 40% of the targets [1]. The CAPRI analysis also indicates that protein backbone flexibility is a big challenge; protein complexes that undergo substantial conformational changes upon docking get no successful predictions from any method [3–5].

Presently, most of the free docking methods treat the backbone of input protein structures as rigid. This approximation reduces the protein-protein docking problem to a 6D (three rotational and three translational degrees of freedom) search space. Rigid-body search for the binding site most often rely on the Fast Fourier Transform [6–8]. Other successful approaches include 3D Zernike descriptor-based docking [9, 10] or geometric hashing [11]. These rigid-body methods are often used as a first docking step, followed by scoring [12–16], using experimental data [17] and/or structural refinement to capture backbone flexibility [5, 18]. Molecular Dynamics is perhaps the most common refinement strategy, either in classic or enhanced sampling versions [17, 19–21]. Other tools use rotamer libraries to address side-chain flexibility [22] and Elastic Network Models (ENM) for modeling backbone rearrangements [23–27]. Accounting for backbone flexibility in the search for the binding site significantly increases the docking complexity and makes it practically intractable using conventional all-atom modeling approaches. This enormous computational complexity of flexible docking can be reduced using coarse-grained protein models [28–31]. The best-performing methods that can now include backbone flexibility during the docking calculations use coarse-grained models and/or ENM-driven simulations. These include RosettaDock combining coarse-grained generation of backbone ensembles and all-atom refinement [32–34]; ATTRACT combining coarse-grained docking with ENM and all-atom refinement [35, 36]; and SwarmDock using all-atom ENM [24, 37]. All these approaches show some advantages in modeling protein flexibility compared to rigid-body docking followed by structure refinements. However, effective modeling flexibility in protein-protein docking remains an unsolved problem, as demonstrated in the recent CAPRI round [24, 34, 36, 38].

In this work, we use a well-established CABS coarse-grained protein model [28] for protein-protein docking. During the CABS docking simulation, one of the docking partners undergoes a long random process of rotations, translations, and extensive backbone conformational rearrangements that significantly modify its fold. Simultaneously, the backbone of the second protein undergoes small fluctuations.

Methods

Docking simulation protocol

In this work, we present the protein-protein docking simulation protocol that relies on the CABS coarse-grained model. The CABS design and applications have been recently described in the reviews on protein coarse-grained [28] and protein flexibility [23, 39] modeling. In brief, CABS uses a coarse-grained representation of protein chains (see Figure 1), Replica Exchange Monte Carlo (REMC) dynamics, and knowledge-based statistical potentials.

Figure 1.
  • Download figure
  • Open in new tab
Figure 1.

Comparison of the all-atom (left) and the CABS coarse-grained model representation (right) for an example tripeptide. In the CABS model, protein residues are represented using C-alpha, C-beta, united side-chain atom, and the peptide bond center [28].

The modeling protocol consists of the following steps:

  1. Generating starting structures. The protocol requires two protein structures (single- or multi-chain) in the PDB format as the input. One of them has to be treated as the ligand and the second as the receptor (the ligand would be subjected to large conformational fluctuations, translations and rotations around the receptor). Starting conformations are built using C-alpha coordinates only (in the CABS model C-alpha traces define position of other united pseudo-atoms). The algorithm places the protein-ligand center at 20 random positions around the protein receptor at the approximate distance of 20 Å from the protein receptor’s surface. Next, these protein-ligand systems are used as starting conformations for the 20 replicas in the REMC CABS sampling scheme (each replica starts from a different ligand-receptor arrangement).

  2. Docking simulations using CABS coarse-grained model and REMC dynamics. During simulations, the protein receptor structure is kept close to the starting structure using distance restraints. Distance restraints are generated using the input coordinates of the C-alpha atoms. Two residues are automatically restrained if two conditions are met. First, their separation along the sequence has to be at least five residues. Second, the distance between their C□alpha atoms must be within the range of 5–15 Å. During simulations, the receptor restraints imply small-scale fluctuations of the protein receptor backbone in the range of 1 Å and, accordingly, more significant fluctuations of the side-chain atoms. A similar restraints scheme is applied to the protein-ligand but with tenfold weaker weights. During simulations, the ligand moves freely within the vicinity of the receptor and internal restraint allows for large-scale fluctuations of its structure. Usually, the ligand fluctuations are within the range of 2 and 12 Å to the input structure although folding-unfolding events are possible at highest temperatures. The docking simulation is conducted using CABS REMC pseudo□dynamics with simulated annealing. In this work, 20 replicas and 20 annealing steps have been used. All the REMC scheme parameters have been adjusted to allow for large□scale conformational transitions, rotations, and translations of the protein-ligand in a reasonable computational time. The modeling protocol collects trajectories from all 20 replicas. The protocol saves only a small fraction (2%) of the generated models for further analysis i.e., 500 models from each replica, thus 10,000 models in total.

  3. Reconstruction of the CABS model representation. The set of 10,000 models stored as C-alpha traces are reconstructed to complete CABS model representation (C-alpha, C-beta and Side-Chain united atoms).

  4. Clustering of contact maps. First, for all of the 10’000 models the contact maps between the receptor and the ligand proteins are calculated. Two residues are considered to form a contact if their Side Chain pseudoatoms are at most 6 Å apart (for Alanines the C-beta atoms are considered as the Side Chain; for Glycines – it’s the C-alpha atoms). Next, the algorithm sorts the models according to the number of the receptor-ligand contacts, and the set of top 1000 is kept for further processing. This way the transient and weakly bound complexes are removed from the solutions pool. In the next step, the 1000 contact maps are clustered together to identify the most frequently occurring contact patterns. The complete link hierarchical clustering was used with the Jaccard index as the distance metric between contact maps. Finally, the identified clusters are ranked according to their density, defined as the number of the cluster members divided by the average metric between them.

  5. Reconstruction to all-atom representation. Representative models from the ten most dense clusters are reconstructed to all-atom representation using Modeller-based rebuilding procedure [40] (or can be reconstructed using other rebuilding strategies, see review [41]).

In recent years, the CABS model has been used for modeling the flexibility of globular proteins [42–45] and various processes leading to large-scale conformational transitions. Those included: ab initio simulations of protein folding mechanisms [46, 47], folding and binding mechanisms [48, 49], and free protein-peptide docking within the CABS-dock tool [50–56]. The CABS-dock is a well-established peptide docking tool that has been made available as a web server [50, 51] and, most recently, as a standalone application [53]. Its distinctive feature among other tools is the possibility of fast simulation of the large backbone rearrangements of both peptide and protein receptor during binding (see the review on protein-peptide docking tools [57]). The CABS-dock has been used in multiple applications (recently reviewed [55]), including docking to receptors with disordered fragments [39, 49], GPCRs [58], and modeling proteolysis mechanisms [59].

The presented protocol for protein-protein docking utilizes the CABS-dock standalone package [53] developed primarily for protein-peptide docking. In order to tackle the protein-protein docking problem, key changes have been made to the docking algorithm that aimed mainly at the improvement of the conformational sampling. First of all, the temperature distribution between replicas in the REMC scheme was adjusted. Instead of constant temperature increment between consecutive replicas, as in the original CABS-dock, here we’ve implemented progressive geometric raise of the temperature increment. Furthermore, the number of simulation replicas was increased to twenty versus ten in the original CABS-dock. Besides the sampling improvement, a new clustering protocol was introduced. The original CABS-dock used RMSD-based clustering, which worked well for peptides. For the protein-protein complexes, however, purely geometrical similarity condition such as the RMSD is too severe. Namely, for two binding poses, where the mobile protein was docked in the exact same pocket but is slightly tilted in one of them, the RMSD difference would be considerable. Despite representing similar binding poses, the two structures would end up in different clusters. To overcome this, the current protocol uses clustering based on the similarity between receptor-ligand contact maps.

Results analysis and quality metrics

The docking simulation analysis was performed using Python and NumPy (Python library). Structural differences between experimentally determined structures and generated models were evaluated using Root Mean Square Deviations (RMSDs). Interface RMSD (iRMSD) is an RMSD calculated for interface residues of the receptor and the ligand separated by no more than 6 Angstroms. Ligand RMSD (LRMSD) is an RMSD computed for the ligands after the superimposition of the receptors. Ligand only RMSD (LoRMSD) is an RMSD computed for the ligand structure only. Root Mean Square Fluctuation (RMSF) is a measure of the amino acid’s flexibility. It is calculated for every residue as the square root of this residue’s variance around the reference residue position. The fraction of native contacts (fNAT) was calculated as a number of experimental structure contacts found in the generated structure divided by the total number of contacts found in the experimental structure. Rather restrictive criterion of a contact, distance up to 6 Å between side chain centers, was used. All figures presented in this work were generated using PyMOL, UCSF Chimera, and Matplotlib (Python library).

Dataset

In this docking study, we used the benchmark set from Chen et al. [60]. The set comprises 12 test systems with available unbound X-ray structures of both the protein-receptor and the protein-ligand. The unbound structures were used as the docking input. As the reference for calculating various similarity measures, we used the X-ray structures of the protein-ligand complexes. Table 1 lists all the PDB IDs of X-ray structures used in the study.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 1.

Summary of the docking simulations. The table characterizes X-ray data used in the docking, average ligand flexibility during simulations (average LoRMSD from 10’000 models), and docking results. The table reports the best accuracy models out from all (10’000) and 10 top-scored models. The metrics definitions are provided in the Methods section.

Results

The CABS-based docking algorithm produced a significant number of near-native protein-protein arrangements of acceptable quality for most protein-protein cases (up to 4 Å using interface RMSD (iRMSD) values, according to CAPRI criteria). The Figure 2 characterizes iRMSD versus CABS model energy values for the two modeling cases (the remaining cases are presented in Table S1). The most accurate models for the entire test set have been characterized in Table 1.

Figure 2.
  • Download figure
  • Open in new tab
Figure 2.

Characterization of docking results using RMSD to the x-ray structure and system energy. The left panels show the interface-RMSD versus CABS energy values. Point color represents the temperature – from yellow(high) to pink(low). The molecular visualizations show x-ray structures and ensembles of predicted models corresponding to selected energy minima (numbered in the picture from 1 to 3). As presented in the picture, the minima numbered as 1st corresponds to near-native protein-protein arrangements, others to non-native ensembles, as presented in the picture. The presented ensembles are the sets of similar models found in the structural clustering of contact maps (see Methods). The figure shows two modeling cases: 1BRS and 2SNI (see Table x). The remaining cases from the modeling set are presented in Figure S1.

This paragraph focuses on system dynamics’ characterization using barnase/barstar complex (1BRS, also shown in the upper panel of Figure 2). Figure 3 shows example ensembles of barnase/barstar models and the most accurate model (iRMSD 1.9 Å). A single system replica can explore an ample conformational space that involves significantly different binding configurations and protein-ligand conformations, as demonstrated in Figure 3c and Movie S1. Figure 4a further characterizes this single replica’s using iRMSD and LoRMSD (RMSD for ligand only) values. As presented in the figure, the ligand structure fluctuates around 5 Å (the same fluctuations in the context of all replicas are shown in Figure 4b). The ligand becomes significantly more closer to the X-ray structure after binding to the native binding site as reflected by iRMSD values. Namely, after correct binding, LoRMSD values get noticeably lower to around 2 Å (see Figure 4a). In the following sections, we do not discuss this aspect of our method; however, it is worth mentioning that the proposed method enables a detailed analysis of plausible docking trajectories. The described docking procedure uses REMC protocol enhanced by simulated annealing of all 20 replicas. Figure 4c shows their evolution through different temperatures. Figures 5 provide more detailed pictures of structural flexibility for protein “receptor” and “ligand”. Protein-protein contacts defining the complex assembly are characterized in Figure 6. In the presented example, the most persistent protein-protein contacts occur in about 15% of snapshots. Therefore, they are significantly less stable compared to intramolecular protein contacts (Figure 5).

Figure 3.
  • Download figure
  • Open in new tab
Figure 3.

Protein-protein docking stages illustrated by barstar/barnase docking case. The figure shows the barnase receptor in magenta and the barstar ligand in rainbow colors. The respective panels show: (a) 20 starting structures for each replica of the system; (b) 10’000 models combined from 20 replicas (500 models per replica) in which the highly flexible ligand is covering the entire surface of the flexible receptor; (c) 500 models from 1 replica only, (d) the best model obtained for barnase/barstar system (the x-ray structure of the ligand is shown in thick ribbon, the modeled in thin ribbon).

Figure 4.
  • Download figure
  • Open in new tab
Figure 4.

Docking trajectory for the selected replica of barnase/barstar system. The presented replica reached the most accurate barnase/barstar complex structure. (a) iRMSD (interface RMSD) and LoRMSD (ligand only RMSD) values. Example simulation snapshots illustrate the plot. The ligand is presented in rainbow colors, the receptor in magenta. The lowest iRMSD model (1.9 A from x-ray structure) is presented on the right lower corner superimposed on the x-ray structure (the x-ray structure is shown in thick lines, the predicted model in thin lines). (b) ligand only RMSD (LoRMSD) values for all replicas. The thick red line presents selected replica. (c) exchange of system replicas between different temperatures driven by Replica Exchange Monte Carlo (REMC) system. The thick red line presents selected replica. The replica trajectory is also presented in the Movie S1.

Figure 5.
  • Download figure
  • Open in new tab
Figure 5.

Characterization of barnase/barstar flexibility in the docking simulation. The figure shows RMSF plots (upper panels) and contact maps (lower panels) for (a) the barnase receptor and (b) the barstar ligand. The RMSF profile (see Methods) and contact maps showing the frequency of contacts are derived from the entire simulation (derived from 10’000 models).

Figure 6.
  • Download figure
  • Open in new tab
Figure 6.

Characterization of barnase/barstar contacts. Panels show barnase/barstar models and contact maps for: (a) entire simulation (all, 10’000 models), (b) single selected replica (replica #6, 500 models) that reached a near-native arrangement. In the maps, green circles mark the native contacts.

An essential and unique feature of the presented docking simulations is the level of backbone flexibility during docking. In the example above, the ligand backbone fluctuations (LoRMSD) are in the range of 2-7 Å (Figure 4b), with the average LoRMSD value of 3.3 Å from the entire docking simulations. In other cases, the ligand fluctuations are at a similar level or higher (the average LoRMSD varies from 3.3 to 5.2 Å depending on the protein-protein case, see Table 1).

Finally, using structural clustering of contact maps (see Methods), we attempted to select the set of 10 top-scored models for each case. Table 1 reports the most accurate models out of the 10 top-scored.

Discussion

The progress in predicting protein assemblies’ structures is expected through closer integration of free docking approaches with template-based modeling techniques, refinement, scoring, and significant improvements in the underlying methodologies [2, 5]. This work demonstrates a significant improvement in the sampling of large-scale conformational transitions during global protein-protein docking. We show that modeling the large conformational changes is possible at a relatively low computational cost. The presented simulations took between 10 and 80 hours (depending on the system size) using a single standard CPU. The proposed modeling protocol can be used as the docking engine in template-based and integrative docking protocols using experimental structural data and additional information from various sources. We focused on the free docking of protein ligands with a highly flexible backbone in the present test simulations. Using unbound structures as the input, we produced acceptable accuracy models (iRMSD around 4 Å or lower) in all 12 cases. However, the selection procedure of the most accurate models needs further improvement. Namely, selecting the best ranked models led to acceptable models in about half of the tested cases.

Presently, the most common approach to account for conformational changes in protein docking is using ENM [23–27, 35–37]. The applicability of ENM to modeling protein flexibility is limited to specific systems and depends on how collective the protein motions are. Our method presents a conceptually different approach that seems to be more realistic (see review discussing coarse-grained CABS dynamics in the context of ENM approaches [23]). We demonstrated that it is possible to simulate effectively free docking of highly flexible protein ligands to quite elastic protein receptor structures. Such a significant degree of flexibility was achieved using a highly efficient simulation engine based on the coarse-grained representation of protein structures, Monte Carlo dynamics, and knowledge-based force field. CABS coarse-graining, enhanced by the discretized protein model and interaction patterns, significantly reduces the search space. Monte Carlo dynamics, enhanced by Replica Exchange annealing, leads to huge speed-up of the search procedures. Also a significant (although acceptable for many problems) flattening of energy surfaces by statistical potentials of CABS model simplifies simulations. As a result the flexible docking using CABS-dock is orders of magnitude faster than equivalent simulations based on classical modeling methods. Obviously the new method has also several limitations that have to be taken into consideration when designing new computational experiments. First, since the “ligand” protein is treated as a very elastic object (what is necessary to guarantee efficient search of the binding sites and poses) the cost of computations rapidly grows with the protein size. Thus, completely free global docking of protein ligands larger than in the present study (i.e. above 130 residues) may be impractical. Second, the coarse graining of the sampling space and simplification of interaction patterns (so important for the huge acceleration of the simulations) makes the docking energetics less sensitive. For these reasons the clustering procedures, refinement of the resulting structures and final model selection becomes very challenging and needs further developments. Also speeding-up the entire protocol can be useful. We estimate that the simulations could be easily speeded-up at least 10 times or more through algorithm parallelization. The speed-up would enable making the protocol available as the publicly accessible and automated web service.

In summary, we presented a proof of concept of free global docking with significant backbone flexibility. We believe that this work’s result initiates a new and promising approach to the protein-protein docking problem. The most encouraging future development area includes using the developed methodology in template-based and integrative docking guided by partial template information and experimental data [2, 61].

Acknowledgments

SK, MZ and AK acknowledge support by the National Science Centre (NCN), Poland Grant [MAESTRO2014/14/A/ST6/00088]. MK acknowledges support by the National Science Centre (NCN), Poland Grant [501/D112/66 GR-6271].

Footnotes

  • ↵+ to whom correspondence should be addressed

References

  1. 1.↵
    Porter KA, Desta I, Kozakov D, Vajda S (2019) What method to use for protein– protein docking? Curr Opin Struct Biol 55:1–7. https://doi.org/10.1016/j.sbi.2018.12.010
    OpenUrl
  2. 2.↵
    Rosell M, Fernández-Recio J (2020) Docking approaches for modeling multi-molecular assemblies. Curr Opin Struct Biol 64:59–65. https://doi.org/10.1016/j.sbi.2020.05.016
    OpenUrl
  3. 3.↵
    Lensink MF, Velankar S, Kryshtafovych A, et al. (2016) Prediction of homoprotein and heteroprotein complexes by protein docking and templateLbased modeling: A CASPLCAPRI experiment. Proteins Struct Funct Bioinforma 84:323–348. https://doi.org/10.1002/prot.25007
    OpenUrl
  4. 4.
    Lensink MF, Velankar S, Wodak SJ (2017) Modeling protein–protein and protein– peptide complexes: CAPRI 6th edition. Proteins Struct Funct Bioinforma 85:359–377. https://doi.org/10.1002/prot.25215
    OpenUrl
  5. 5.↵
    Lensink MF, Nadzirin N, Velankar S, Wodak SJ (2020) Modeling protein□protein, protein□peptide, and protein□oligosaccharide complexes: CAPRI 7th edition. Proteins Struct Funct Bioinforma 88:916–938. https://doi.org/10.1002/prot.25870
    OpenUrl
  6. 6.↵
    Pierce BG, Wiehe K, Hwang H, et al. (2014) ZDOCK server: interactive docking prediction of protein-protein complexes and symmetric multimers. Bioinformatics 30:1771–1773. https://doi.org/10.1093/bioinformatics/btu097
    OpenUrlCrossRefPubMedWeb of Science
  7. 7.
    Kozakov D, Hall DR, Xia B, et al. (2017) The ClusPro web server for protein–protein docking. Nat Protoc 12:255–278. https://doi.org/10.1038/nprot.2016.169
    OpenUrlCrossRefPubMed
  8. 8.↵
    Yan Y, Tao H, He J, Huang S-Y (2020) The HDOCK server for integrated protein– protein docking. Nat Protoc 15:1829–1852. https://doi.org/10.1038/s41596-020-0312-x
    OpenUrl
  9. 9.↵
    Venkatraman V, Yang YD, Sael L, Kihara D (2009) Protein-protein docking using region-based 3D Zernike descriptors. BMC Bioinformatics 10:407. https://doi.org/10.1186/1471-2105-10-407
    OpenUrlCrossRefPubMed
  10. 10.↵
    Christoffer C, Terashi G, Shin W, et al. (2020) Performance and enhancement of the LZerD protein assembly pipeline in CAPRI 38□46. Proteins Struct Funct Bioinforma 88:948–961. https://doi.org/10.1002/prot.25850
    OpenUrl
  11. 11.↵
    Estrin M, Wolfson HJ (2017) SnapDock—template-based docking by Geometric Hashing. Bioinformatics 33:i30–i36. https://doi.org/10.1093/bioinformatics/btx233
    OpenUrl
  12. 12.↵
    Gromiha MM, Yugandhar K, Jemimah S (2017) Protein–protein interactions: scoring schemes and binding affinity. Curr Opin Struct Biol 44:31–38. https://doi.org/10.1016/j.sbi.2016.10.016
    OpenUrl
  13. 13.
    Feng T, Chen F, Kang Y, et al. (2017) HawkRank: a new scoring function for protein– protein docking based on weighted energy terms. J Cheminform 9:66. https://doi.org/10.1186/s13321-017-0254-7
    OpenUrl
  14. 14.
    Geng C, Jung Y, Renaud N, et al. (2020) iScore: a novel graph kernel-based function for scoring protein–protein docking models. Bioinformatics 36:112–121. https://doi.org/10.1093/bioinformatics/btz496
    OpenUrl
  15. 15.
    . Yan Y, Huang S-Y (2019) Pushing the accuracy limit of shape complementarity for protein-protein docking. BMC Bioinformatics 20:696. https://doi.org/10.1186/s12859-019-3270-y
    OpenUrl
  16. 16.↵
    Siebenmorgen T, Zacharias M (2019) Evaluation of Predicted Protein–Protein Complexes by Binding Free Energy Simulations. J Chem Theory Comput 15:2071– 2086. https://doi.org/10.1021/acs.jctc.8b01022
    OpenUrl
  17. 17.↵
    GCP van Zundert, JPGLM Rodrigues, Trellet M, et al. (2016) The HADDOCK2.2 Web Server: User-Friendly Integrative Modeling of Biomolecular Complexes. J Mol Biol 428:720–725. https://doi.org/10.1016/j.jmb.2015.09.014
    OpenUrlCrossRefPubMed
  18. 18.↵
    Lensink MF, Brysbaert G, Nadzirin N, et al. (2019) Blind prediction of homo□ and heteroLprotein complexes: The CASP13□CAPRI experiment. Proteins Struct Funct Bioinforma 87:1200–1221. https://doi.org/10.1002/prot.25838
    OpenUrl
  19. 19.↵
    Harmalkar A, Gray JJ (2021) Advances to tackle backbone flexibility in protein docking. Curr Opin Struct Biol 67:178–186. https://doi.org/10.1016/j.sbi.2020.11.011
    OpenUrl
  20. 20.
    Siebenmorgen T, Engelhard M, Zacharias M (2020) Prediction of protein–protein complexes using replica exchange with repulsive scaling. J Comput Chem 41:1436– 1447. https://doi.org/10.1002/jcc.26187
    OpenUrl
  21. 21.↵
    Park T, Woo H, Baek M, et al. (2020) Structure prediction of biological assemblies using GALAXY in CAPRI rounds 38□45. Proteins Struct Funct Bioinforma 88:1009– 1017. https://doi.org/10.1002/prot.25859
    OpenUrl
  22. 22.↵
    Peterson LX, Kang X, Kihara D (2014) Assessment of protein side-chain conformation prediction methods in different residue environments. Proteins Struct Funct Bioinforma 82:1971–1984. https://doi.org/10.1002/prot.24552
    OpenUrl
  23. 23.↵
    Kmiecik S, Kouza M, Badaczewska-Dawid A, et al. (2018) Modeling of Protein Structural Flexibility and Large-Scale Dynamics: Coarse-Grained Simulations and Elastic Network Models. Int J Mol Sci 19:3496. https://doi.org/10.3390/ijms19113496
    OpenUrlCrossRef
  24. 24.↵
    Torchala M, Gerguri T, Chaleil RAG, et al. (2020) Enhanced sampling of protein conformational states for dynamic cross□docking within the protein□protein docking server SwarmDock. Proteins Struct Funct Bioinforma 88:962–972. https://doi.org/10.1002/prot.25851
    OpenUrl
  25. 25.
    . Jiménez-García B, Roel-Touris J, Romero-Durana M, et al. (2018) LightDock: a new multi-scale approach to protein–protein docking. Bioinformatics 34:49–55. https://doi.org/10.1093/bioinformatics/btx555
    OpenUrl
  26. 26.
    Kurkcuoglu Z, AMJJ Bonvin (2020) PreL and postLdocking sampling of conformational changes using ClustENM and HADDOCK for protein□protein and protein□DNA systems. Proteins Struct Funct Bioinforma 88:292–306. https://doi.org/10.1002/prot.25802
    OpenUrl
  27. 27.↵
    Schindler CEM, de Vries SJ, Zacharias M (2015) iATTRACT: Simultaneous global and local interface optimization for protein-protein docking refinement. Proteins Struct Funct Bioinforma 83:248–258. https://doi.org/10.1002/prot.24728
    OpenUrl
  28. 28.↵
    Kmiecik S, Gront D, Kolinski M, et al. (2016) Coarse-Grained Protein Models and Their Applications. Chem Rev 116:7898–7936. https://doi.org/10.1021/acs.chemrev.6b00163
    OpenUrlCrossRefPubMed
  29. 29.
    Baaden M, Marrink SJ (2013) Coarse-grain modelling of protein–protein interactions. Curr Opin Struct Biol 23:878–886. https://doi.org/10.1016/j.sbi.2013.09.004
    OpenUrlCrossRefPubMed
  30. 30.
    Roel-Touris J, AMJJ Bonvin (2020) Coarse-grained (hybrid) integrative modeling of biomolecular interactions. Comput Struct Biotechnol J 18:1182–1190. https://doi.org/10.1016/j.csbj.2020.05.002
    OpenUrl
  31. 31.↵
    Krupa P, Karczyńska AS, Mozolewska MA, et al. (2020) UNRES-Dock—protein– protein and peptide–protein docking by coarse-grained replica-exchange MD simulations. Bioinformatics. https://doi.org/10.1093/bioinformatics/btaa897
  32. 32.↵
    Kuroda D, Gray JJ (2016) Pushing the Backbone in Protein-Protein Docking. Structure 24:1821–1829. https://doi.org/10.1016/j.str.2016.06.025
    OpenUrlCrossRef
  33. 33.
    Marze NA, Roy Burman SS, Sheffler W, Gray JJ (2018) Efficient flexible backbone protein–protein docking for challenging targets. Bioinformatics 34:3461–3469. https://doi.org/10.1093/bioinformatics/bty355
    OpenUrlCrossRefPubMed
  34. 34.↵
    Roy Burman SS, Nance ML, Jeliazkov JR, et al. (2020) Novel sampling strategies and a coarse□grained score function for docking homomers, flexible heteromers, and oligosaccharides using Rosetta in CAPRI rounds 37–45. Proteins Struct Funct Bioinforma 88:973–985. https://doi.org/10.1002/prot.25855
    OpenUrl
  35. 35.↵
    Zacharias M (2005) ATTRACT: Protein-protein docking in CAPRI using a reduced protein model. Proteins Struct Funct Bioinforma 60:252–256. https://doi.org/10.1002/prot.20566
    OpenUrl
  36. 36.↵
    Glashagen G, Vries S, Uciechowska□Kaczmarzyk U, et al. (2020) Coarse□grained and atomic resolution biomolecular docking with the ATTRACT approach. Proteins Struct Funct Bioinforma 88:1018–1028. https://doi.org/10.1002/prot.25860
    OpenUrl
  37. 37.↵
    Moal IH, Bates PA (2010) SwarmDock and the Use of Normal Modes in Protein-Protein Docking. Int J Mol Sci 11:3623–3648. https://doi.org/10.3390/ijms11103623
    OpenUrlCrossRefPubMed
  38. 38.↵
    Yan Y, He J, Feng Y, et al. (2020) Challenges and opportunities of automated protein□protein docking: HDOCK server vs human predictions in CAPRI <scp>Rounds</scp> 38L46. Proteins Struct Funct Bioinforma 88:1055–1069. https://doi.org/10.1002/prot.25874
    OpenUrl
  39. 39.↵
    Ciemny M, Badaczewska-Dawid A, Pikuzinska M, et al. (2019) Modeling of Disordered Protein Structures Using Monte Carlo Simulations and Knowledge-Based Statistical Force Fields. Int J Mol Sci 20:606. https://doi.org/10.3390/ijms20030606
    OpenUrlCrossRef
  40. 40.↵
    Badaczewska-Dawid AE, Khramushin A, Kolinski A, et al. (2020) Protocols for All-Atom Reconstruction and High-Resolution Refinement of Protein–Peptide Complex Structures. Methods Mol Biol 2165:273–287. https://doi.org/10.1007/978-1-0716-0708-4_16
    OpenUrl
  41. 41.↵
    Badaczewska-Dawid AE, Kolinski A, Kmiecik S (2020) Computational reconstruction of atomistic protein structures from coarse-grained models. Comput Struct Biotechnol J. 18:162–176. https://doi.org/10.1016/j.csbj.2019.12.007
    OpenUrl
  42. 42.↵
    Jamroz M, Orozco M, Kolinski A, Kmiecik S (2013) Consistent View of Protein Fluctuations from All-Atom Molecular Dynamics and Coarse-Grained Dynamics with Knowledge-Based Force-Field. J Chem Theory Comput 9:119–125. https://doi.org/10.1021/ct300854w
    OpenUrlCrossRefPubMed
  43. 43.
    Kuriata A, Gierut AM, Oleniecki T, et al. (2018) CABS-flex 2.0: A web server for fast simulations of flexibility of protein structures. Nucleic Acids Res 46:W338–W343. https://doi.org/10.1093/nar/gky356
    OpenUrl
  44. 44.
    Kurcinski M, Oleniecki T, Ciemny MP, et al. (2019) CABS-flex standalone: a simulation environment for fast modeling of protein flexibility. Bioinformatics 35:694– 695. https://doi.org/10.1093/bioinformatics/bty685
    OpenUrlCrossRef
  45. 45.↵
    Kmiecik S, Gront D, Kouza M, Kolinski A (2012) From Coarse-Grained to Atomic-Level Characterization of Protein Dynamics: Transition State for the Folding of B Domain of Protein A. J Phys Chem B 116:7026–7032. https://doi.org/10.1021/jp301720w
    OpenUrlCrossRefPubMed
  46. 46.↵
    Kmiecik S, Kolinski A (2007) Characterization of protein-folding pathways by reduced-space modeling. Proc Natl Acad Sci 104:12330–12335. https://doi.org/10.1073/pnas.0702265104
    OpenUrlAbstract/FREE Full Text
  47. 47.↵
    Kmiecik S, Kolinski A (2011) Simulation of Chaperonin Effect on Protein Folding: A Shift from Nucleation–Condensation to Framework Mechanism. J Am Chem Soc 133:10283–10289. https://doi.org/10.1021/ja203275f
    OpenUrlCrossRefPubMedWeb of Science
  48. 48.↵
    Kurcinski M, Kolinski A, Kmiecik S (2014) Mechanism of Folding and Binding of an Intrinsically Disordered Protein As Revealed by ab Initio Simulations. J Chem Theory Comput 10:2224–2231. https://doi.org/10.1021/ct500287c
    OpenUrl
  49. 49.↵
    Ciemny MP, Debinski A, Paczkowska M, et al. (2016) Protein-peptide molecular docking with large-scale conformational changes: The p53-MDM2 interaction. Sci Rep 6:. https://doi.org/10.1038/srep37532
  50. 50.↵
    Kurcinski M, Jamroz M, Blaszczyk M, et al. (2015) CABS-dock web server for the flexible docking of peptides to proteins without prior knowledge of the binding site. Nucleic Acids Res 43:W419–W424. https://doi.org/10.1093/nar/gkv456
    OpenUrlCrossRefPubMed
  51. 51.↵
    Blaszczyk M, Kurcinski M, Kouza M, et al. (2016) Modeling of protein–peptide interactions using the CABS-dock web server for binding site search and flexible docking. Methods 93:72–83. https://doi.org/10.1016/j.ymeth.2015.07.004
    OpenUrl
  52. 52.
    Ciemny MP, Kurcinski M, Kozak K, et al. (2017) Highly flexible protein-peptide docking using cabs-dock. Methods Mol Biol 1561:69–94. https://doi.org/10.1007/978-1-4939-6798-8_6
    OpenUrlCrossRef
  53. 53.↵
    Kurcinski M, Ciemny MP, Oleniecki T, et al. (2019) CABS-dock standalone: a toolbox for flexible protein–peptide docking. Bioinformatics 35:4170–4172. https://doi.org/10.1093/bioinformatics/btz185
    OpenUrl
  54. 54.
    Blaszczyk M, Ciemny MP, Kolinski A, et al. (2019) Protein-peptide docking using CABS-dock and contact information. Brief Bioinform 20:2299–2305. https://doi.org/10.1093/bib/bby080
    OpenUrl
  55. 55.↵
    Kurcinski M, Badaczewska□Dawid A, Kolinski M, et al. (2020) Flexible docking of peptides to proteins using CABSLdock. Protein Sci 29:211–222. https://doi.org/10.1002/pro.3771
    OpenUrl
  56. 56.↵
    Ciemny MP, Kurcinski M, Blaszczyk M, et al. (2017) Modeling EphB4-EphrinB2 protein–protein interaction using flexible docking of a short linear motif. Biomed Eng Online 16:71. https://doi.org/10.1186/s12938-017-0362-7
    OpenUrl
  57. 57.↵
    Ciemny M, Kurcinski M, Kamel K, et al. (2018) Protein–peptide docking: opportunities and challenges. Drug Discov Today 23:1530–1537. https://doi.org/10.1016/j.drudis.2018.05.006
    OpenUrlCrossRef
  58. 58.↵
    Badaczewska-Dawid AE, Kmiecik S, Koliński M (2020) Docking of peptides to GPCRs using a combination of CABS-dock with FlexPepDock refinement. Brief Bioinform. https://doi.org/10.1093/bib/bbaa109
  59. 59.↵
    Koliński M, Kmiecik S, Dec R, et al. (2020) Docking interactions determine early cleavage events in insulin proteolysis by pepsin: Experiment and simulation. Int J Biol Macromol 149:1151–1160. https://doi.org/10.1016/j.ijbiomac.2020.01.253
    OpenUrl
  60. 60.↵
    Chen R, Weng Z (2002) Docking unbound proteins using shape complementarity, desolvation, and electrostatics. Proteins Struct Funct Genet 47:281–294. https://doi.org/10.1002/prot.10092
    OpenUrlCrossRefPubMedWeb of Science
  61. 61.↵
    Roel-Touris J, Don CG, V. Honorato R, et al. (2019) Less Is More: Coarse-Grained Integrative Modeling of Large Biomolecular Assemblies with HADDOCK. J Chem Theory Comput 15:6358–6367. https://doi.org/10.1021/acs.jctc.9b00310
    OpenUrl
Back to top
PreviousNext
Posted February 23, 2021.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Protein-protein docking with large-scale backbone flexibility
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Protein-protein docking with large-scale backbone flexibility
Mateusz Kurcinski, Sebastian Kmiecik, Mateusz Zalewski, Andrzej Kolinski
bioRxiv 2021.02.22.432196; doi: https://doi.org/10.1101/2021.02.22.432196
Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
Citation Tools
Protein-protein docking with large-scale backbone flexibility
Mateusz Kurcinski, Sebastian Kmiecik, Mateusz Zalewski, Andrzej Kolinski
bioRxiv 2021.02.22.432196; doi: https://doi.org/10.1101/2021.02.22.432196

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (2635)
  • Biochemistry (5222)
  • Bioengineering (3650)
  • Bioinformatics (15727)
  • Biophysics (7218)
  • Cancer Biology (5597)
  • Cell Biology (8050)
  • Clinical Trials (138)
  • Developmental Biology (4739)
  • Ecology (7471)
  • Epidemiology (2059)
  • Evolutionary Biology (10531)
  • Genetics (7702)
  • Genomics (10090)
  • Immunology (5155)
  • Microbiology (13828)
  • Molecular Biology (5354)
  • Neuroscience (30604)
  • Paleontology (212)
  • Pathology (872)
  • Pharmacology and Toxicology (1520)
  • Physiology (2235)
  • Plant Biology (4987)
  • Scientific Communication and Education (1037)
  • Synthetic Biology (1380)
  • Systems Biology (4131)
  • Zoology (804)