- Training Courses
- Workshops
- Grants & Fellowships
- European Conference in Functional Genomics
- Meeting Reports
- Online Registration

 

 

Workshop on
Flexible Macromolecular Docking

CECAM, Lyon, France, 28-30 April 2004

Organisers
Report
Programme
Abstracts
List of participants

Organisers

Chantal Prévost Laboratoire de Biochimie Théorique, IBPC, Paris France
Michael Sternberg Department of Biology and Biochemistry - Imperial College of Science, Technology and Medicine - South Kensington, London, SW7 2AY, U.K
Joël Janin Laboratoire d'Enzymologie et Biochimie Structurales, Gif-sur-Yvette, France

Report

Scientific background and objectives

The last 25 years have witnessed the development of potent algorithms for docking macromolecules1. Launched in 2001, the CAPRI experience (Critical Assessment of PRediction of Interactions) provides evaluation of these methods on a common ground and incentives new methodology development2. The motivation is to build three-dimensional structures of molecular machineries (the biological active species) starting from their separate macromolecular components. Making such a tool available is an essential complement of experimental approaches as Protein-Interaction Maps become available for whole proteomes and as the number of protein with known or built-up structures increases. The structure of the proteins is generally available in their unbound form and may arise from precise X-ray or NMR structure resolution, but also from low resolution EM reconstruction or from homology modeling. In addition to side-chain fluctuations, likely to occur between the free and bound forms, the main chain can undergo conformational changes during association3. Structural elements may also be missing in the unbound form, particularly in the case of low resolution structures. Proteins loops are generally poorly defined in structures resulting from homology modelling4.

Evaluation of the docking results in CAPRI has established that some complexes cannot be correctly predicted without explicitly taking into account the internal flexibility of the receptor as well as that of the ligand macromolecule5. This is a difficult task to achieve while conserving the search rapidity necessary for post-genomics applications. Systematic exploration, in terms of position and rotation, of the possible arrangement of the two partners considered as rigid bodies already necessitates the generation of hundreds of thousands trial conformations for the complex. Introducing every degree of protein internal flexibility from the beginning of a systematic search procedure is clearly not manageable due to combinatorial explosion. However, there is still no systematic way to know which of these internal degrees of freedom would be essential for a particular docking problem. In fact, incorporating the internal flexibility of proteins in the docking methods has been identified as a major bottleneck to improve the field and this CECAM workshop intended to specifically tackle this problem. The other identified bottleneck, which is the determination of scoring functions able to discriminate between a correct docking result and a false positive result, was not the subject of this workshop but has however been addressed in part during the discussions since it is hardly separable from the flexible docking problem.

Many of the groups which develop docking programs have already devised a way of considering side chain conformational rearrangement during docking, whether implicitly or explicitly. Methods for considering higher levels of flexibility, involving the rearrangement of segments of the protein backbone, i.e. loops, domains or the whole protein, are currently being explored by an increasing number of groups. At the same time, protein flexibility is also being addressed in other fields of molecular modeling, whether per se, to better understand the mechanism underlying induced fit, or to deal with related problems like small molecule docking or protein folding. Since these explorations are not necessarily submitted to the restrictions inherent to the docking methods (systematic search, need of rapidity), the information they provide can be very helpful for the development of flexible docking methods. The present workshop on flexible macromolecular docking came as a convergence between these different approaches. Its object was to bring together the groups which have experienced some of the aspects of flexible docking as well as those specifically working on protein flexibility, in order to share the experience accumulated by each of them, to inventory methodologies devoted to handle protein flexibility and to evaluate their potentiality in flexible docking, to identify the difficulties inherent to flexible macromolecular docking and to tackle the next steps to be performed in the field.


Program and Participants

The workshop gathered fifty participants, either developing docking methods in relation to the CAPRI experience, involved in specific docking projects or specifically working on protein flexibility in the frame of neighboring fields. European participants came from seven different countries. In addition to academic institutions, four industrial companies were represented. The scientific background of the participants ranged from biophysics to robotics, informatics or genomics. The workshop consisted in series of 35 min oral presentations followed by discussion sessions and poster sessions.

As a guideline for the workshop, three principal sessions had been proposed (listed below). The issues raised in each session have all been largely covered by the speakers, though each speaker generally addressed several of these issues. People involved in the CAPRI experience have already begun to address the flexibility problem and several groups involved in the macromolecular docking field are also involved in small molecule docking or protein folding. It must be emphasized that an important effort has been made by all the speakers to focus their intervention on the theme of the workshop and to extract from their work and experience what is related to flexibility. This resulted in a high quality level of the lectures and related discussions.

• Presentation and analysis of the possible types of deformations which can be expected to occur within macromolecules during their association
- analysis of the impact of such deformations on the results of the Capri experience
- examples where conformational changes have or have not impeded good prediction

• How flexibility is accounted for in the present docking methods :
- implicitly or explicitly
- at the level of side-chains, loops or domain
- at the refinement stage or during the docking process
- advantages and/or problems related with each level of representation, in terms of prediction efficiency and processing time

• Possible methods that can be used to treat protein flexibility at different levels: local or global, full-atom or reduced representation, harmonic or anharmonic movements; experience accumulated in neighboring fields, small molecule docking or protein folding.


Results and Conclusions

The participants identified the following issues as important clues to further advance the flexible macromolecular docking problem.

How do we know a protein is flexible; how can we know which parts are flexible?
Several methods presented by the participants can be helpful to address these questions. The graph-theoretic algorithm FIRST6 [http://firstweb.asu.edu] has been specifically developed to identify flexible regions in a protein. Other hints have been proposed, some of them based on experimental data from NMR or biochemistry, other based on theoretical calculation like Molecular Dynamics (MD) simulation, enhanced MD, normal mode analysis or Principal Component Analysis (PCA). These calculations may be performed on a protein or more generally on a representant of a family of proteins. Indirect indication of flexibility based on genome analysis has also been reported. The Evolutionary Trace (ET) method7, based on the analysis of sequences of divergently related proteins, identifies patches of residues involved in the docking interface. In the case of flexible proteins, it happens that these patches are situated on the protein surface only after a conformational change.


If the flexibility characteristics are known, what methods can be used?
Even if a protein has been detected as internally flexible, it can happen that flexible parts are situated outside a docking interface, as observed for one of the Capri targets. In this case it needs not be taken into account. Different levels of flexibility should also be considered, side chains, loops, domains or the whole protein. Concerning side chains, it is not clear from the examples described during the workshop whether it is sufficient to account implicitly for side chain flexibility (using a soft representation) or if the explicit level is necessary. Side chain refinement has been reported to improve the predictions in some cases but in other cases it could alter the prediction. However, a soft representation is clearly not adapted to loop movements or domain movements since the volume scanned by such moves is very large. One solution is to delete these flexible parts during a first step of systematic rigid body search, then possibly reintroduce them. Alternatively, methods have been presented to explicitly account for loop or domain movements during docking. In the case of loop movement, a mutlicopy approach was used with pre-generated loop conformations. In the case of domain movements of the hinge-bend type, a multi-component docking approach was used. Interestingly, multi-component docking appeared much more efficient in predicting correct protein arrangements than successively docking pairs of separate protein elements. Contributions from the graph theory for multi-component docking or from robotics to generate possible deformations and articulate movements appear very promising. In the case of global deformations of the protein backbone, docking on a sample of conformations issued from MD or from PCA-enhanced MD appeared to improve the predictions, as did PCA-based conformational adjustment performed during the docking process.

In any case, any information allowing restriction of the search space is welcome. This allows efficient use of an all-atom, internal variable representation of the protein during the final stages of docking, coupled with MD or minimization. Precise information on the docking interface can be obtained by NMR or ET. Information can also be deduced from published biochemical experiments. Such information must be used with caution since a wrong interpretation of the data systematically leads to wrong predictions.

Other points under discussion
Several studies presented during the workshop aimed at better understanding the induced fit process during macromolecular docking. In particular, it was discussed whether protein deformation occurs as a result of the modification of the external field sensed by the protein, or if the docking process involves a selection between conformations already present in the solution ensemble.

The importance of the scoring function was also emphasized. In intermediate stages of the docking process, an inadequate scoring function may lead to reject conformations that would have led to correct predictions. It was discussed whether an universal scoring function exists, based on a precise account of all free energy components, or if it is more adequate to use different scoring functions, adapted to the different levels of protein representation during the docking process.

Finally, the need to learn from bad predictions has been stressed. Particularly, it may be useful to elaborate a more complex analysis of the predictions. Poorly scored complexes containing zones with good interactions coexisting with zones with bad interactions ("white" + "black" = "grey") should be distinguished from complexes with uniformly poor interactions (all "grey"). This distinction is particularly important in the case of flexible docking and may lead to identify induced deformations in a protein. To perform such an analysis, much can be learned from the huge amount of data generated by the various groups in view of the Capri experience and discarded when selecting the structures to be submitted. Comparative studies underway between true dimeric proteins and proteins submitted to crystal packing interaction are also very important for this purpose. It seems that the spatial repartition of interactions is more important than simply the total sum of interactions.

In conclusion, the workshop successfully reached its goal in specifically addressing the problem of accounting for flexibility in macromolecular docking. Thanks to the high-level contributions of all participants, we were able to determine the important issues to be addressed, to identify methods potentially useful to solve these questions and interesting tracks to be explored. We hope the constructive discussions that took place during the workshop announce a new stage of development of the macromolecular docking field.

References

1. Smith GR, Sternberg MJE. Prediction of protein-protein interactions by docking methods. Curr. Opin. Struct. Biol. 2002, 12, 28-35
2. Janin J, Henrick K, Moult J, Eyck LT, Sternberg MJ, Vajda S, VakserI, Wodak SJ. CAPRI: A Critical Assessment of PRedicted Interactions. Proteins 2003, 52, 2-9. [http://capri.ebi.ac.uk]
3. Betts MJ, Sternberg MJE. An analysis of conformational changes on protein-protein association: implications for predictive docking. Protein Engineering 1999, 12, 271-283
see also http://molmovdb.org/molmovdb for an overview of protein movements
4. Rodriguez R, Chinea G, Lopez N, Pons T, Vriend G. Homology modeling, model and software evaluation: three related resources. Bioinformatics. 1998, 14, 523-8
5. Mendez R, Leplae R, De Maria L, Wodak SJ. Assessment of blind predictions of protein-protein interactions: Current status of docking methods. Proteins 2003, 52, 51-67
6. Jacobs DJ, Radler, AJ, Kuhn LA and Thorpe MF. Protein Flexibility predictions using graph theory. Proteins, 2001, 44, 150-165
7. Lichtarge O, Bourne HR and Cohen FE. The evolutionary trace method defines the binding surfaces common to a protein family. J Mol Biol 1996, 257, 342-358

Programme

Wednesday 28th April

8.45 - 9.00 Welcome and practical information
9.00 - 9.20 Chantal Prévost: Presentation of the workshop: principal issues and objectives
9.20 - 9.55 Joël Janin: Conformation changes and specificity of protein-protein interaction
9.55 - 10.30 Shoshana Wodak: Possible types of protein deformations and summary of some of the CAPRI results with focus on the influence of conformational changes.
10.30 - 11.00 Coffee break
11.00 - 11.35 Miriam Eisenstein:Analysis of the impact of deformations on the results of the CAPRI contest
11.35 - 12.10 Carlos J. Camacho: How Flexible your Rigid-Body Docking Should Be
12.10 - 14.15 Lunch break
14.15 - 14.50 Juan Fernández-Recio: ICM optimization of flexible interface side-chains in protein-protein docking: successes and limitations
14.50 - 15.25 Ludwig Kripphal: Flexibility as part of the geometric filtering problem
15.25 - 17.00 Poster session and coffee break
17.00 - 17.35 Michael J.E. Sternberg: Modelling the structure of protein-protein complexes

Thursday 29th April

9.00 - 9.35 Ruben Abagyan: Simulating induced fit in molecular docking
9.35 - 10.10 Dina Schneidman: Modeling large-scale hinge-bent motions in docking
10.10 - 10.30 Discussion session
10.30 - 10.50 Coffee break
10.50 - 11.25 Karine Bastard: Accounting for protein loop flexibility during macromolecular docking
11.25 - 12.00 Martin Zacharias: How to efficiently account for side chain flexibility and global motions during docking
12.00 - 14.00 Lunch break and poster session
14.00 - 14.35 Alexandre M.J.J Bonvin: HADDOCK: an information-driven flexible docking approach
14.35 - 15.10 Graham R. Smith: How may the use of MD and rigid-body docking algorithms overcome the protein flexibility problem associated with complex formation?
15.10 -19.30 Afternoon free
19.30 Dinner

Friday 30th April

9.00 - 9.35 Yuval Inbar: Combinatorial docking for multi-molecular assembly and protein structure prediction
9.35 - 10.10 Raik Gruenberg: Complementarity of structure ensembles in protein-protein binding
10.10 - 10.30 Discussion session
10.30 - 10.50 Coffee break
10.50 - 11.25 Olivier Lichtarge: Prediction of interacting surfaces by the Evolutionary Trace method
11.25 - 12.00 Leslie Kuhn: Modeling Correlated Protein Main-chain Motions in Proteins and their Ligands
12.00 - 13.30 Lunch break
13.30 - 14.05 Heather A. Carlson: Protein flexibility and drug design: How to hit a moving target
14.05 - 15.05 Discussion session

Abstracts

Conformation changes and the specificity of protein-protein interaction
04/28 - 9.20
Joël Janin, Laboratoire d'Enzymologie et de Biochimie Structurales, CNRS, Gif-sur-Yvette, France
Molecular docking algorithms assemble a two-pieces puzzle, which would be a child game if the components were rigid like a lock and a key. In reality, molecules (small or large) change conformation as they associate, a feature that all protein-protein docking procedures must take into account. Whereas existing procedures generally succeed when the conformation changes are small, they fail to reproduce large changes. These are nevertheless common, and in many biological systems, they are essential to the function. Changes seen upon association can be local (loop movement) or global (dimerization), and they may include disorder-to-order transitions, making protein-protein interaction of similar complexity to protein folding.
Specific protein-protein complexes and homodimeric proteins form interfaces that are large and compact, with close-packed interface atoms. In contrast, the non-specific interaction observed in protein crystal packing generate small, loosely packed interfaces. These structural differences are easily interpreted in terms of geometric complementarity in cases where conformation changes are small and recognition takes place between preformed surfaces. In contrast, large changes at an interface imply that recognition first occurs between surfaces that are not complementary. A basic question in molecular assembly is how this process takes place, and whether we can reproduce it in docking procedure.

Analysis of the impact of deformations on the results of the CAPRI experiment
04/28 - 11.00
Miri Eisenstein, Weizmann Institute of Science, Rehovot, Israel
The results of the CAPRI experiment indicate that rigid body docking procedures are able to tolerate considerable structural deformations (e.g. T01, T10 and T11). However, larger deformations that involve hinge movements, are a problem not only when they are real, as in T09, but also when they are anticipated, as in T11 and T13. It appears that the current scoring functions cannot be used to predict if a hinge movement is likely to occur or not. In cases when a hinge movement is known to occur, a multi-rigid-body approach can be used to predict the structure of the complex.

How Flexible your Rigid-Body Docking Should Be
04/28 - 11.35
Carlos J. Camacho, Boston University, USA
I will discuss the feasibility of structural refinement in the context of protein-protein docking, emphasizing the benefits/disadvantages of backbone and side chain flexibility. I will mention the biophysical motivation for addressing the side chain refinement problem, and suggest how to solve it. Examples from blind predictions in the CAPRI experiment will also be mentioned.

ICM optimization of flexible interface side-chains in protein-protein docking: successes and limitations
04/28 - 14.15
Juan Fernandez-Recio, University of Cambridge, UK
The ICM Docking and Interface Side-Chain Optimization (ICM-DISCO) was benchmarked in 24 unbound pairs for protein-protein docking [Fernandez-Recio et al. (2002) Protein Sci. 11, 280-291] and successfully evaluated in the blind CAPRI experiment (http://capri.ebi.ac.uk) [Fernandez-Recio et al. (2003) Proteins 52, 113-117]. The rigid-body docking step is able to provide thousands of candidate poses ranked by interacting energy, and gives important information about the location of the putative protein-protein interaction sites [Fernandez-Recio et al. (2004) J.Mol.Biol. 335, 843-865]. However, it is the global energy optimization of the flexible ligand interface side-chains that ultimately helps to identify the correct geometry of the complex. This flexible refinement step is especially efficient in protease-inhibitors, and generally, in cases where only a few steric clashes between the unbound side-chains need to be resolved in order to achieve the final complex structure. However, in some other cases the ligand side-chain optimization protocol is not enough to achieve the optimized fit of the interacting molecules. Both successful and not-so-successful stories will be analyzed here,and we will discuss new ways of improving flexible refinement of the interfaces in internal coordinates.

Flexibility as part of the geometric filtering problem.
04/28 - 14.50
Ludwig Krippahl, Universidade Nova de Lisboa, Portugal
When modelling protein complexes, geometric complementarity is generally the most important criterion for filtering the large set of possible models and reducing it to a manageable sub-set. This necessary reliance on geometry makes protein flexibility a major problem. Two solutions for this problem are either to account for flexibility implicitly by relaxing the stringency of the filter, or to retain a stringent filter and model different conformations explicitly. Though the latter approach has become more popular with the increase in computation power, we propose that, in many cases, flexibility can be considered as part of the general problem of the reliability geometric complementarity as a predictor of protein interaction.
To this end, our docking algorithm BiGGER [1] uses an implicit representation of protein flexibility that can distinguish rigid and flexible regions, and can incorporate experimental data as an additional filter to compensate the lower stringency of the geometric complementarity filter [2].
[1] Palma, P.N., Krippahl, L.,Wampler, J.W., Moura, J.G., A New (Soft) Docking Algorithm for Predicting Protein Interactions. Protein:Struc. Func. Gen. 2000 Jun 1;39(4):372-84.
[2] Krippahl, L., Moura, J.J., Palma, P.N., Modeling Protein Complexes with BiGGER. Prot: Struc.Funct. Gen, V. 52(1):19-23.

Modelling the structure of protein-protein complexes
04/28 - 17.00
Michael J E Sternberg, Patrick Aloy, Philip Carter, Henry Gaab, Suhail Islam, Richard Jackson, Victor Lesk, Gidon Moont, F. Pazos & Graham Smith.
Structural Bioinformatics Group, Dept of Biological Sciences, Imperial College London, UK This talk will describe the current status of the package 3D-DOCK that aims to predict the 3D structure of a protein-protein complex starting from the coordinates of the unbound components. Protein flexibility is introduced in the final stage via a program MULTIDOCK that performs a rigid-body refinement coupled with optimisation of side-chain / side-chain packing is performed The talk will describe the status of the above approach on a test data set. Finally the results of the recent blind trial of protein-protein docking (CAPRI, www.capri.ebi.ac.uk) will be reported.

The talk will also report recent work to predict functional residues from sequence and structure using a new approach that predicts protein function and the responsible residues over a wide range of functional specificities.

Simulating induced fit in molecular docking.
04/29 - 9.00
Ruben Abagyan, Maxim Totrov, Juan-Fernandez Recio, Julio Kovacs, Claudio Cavasotto.
The Scripps Research Institute, La Jolla, USA
The main complicating factor in molecular docking is receptor rearrangement upon ligand binding (induced fit). It is the induced fit that complicates cross-docking of ligands from different ligand receptor complexes. To improve on discriminating between binders and nonbinders in the virtual screening process we developed a protocol which performs receptor-flexible docking of known ligands in order to simulate possible pocket rearrangements. This protocol was applied to a benchmark of kinases and was demonstrated to improve both the cross-docking accuracy as well as the "enrichment" in virtual ligand screening. In protein-protein docking and peptide protein docking the side-chain sampling may be sufficient to account for induced fit. The induced changes of the backbone are more problematic. We show how the slow modes of soft harmonic Ca-model can be used to generate alternative conformations.

Modeling large-scale hinge-bent motions in docking.
04/29 - 9.35
Dina Schneidman & Ruth Nussinov, Tel Aviv University, Israel
Proteins are very flexible molecules. The flexibility may range from small-scale side-chain motions to large-scale intra and inter domain motions or even partial refolding. In my talk I will focus on approaches to handle hinge-bent protein flexibility in docking algorithms. Hinge detection strategies will also be mentioned. I will discuss the problems of the current methods and the challenges of the field. In addition, I will present various examples, including CAPRI targets.

Accounting for protein loop flexibility during macromolecular docking
04/29 - 10.50
Karine Bastard & Chantal Prévost, Laboratoire de Biochimie Théorique, Paris, France
Upon macromolecular association, some proteins undergo large conformational changes that can result in surface loop movements. When the Met repressor binds to DNA, an eight residue loop of Met repressor changes its hairpin conformation into a conformation that wraps around the DNA phosphate backbone. Such an examples confirm the necessity to account for induced surface remodeling during the search for interacting surfaces, by allowing the receptor to adapt to its partner in an induced fit process. To address this problem, we have recently developped a new docking method, termed MC2, which takes into account the loop and side-chain movements at the protein surface during macromolecular association. The objectives of MC2 are to precisely position the ligand, predict the loop conformations that optimally interact with the ligand and adjust the side-chain conformations, in order to predict the atomic level interactions between the two partners. The loop flexibility is artificially introduced by using a multiple copy representation. Each loop copy results from ab intio construction and represents one possible main-chain conformation of the loop with rigid backbone and flexible side-chains. The ligand position, the conformation of the protein side-chains and of the loop copy side-chains are sampled by a Monte-Carlo Simulated Annealing process. The multiple copy representation and Monte Carlo simulation are coupled via the copy weights which are recalculated at the end of each Monte Carlo cycle, finally resulting in selecting a unique loop copy at the end of MC2 process. Final loop adjustments, via energy minimzation, is found to play an important role in establishing the correct energy ranking. In a test-case study, the method was able to predict the structure of the complex at the atomic level and to unambiguously predict the conformation of an interfacial loop.
Bastard K, Thureau A, Lavery R, Prevost C. Docking macromolecules with flexible segments. J.Comput.Chem. 2003 Nov 30;24(15):1910-20.
http://www.ibpc.fr/~bastard/MC2/mc2.html

How to efficiently account for side chain flexibility and global motions during docking
04/29 - 11.25
Martin Zacharias, International University Bremen, Germany
Most current docking approaches to predict the binding geometry of protein-protein complexes use rigid protein partner structures. However, protein complex formation can involve both local conformational changes of side chains and loops at the protein-protein interface and global conformational relaxation of the protein partners. We have developed a docking approach that is based on energy minimization of translational and rotational degrees of freedom of protein partners and on a reduced protein representation allowing efficient search for docking minima. A multicopy approach is used to select the most favourable side-chain conformation at the protein-protein interface during the docking process [1]. To approximately account for possible global conformational adaptation a method has been developed that allows to relax the protein structure in pre-calculated flexible degrees of freedom (soft modes) during docking [2]. Such flexible modes can for example be obtained from molecular dynamics simulations or on the level of a reduced protein representation by employing an energy function that depends on the local protein density. Application of the approaches to test systems will be presented.
[1] Zacharias, M. 2003. Protein-protein docking with a reduced protein model accounting for side chain flexibility. Protein Sci. 12, 1271.
[2] Zacharias, M. 2004. Rapid protein-ligand docking using soft modes from molecular dynamics simulations to account for protein deformability:binding of FK506 to FKBP. Proteins 54, 759.

HADDOCK: an information-driven flexible docking approach
04/29 - 14.00
Alexandre M. J. J. Bonvin, Utrecht University, The Netherlands
In my talk, I will describe our recently developed information-driven flexible docking approach HADDOCK (High Ambiguity Driven protein protein DOCKing) (http://www.nmr.chem.uu.nl/haddock), that makes use of biochemical and/or biophysical information. The experimental information is introduced as highly ambiguous interaction restraints (AIRs) to drive the docking process.
HADDOCK uses an all-atom representation of the system. Flexibility is accounted for in different ways during the docking protocol:
i) in the initial rigid body energy minimization stage by starting the docking from ensembles of conformations (e.g. a NMR ensemble of structures, snapshots from a MD simulation)
ii) during the semi-flexible simulated annealing refinement stage by allowing flexibility at the interface first, only for side-chain atoms, and then, for both side-chain and backbone atoms
iii) in the final SA refinement in explicit water by progressively allowing flexibility in the remaining of the system in addition to the defined, flexible interface.
Reference:
Dominguez, C. Boelens, R. and Bonvin, A.M.J.J. (2003). J. Am. Chem. Soc. 113. 1731

How may the use of MD and rigid-body docking algorithms overcome the protein flexibility problem associated with complex formation?
04/29 - 14.35
Graham R. Smith, Cancer Research UK London Research Institute
Michael J. E. Sternberg, Imperial College London, UK.
Paul A. Bates, Cancer Research UK London Research Institute
The formation of a protein-protein complex is a key event in an enormous number of cellular biochemical processes. However, to predict a wild-type complex computationally given the structures of the components (the "protein docking problem") is still difficult in cases where there is any more than a very small change in the conformation of the components upon the formation of the complex. As a first step to addressing this flexible docking problem, we have used Molecular Dynamics (MD) simulations to investigate the extent to which the conformational fluctuations undergone by proteins in solution reflect the conformational changes that they undergo when they form protein-protein complexes ("induced fit"). To do this, we study a set of over thirty proteins that form such complexes and whose 3-dimensional structures are known, both bound in the complex and unbound. We carry out MD simulations of 5 ns duration with Gromacs, starting from the unbound structures, and analyse the resulting conformational fluctuations in comparison with the structures in the complex.
We find that in some cases the conformational fluctuations observed in MD correlate well with the regions of the proteins that move on complex formation, and in some cases take the protein towards its bound conformation.
Preliminary results are presented on how this information may be used to improve protein-protein docking, both for the test set described above and some targets from recent rounds of CAPRI.

Combinatorial docking for multi-molecular assembly and protein structure prediction
04/30 - 9.00
Yuval Inbar & Haim J. Wolfson, Tel Aviv University, Israel
The majority of proteins function when associated in multimolecular assemblies. Yet, prediction of the structures of multimolecular complexes has largely not been addressed, probably due to the magnitude of the combinatorial complexity of the problem. Docking applications have traditionally been used to predict pairwise interactions between molecules. We have developed an algorithm that extends the application of docking to multi-molecular assemblies.
We apply it to predict both quaternary structures of oligomers and multi-proteins complexes. Moreover, adapting the algorithm to consider backbone connectivity, we also show that it may be useful in the prediction of protein tertiary structures when the structures of the protein parts are available. This application was tested both on domain assembly in order to predict the spatial arrangement of domains in multi-domain proteins, and on protein building blocks (substructures of domains with relatively high population times) assembly to predict their arrangement within a domain in the native protein.

Complementarity of structure ensembles in protein-protein binding
04/30 - 9.35
Raik Grünberg*, Johan Leckner* & Michael Nilges. Institut Pasteur, Paris, France
Our understanding of protein-protein interaction is caught in a contradiction: on the one hand,experimental rates of association suggest that, in many cases, practically every collision between two partner proteins leads to the formation of the complex. On the other hand, we often fail to predict the correct orientation of a protein complex because the two free partners simply don't sufficiently fit. This discrepancy is commonly explained by a fuzzy notion of induced fit, or by the assumption that the bound conformations is present in the structure ensembles of the two unbound proteins. However, both models appear to be inconsistent with our current knowledge about the forces and time scales of recognition.
In this study, we try to incorporate the additional dimensions of receptor and ligand variability into our picture of the protein-protein binding process. We performed two sets of molecular dynamics simulations for the unbound (free) structures of 17 receptor and 16 ligand proteins and applied shape-driven rigid body docking to all combinations of representative receptor and ligand snapshots as well as the free structure. In total, we analysed and compared 2,106,368 solutions from 4114 exhaustive rigid body dockings between 693 conformations of 33 different proteins. The cross-docking of ensemble snapshots increases the chances to find near native orientations. Our results suggest that there are complementary conformations within the free receptor and ligand ensembles, which, however are in general not necessarily related to the bound structure. In addition, we also performed molecular dynamics simulations on all 17 complexes and analysed the flexibility of free and bound proteins. Our results indicate that binding may not necessarily occur at the cost of entropy. We propose a refined model of the protein-protein recognition process that is combining the ideas of conformer selection and induced fit and is in better aggreement with our current understanding of interaction forces, time scales and kinetic data.
*these authors contributed equally to the work

Prediction of interacting surfaces by the Evolutionary Trace method
04/29 - 10.50
Olivier Lichtarge, Baylor College of Medicine, Houston, USA
Protein-protein interactions are the elementary units from which molecular pathways and cellular networks are built. But the description of the functional surfaces that determine protein binding still elude us. The Evolutionary Trace (ET) approach to this problem is to combine sequences, evolutionary trees, and structures to reveal the canonical determinants of a protein¹s function. Large-scale studies show that these determinants cluster spatially in the structure and that they match functional sites on proteins surfaces. Their discovery allows experimentalists to rationally design activity through targeted mutagenesis, for example along the G protein-signaling pathway. The scalability and generality of ET further suggest that proteome-wide annotation of functional sites is within reach. The activity of many protein structures may then be traced to narrow sets of relevant amino acids that form ³elementary units of function and of interaction². From a practical viewpoint, these units can be engineered to analyze and manipulate the molecular basis of protein function.The majority of proteins function when associated in multimolecular assemblies. Yet, prediction of the structures of multimolecular complexes has largely not been addressed, probably due to the magnitude of the combinatorial complexity of the problem. Docking applications have traditionally been used to predict pairwise interactions between molecules. We have developed an algorithm that extends the application of docking to multi-molecular assemblies.
We apply it to predict both quaternary structures of oligomers and multi-proteins complexes. Moreover, adapting the algorithm to consider backbone connectivity, we also show that it may be useful in the prediction of protein tertiary structures when the structures of the protein parts are available. This application was tested both on domain assembly in order to predict the spatial arrangement of domains in multi-domain proteins, and on protein building blocks (substructures of domains with relatively high population times) assembly to predict their arrangement within a domain in the native protein.

Modeling Correlated Protein Main-chain Motions in Proteins and their Ligands
04/30 - 9.00
Leslie A. Kuhn(1), Maria I. Zavodszky(1), Sameer Arora(2), Ming Lei(3), and Michael F. Thorpe(4)
(1) Department of Biochemistry & Molecular Biology and Center for Biological Modeling, Michigan State University, 502C Biochemistry Building, East Lansing, MI 48824-1319; http://www.bch.msu.edu/labs/kuhn (2) Departments of Biochemistry & Molecular Biology and Computer Science & Engineering, Michigan State University, (3)Department of Biochemistry, Brandeis University, and (4)Physics & Astronomy Department, Arizona State University (KuhnL@msu.edu)
We describe a new method for modeling protein and ligand main-chain flexibility in docking. The goal is to sample the full conformational space, including conformations not yet observed by crystallography, MD, or NMR. Flexibility analysis is performed using the graph-theoretic algorithm FIRST, which identifies coupled networks of covalent and non-covalent bonds within the protein. ROCK then explores available conformations by only sampling dihedral angles that preserve the coupled bond network in the protein. A representative set of protein conformations can then be used as targets for docking with SLIDE, which models protein and ligand side-chain flexibility. This combined approach for incorporating main-chain flexibility in docking is illustrated for cyclophilin A-cyclosporin and estrogen receptor-zearalenol complexes. Very recent results show that the maintenance of correlated motions between hydrogen-bonded and hydrophobic side chains is also a key aspect of ligand recognition across diverse protein-ligand complexes.

Protein flexibility and drug design: How to hit a moving target
04/30 - 13.30
Heather A. Carlson, University of Michigan, USA (carlsonh@umich.edu)
The use of multiple protein structures (MPS) is a growing trend in structure-based drug design. Different techniques will be discussed, and our MPS method for developing receptor-based pharmacophore models will be highlighted. By using MPS, we are able to identify flexible and rigid regions within the binding site and use that information to our advantage. An additional advantage of the method is that an unbound protein structure can be used sucessfully for structure-based inhibitor design !

List of Participants

Name Postal address Email address
Ruben Abagyan Ruben Abagyan Research Group
Molecular Biology
The Scripps Research Institute
10550 North Torrey Pines Rd., TPC-28
La Jolla, CA 92037
abagyan@scripps.edu
Ludovic Autin INSERM U428 "Risque thrombotique et mécanisme de l'hémostase"
Equipe Bioinformatique Structurale
4, av de l'Observatoire
75006 Paris
ludovic.autin@univ-paris5.fr
Ranjit Bahadur Department of Biochemistry
Bose Institute
P-1/12 CIT Scheme VIIM
Calcutta 700 054
Kolkata, India
b_ranjit@bic.boseinst.ernet.in
Karine Bastard Laboratoire de Biochimie Théorique - CNRS UPR 9080
Institut de Biologie Physico-Chimique
13, rue Pierre et Marie Curie,
75005 Paris, France
karine.bastard@ibpc.fr
Paul A. Bates Cancer Research UK London Research Institute
44 Lincoln's Inn Fields,
London WC2A 3PX, U.K.
paul.bates@cancer.org.uk
Nora Benhabilès Clinigenetics
1105, Avenue Pierre Mendes-France
30000 Nimes, France
n.benhabiles@clinigenetics.com
Efrat Ben-Zeev The Weizmann Institute of Science, Revohot, IL
Ullman Blg. Room 249
Weizmann Institute of Science, Rehovot, Israel
efrat.ben-zeev@weizmann.ac.il
Julie Bernauer Equipe de Génomique Structurale
Laboratoire d'Enzymologie et Biochimie Structurales
UPR 9063 - Bâtiment 34 - CNRS
Avenue de la Terrasse
91198 Gif-sur-Yvette Cedex, France
bernauer@lebs.cnrs-gif.fr
A.M.J.J. Bonvin Associate Professor
Department of NMR Spectroscopy
Bijvoet Center for Biomolecular Research
Utrecht University
Padualaan 8, 3584 CH Utrecht
The Netherlands
a.m.j.j.bonvin@chem.uu.nl
Daniel Borgis Modélisation des Systèmes Moléculaires Complexes et LAE, UMR 8587
Université d'Evry-Val-d'Essonne
Equipe Membre de SIMU
Bâtiment Maupertuis
25 Rue du Père Jarland
91025 EVRY Cedex
dborgis@univ-evry.fr
Carlos J. Camacho Department of Biomedical Engineering
Boston University
44 Cummington Street
Boston MA 02215 US
ccamacho@bu.edu
Heather A. Carlson The University of Michigan
College of Pharmacy
428 Church St.
Ann Arbor, Michigan 48109-1065, US
carlsonh@umich.edu
Phil Carter Structural Bioinformatics Group
Biochemistry Building
Department of Biological Sciences
Imperial College
London SW7 2AY, U.K.
philip.carter@ic.ac.uk
Frederic Cazals Project Geometrica
INRIA Sophia-Antipolis
F-06902 Sophia-Antipolis
Frederic.Cazals@sophia.inria.fr
Aalt-Jan van Dijk Department of NMR Spectroscopy
Bijvoet Center for Biomolecular Research
Utrecht University
Padualaan 8, 3584 CH Utrecht
The Netherlands
a.j.vandijk@chem.uu.nl
Medhi Djafari Rouhani Microsystèmes et Intégration des Systèmes
LAAS-CNRS
7, avenue du Colonel Roche
31077 Toulouse Cedex 4
France
djafari@laas.fr
Miriam Eisenstein Ullman Blg. Room 249
Weizmann Institute of Science
Rehovot, Israel
miriam.eisenstein@weizmann.ac.il
Alain Esteve Microsystèmes et Intégration des Systèmes
LAAS-CNRS
7, avenue du Colonel Roche
31077 Toulouse Cedex 4
France
aesteve@laas.fr
Juan Fernández-Recio Marie Curie Research Fellow
Department of Biochemistry
University of Cambridge
80 Tennis Court Road
Cambridge CB2 1GA, UK
juan@cryst.bioc.cam.ac.uk
Delphine Flatters Equipe de Bioinformatique Génomique et Moléculaire
INSERM EMI 03-46
Université Denis Diderot (Paris 7)
Tour 53-54 , 1er étage, case 7113
2 place jussieu
75251 Paris Cedex 05
Delphine.Flatters@ebgm.jussieu.fr
Raik Gruenberg Institut Pasteur
Unité de Bioinformatique Structurale
25-28 rue du Dr Roux
F-75724 Paris CEDEX 15, France
raik@pasteur.fr
Tap Haduong Modélisation des Systèmes Moléculaires Complexes et LAE, UMR 8587
Université d'Evry-Val-d'Essonne
Equipe Membre de SIMU
Bâtiment Maupertuis
25 Rue du Père Jarland
91025 EVRY Cedex
thaduong@univ-evry.fr
Yuval Inbar Tel Aviv University
Schreiber Building, Room 010,
School of Computer Science
Tel-Aviv University
Tel-Aviv 69978, Israel
inbaryuv@tau.ac.il
Joël Janin Head of the Laboratoire d'Enzymologie et Biochimie Structurales
UPR 9063 - Bâtiment 34 - CNRS
Avenue de la Terrasse
91198 Gif-sur-Yvette Cedex, France
janin@lebs.cnrs-gif.fr
Quentin Kaas CNRS, IMGT, the international ImMunoGeneTics information system,
Laboratoire d'ImmunoGenetique Moleculaire, LIGM
Institut de Genetique Humaine IGH,
UPR CNRS 1142,
141 rue de la Cardonille,
F-34396 Montpellier
kaas@ligm.igh.cnrs.fr
István Kolossváry Novartis Institute for Biomedical Research
556 Morris Ave
Summit, NJ 07901, USA.
istvan.kolossvary@pharma.Novartis.com
Noga Kowalsman Department of Biological Chemistry,
Weizmann Institute of Science
76100 Rehovot, Israel
noga.kowalsman@weizmann.ac.il
Ludwig Kripphal Departamento de Química
Centro Quimica Fina e Biotecnologia
Faculdade de Ciências e Tecnologia.
Universidade Nova de Lisboa
Caparica, Portugal.
ludik@net.cabo.pt
Leslie Kuhn Center for Biological Modeling
Protein Structural Analysis and Design Laboratory
Department of Biochemistry
Michigan State University
East Lansing, MI 48824-1319, US
KuhnL@msu.edu
Richard Lavery
Head of the Laboratoire de Biochimie Théorique
CNRS UPR 9080
Institut de Biologie Physico-Chimique
13, rue Pierre et Marie Curie,
75005 Paris, France
richard.lavery@ibpc.fr
Wen Hwa Lee INSERM U428, Université Paris 5
4, av de l'Observatoire
75006 Paris
wen.hwa.lee@univ-paris5.fr
Victor Lesk Postdoctoral Researcher
Structural Bioinformatics Group
Biochemistry Building
Department of Biological Sciences
Imperial College
London SW7 2AY, U.K.
v.lesk@ic.ac.uk
Olivier Lichtarge Department of Molecular & Human Genetics
Baylor College of Medicine
Houston, Texas 77030, USA
lichtarge@bcm.tmc.edu
Kristin L. Meagher The University of Michigan
College of Pharmacy
428 Church St.
Ann Arbor, Michigan 48109-1065, US
kmeagher@umich.edu
Raul Mendez Service de Conformation des Macromolecules Biologiques et de Bioinformatique
Universite Libre de Bruxelles (ULB)
Campus Plaine, Bd du Triomphe - CP263
B-1050 Bruxelles, BELGIQUE
raul@scmbb.ulb.ac.be
Giacomo de Mori SCMBB - ULB
Service de Conformation des Macromolécules Biologiques et de Bioinformatique
Université Libre de Bruxelles
(Campus Plaine, building BC, C6, 6st level)
bd du Triomphe - CP 263
B-1050 Bruxelles, Belgium
giacomo@scmbb.ulb.ac.be
Florence Nosal Department of Bioinformatics & IT
GenOdyssee S.A.
Parc d'Affaires Technopolis
3 avenue du Canada
Batiment Alpha, Porte 6
B.P. 810 Les Ulis
91974 Courtaboeuf Cedex, France
nosal@genodyssee.com
Chantal Prévost Laboratoire de Biochimie Théorique - CNRS UPR 9080
Institut de Biologie Physico-Chimique
13, rue Pierre et Marie Curie,
75005 Paris, France
chantal.prevost@ibpc.fr
Ralph Nico Riemann School of Engineering and Science
Campus Ring 1
28759 Bremen, Germany
Phone: +49 421 200-3579
Fax: +49 421 200-3249
r.riemanniu-bremen.de
Sophie Sacquin-Mora Laboratoire de Biochimie Théorique - CNRS UPR 9080
Institut de Biologie Physico-Chimique
13, rue Pierre et Marie Curie,
75005 Paris, France
sacquin@ibpc.fr
Dina Schneidman (Duhovny) Schreiber Building,
Tel Aviv University, P.O.B. 39040,
Ramat Aviv, Tel Aviv 69978, Israel
duhovka@tau.ac.il
David Schwarz Department of Computer Science MS 132
Rice University
Physical and Biological Computing Group
6100 Main Street
Houston, TX 77005
dschwarz@rice.edu
Thierry Simeon LAAS/CNRS
Groupe Robotics and Artificial Intelligence
7 Av. du Colonel Roche
31077 Toulouse Cedex, France
nic@laas.fr
Graham R. Smith Biomolecular Modelling Laboratory,
Cancer Research UK London Research Institute,
(formerly Imperial Cancer Research Fund),
44 Lincoln's Inn Fields,
London WC2A 3PX, U.K.
graham.smith@cancer.org.uk
Michael J. E. Sternberg Head of Structural Bioinformatics Group &
Director of Imperial College Centre for Bioinformatics
Structural Bioinformatics Group
Biochemistry Building
Department of Biological Sciences
Imperial College
London SW7 2AY, U.K.
m.sternberg@imperial.ac.uk
Vinh Tran Unite de recherche sur la Biocatalyse (FRE-CNRS 2230)
Faculte des Sciences et Techniques
2 rue de la Houssiniere, BP 92208
44322 Nantes Cedex 03
Vinh.Tran@chimbio.univ-nantes.fr
Shoshana J. Wodak SCMBB - ULB
Service de Conformation des Macromolécules Biologiques et de Bioinformatique
Université Libre de Bruxelles
(Campus Plaine, building BC, C6, 6st level)
bd du Triomphe - CP 263
B-1050 Bruxelles
Belgium
shosh@ucmb.ulb.ac.be
Martin Zacharias School of Engineering and Science
Campus Ring 1
28759 Bremen
Germany
m.zacharias@iu-bremen.de
Ziding Zhang BioAnalytical Science, Nestlé Research Center
Vers-chez-les-Blanc PO Box 44,
CH-1000 Lausanne 26, Switzerland

Ziding.Zhang@rdls.nestle.com