|
Workshop
on
Flexible Macromolecular Docking
CECAM,
Lyon, France, 28-30 April 2004
Organisers
| Chantal
Prévost |
Laboratoire
de Biochimie Théorique, IBPC, Paris France |
| Michael
Sternberg |
Department
of Biology and Biochemistry - Imperial College of Science,
Technology and Medicine - South Kensington, London, SW7
2AY, U.K |
| Joël
Janin |
Laboratoire
d'Enzymologie et Biochimie Structurales, Gif-sur-Yvette,
France |
Report
Scientific
background and objectives
The
last 25 years have witnessed the development of potent algorithms
for docking macromolecules1. Launched in 2001,
the CAPRI experience (Critical Assessment of PRediction of
Interactions) provides evaluation of these methods on a common
ground and incentives new methodology development2.
The motivation is to build three-dimensional structures of
molecular machineries (the biological active species) starting
from their separate macromolecular components. Making such
a tool available is an essential complement of experimental
approaches as Protein-Interaction Maps become available for
whole proteomes and as the number of protein with known or
built-up structures increases. The structure of the proteins
is generally available in their unbound form and may arise
from precise X-ray or NMR structure resolution, but also from
low resolution EM reconstruction or from homology modeling.
In addition to side-chain fluctuations, likely to occur between
the free and bound forms, the main chain can undergo conformational
changes during association3. Structural elements
may also be missing in the unbound form, particularly in the
case of low resolution structures. Proteins loops are generally
poorly defined in structures resulting from homology modelling4.
Evaluation of the docking results in CAPRI has established
that some complexes cannot be correctly predicted without
explicitly taking into account the internal flexibility of
the receptor as well as that of the ligand macromolecule5.
This is a difficult task to achieve while conserving the search
rapidity necessary for post-genomics applications. Systematic
exploration, in terms of position and rotation, of the possible
arrangement of the two partners considered as rigid bodies
already necessitates the generation of hundreds of thousands
trial conformations for the complex. Introducing every degree
of protein internal flexibility from the beginning of a systematic
search procedure is clearly not manageable due to combinatorial
explosion. However, there is still no systematic way to know
which of these internal degrees of freedom would be essential
for a particular docking problem. In fact, incorporating the
internal flexibility of proteins in the docking methods has
been identified as a major bottleneck to improve the field
and this CECAM workshop intended to specifically tackle this
problem. The other identified bottleneck, which is the determination
of scoring functions able to discriminate between a correct
docking result and a false positive result, was not the subject
of this workshop but has however been addressed in part during
the discussions since it is hardly separable from the flexible
docking problem.
Many
of the groups which develop docking programs have already
devised a way of considering side chain conformational rearrangement
during docking, whether implicitly or explicitly. Methods
for considering higher levels of flexibility, involving the
rearrangement of segments of the protein backbone, i.e. loops,
domains or the whole protein, are currently being explored
by an increasing number of groups. At the same time, protein
flexibility is also being addressed in other fields of molecular
modeling, whether per se, to better understand the mechanism
underlying induced fit, or to deal with related problems like
small molecule docking or protein folding. Since these explorations
are not necessarily submitted to the restrictions inherent
to the docking methods (systematic search, need of rapidity),
the information they provide can be very helpful for the development
of flexible docking methods. The present workshop on flexible
macromolecular docking came as a convergence between these
different approaches. Its object was to bring together the
groups which have experienced some of the aspects of flexible
docking as well as those specifically working on protein flexibility,
in order to share the experience accumulated by each of them,
to inventory methodologies devoted to handle protein flexibility
and to evaluate their potentiality in flexible docking, to
identify the difficulties inherent to flexible macromolecular
docking and to tackle the next steps to be performed in the
field.
Program and Participants
The
workshop gathered fifty participants, either developing docking
methods in relation to the CAPRI experience, involved in specific
docking projects or specifically working on protein flexibility
in the frame of neighboring fields. European participants
came from seven different countries. In addition to academic
institutions, four industrial companies were represented.
The scientific background of the participants ranged from
biophysics to robotics, informatics or genomics. The workshop
consisted in series of 35 min oral presentations followed
by discussion sessions and poster sessions.
As
a guideline for the workshop, three principal sessions had
been proposed (listed below). The issues raised in each session
have all been largely covered by the speakers, though each
speaker generally addressed several of these issues. People
involved in the CAPRI experience have already begun to address
the flexibility problem and several groups involved in the
macromolecular docking field are also involved in small molecule
docking or protein folding. It must be emphasized that an
important effort has been made by all the speakers to focus
their intervention on the theme of the workshop and to extract
from their work and experience what is related to flexibility.
This resulted in a high quality level of the lectures and
related discussions.
Presentation and analysis of the possible types of deformations
which can be expected to occur within macromolecules during
their association
- analysis of the impact of such deformations on the results
of the Capri experience
- examples where conformational changes have or have not impeded
good prediction
How flexibility is accounted for in the present docking
methods :
- implicitly or explicitly
- at the level of side-chains, loops or domain
- at the refinement stage or during the docking process
- advantages and/or problems related with each level of representation,
in terms of prediction efficiency and processing time
Possible methods that can be used to treat protein
flexibility at different levels: local or global, full-atom
or reduced representation, harmonic or anharmonic movements;
experience accumulated in neighboring fields, small molecule
docking or protein folding.
Results and Conclusions
The
participants identified the following issues as important
clues to further advance the flexible macromolecular docking
problem.
How do we know a protein is flexible; how can we
know which parts are flexible?
Several methods presented by the participants can be helpful
to address these questions. The graph-theoretic algorithm
FIRST6 [http://firstweb.asu.edu]
has been specifically developed to identify flexible regions
in a protein. Other hints have been proposed, some of them
based on experimental data from NMR or biochemistry, other
based on theoretical calculation like Molecular Dynamics (MD)
simulation, enhanced MD, normal mode analysis or Principal
Component Analysis (PCA). These calculations may be performed
on a protein or more generally on a representant of a family
of proteins. Indirect indication of flexibility based on genome
analysis has also been reported. The Evolutionary Trace (ET)
method7, based on the analysis of sequences of
divergently related proteins, identifies patches of residues
involved in the docking interface. In the case of flexible
proteins, it happens that these patches are situated on the
protein surface only after a conformational change.
If the flexibility characteristics are known, what
methods can be used?
Even if a protein has been detected as internally flexible,
it can happen that flexible parts are situated outside a docking
interface, as observed for one of the Capri targets. In this
case it needs not be taken into account. Different levels
of flexibility should also be considered, side chains, loops,
domains or the whole protein. Concerning side chains, it is
not clear from the examples described during the workshop
whether it is sufficient to account implicitly for side chain
flexibility (using a soft representation) or if the explicit
level is necessary. Side chain refinement has been reported
to improve the predictions in some cases but in other cases
it could alter the prediction. However, a soft representation
is clearly not adapted to loop movements or domain movements
since the volume scanned by such moves is very large. One
solution is to delete these flexible parts during a first
step of systematic rigid body search, then possibly reintroduce
them. Alternatively, methods have been presented to explicitly
account for loop or domain movements during docking. In the
case of loop movement, a mutlicopy approach was used with
pre-generated loop conformations. In the case of domain movements
of the hinge-bend type, a multi-component docking approach
was used. Interestingly, multi-component docking appeared
much more efficient in predicting correct protein arrangements
than successively docking pairs of separate protein elements.
Contributions from the graph theory for multi-component docking
or from robotics to generate possible deformations and articulate
movements appear very promising. In the case of global deformations
of the protein backbone, docking on a sample of conformations
issued from MD or from PCA-enhanced MD appeared to improve
the predictions, as did PCA-based conformational adjustment
performed during the docking process.
In any case, any information allowing restriction of the search
space is welcome. This allows efficient use of an all-atom,
internal variable representation of the protein during the
final stages of docking, coupled with MD or minimization.
Precise information on the docking interface can be obtained
by NMR or ET. Information can also be deduced from published
biochemical experiments. Such information must be used with
caution since a wrong interpretation of the data systematically
leads to wrong predictions.
Other points under discussion
Several studies presented during the workshop aimed at better
understanding the induced fit process during macromolecular
docking. In particular, it was discussed whether protein deformation
occurs as a result of the modification of the external field
sensed by the protein, or if the docking process involves
a selection between conformations already present in the solution
ensemble.
The importance of the scoring function was also emphasized.
In intermediate stages of the docking process, an inadequate
scoring function may lead to reject conformations that would
have led to correct predictions. It was discussed whether
an universal scoring function exists, based on a precise account
of all free energy components, or if it is more adequate to
use different scoring functions, adapted to the different
levels of protein representation during the docking process.
Finally, the need to learn from bad predictions has been stressed.
Particularly, it may be useful to elaborate a more complex
analysis of the predictions. Poorly scored complexes containing
zones with good interactions coexisting with zones with bad
interactions ("white" + "black" = "grey")
should be distinguished from complexes with uniformly poor
interactions (all "grey"). This distinction is particularly
important in the case of flexible docking and may lead to
identify induced deformations in a protein. To perform such
an analysis, much can be learned from the huge amount of data
generated by the various groups in view of the Capri experience
and discarded when selecting the structures to be submitted.
Comparative studies underway between true dimeric proteins
and proteins submitted to crystal packing interaction are
also very important for this purpose. It seems that the spatial
repartition of interactions is more important than simply
the total sum of interactions.
In
conclusion, the workshop successfully reached its goal in
specifically addressing the problem of accounting for flexibility
in macromolecular docking. Thanks to the high-level contributions
of all participants, we were able to determine the important
issues to be addressed, to identify methods potentially useful
to solve these questions and interesting tracks to be explored.
We hope the constructive discussions that took place during
the workshop announce a new stage of development of the macromolecular
docking field.
References
1.
Smith GR, Sternberg MJE. Prediction of protein-protein interactions
by docking methods. Curr. Opin. Struct. Biol. 2002, 12, 28-35
2. Janin J, Henrick K, Moult J, Eyck LT, Sternberg MJ, Vajda
S, VakserI, Wodak SJ. CAPRI: A Critical Assessment of PRedicted
Interactions. Proteins 2003, 52, 2-9. [http://capri.ebi.ac.uk]
3. Betts MJ, Sternberg MJE. An analysis of conformational
changes on protein-protein association: implications for predictive
docking. Protein Engineering 1999, 12, 271-283
see also http://molmovdb.org/molmovdb for an overview of protein
movements
4. Rodriguez R, Chinea G, Lopez N, Pons T, Vriend G. Homology
modeling, model and software evaluation: three related resources.
Bioinformatics. 1998, 14, 523-8
5. Mendez R, Leplae R, De Maria L, Wodak SJ. Assessment of
blind predictions of protein-protein interactions: Current
status of docking methods. Proteins 2003, 52, 51-67
6. Jacobs DJ, Radler, AJ, Kuhn LA and Thorpe MF. Protein Flexibility
predictions using graph theory. Proteins, 2001, 44, 150-165
7. Lichtarge O, Bourne HR and Cohen FE. The evolutionary trace
method defines the binding surfaces common to a protein family.
J Mol Biol 1996, 257, 342-358
Programme
Wednesday 28th April
8.45
- 9.00 Welcome and practical information
9.00 - 9.20 Chantal Prévost: Presentation of the
workshop: principal issues and objectives
9.20 - 9.55 Joël Janin: Conformation changes and specificity
of protein-protein interaction
9.55 - 10.30 Shoshana Wodak: Possible types of protein
deformations and summary of some of the CAPRI results with
focus on the influence of conformational changes.
10.30 - 11.00 Coffee break
11.00 - 11.35 Miriam Eisenstein:Analysis of the impact
of deformations on the results of the CAPRI contest
11.35 - 12.10 Carlos J. Camacho: How Flexible your Rigid-Body
Docking Should Be
12.10 - 14.15 Lunch break
14.15 - 14.50 Juan Fernández-Recio: ICM optimization
of flexible interface side-chains in protein-protein docking:
successes and limitations
14.50 - 15.25 Ludwig Kripphal: Flexibility as part of the
geometric filtering problem
15.25 - 17.00 Poster session and coffee break
17.00 - 17.35 Michael J.E. Sternberg: Modelling the structure
of protein-protein complexes
Thursday 29th April
9.00 - 9.35 Ruben Abagyan: Simulating induced fit in molecular
docking
9.35 - 10.10 Dina Schneidman: Modeling large-scale hinge-bent
motions in docking
10.10 - 10.30 Discussion session
10.30 - 10.50 Coffee break
10.50 - 11.25 Karine Bastard: Accounting for protein loop
flexibility during macromolecular docking
11.25 - 12.00 Martin Zacharias: How to efficiently account
for side chain flexibility and global motions during docking
12.00 - 14.00 Lunch break and poster session
14.00
- 14.35 Alexandre M.J.J Bonvin: HADDOCK: an information-driven
flexible docking approach
14.35 - 15.10 Graham R. Smith: How may the use of MD and
rigid-body docking algorithms overcome the protein flexibility
problem associated with complex formation?
15.10 -19.30 Afternoon free
19.30 Dinner
Friday
30th April
9.00 - 9.35 Yuval Inbar: Combinatorial docking for multi-molecular
assembly and protein structure prediction
9.35 - 10.10 Raik Gruenberg: Complementarity of structure
ensembles in protein-protein binding
10.10 - 10.30 Discussion session
10.30 - 10.50 Coffee break
10.50 - 11.25 Olivier Lichtarge: Prediction of interacting
surfaces by the Evolutionary Trace method
11.25 - 12.00 Leslie Kuhn: Modeling Correlated Protein
Main-chain Motions in Proteins and their Ligands
12.00 - 13.30 Lunch break
13.30 - 14.05 Heather A. Carlson: Protein flexibility and
drug design: How to hit a moving target
14.05 - 15.05 Discussion session
Abstracts
Conformation
changes and the specificity of protein-protein interaction
04/28 - 9.20
Joël Janin, Laboratoire d'Enzymologie et de Biochimie
Structurales, CNRS, Gif-sur-Yvette, France
Molecular docking algorithms assemble a two-pieces puzzle,
which would be a child game if the components were rigid like
a lock and a key. In reality, molecules (small or large) change
conformation as they associate, a feature that all protein-protein
docking procedures must take into account. Whereas existing
procedures generally succeed when the conformation changes
are small, they fail to reproduce large changes. These are
nevertheless common, and in many biological systems, they
are essential to the function. Changes seen upon association
can be local (loop movement) or global (dimerization), and
they may include disorder-to-order transitions, making protein-protein
interaction of similar complexity to protein folding.
Specific protein-protein complexes and homodimeric proteins
form interfaces that are large and compact, with close-packed
interface atoms. In contrast, the non-specific interaction
observed in protein crystal packing generate small, loosely
packed interfaces. These structural differences are easily
interpreted in terms of geometric complementarity in cases
where conformation changes are small and recognition takes
place between preformed surfaces. In contrast, large changes
at an interface imply that recognition first occurs between
surfaces that are not complementary. A basic question in molecular
assembly is how this process takes place, and whether we can
reproduce it in docking procedure.
Analysis
of the impact of deformations on the results of the CAPRI
experiment
04/28 - 11.00
Miri Eisenstein, Weizmann Institute of Science, Rehovot, Israel
The results of the CAPRI experiment indicate that rigid body
docking procedures are able to tolerate considerable structural
deformations (e.g. T01, T10 and T11). However, larger deformations
that involve hinge movements, are a problem not only when
they are real, as in T09, but also when they are anticipated,
as in T11 and T13. It appears that the current scoring functions
cannot be used to predict if a hinge movement is likely to
occur or not. In cases when a hinge movement is known to occur,
a multi-rigid-body approach can be used to predict the structure
of the complex.
How
Flexible your Rigid-Body Docking Should Be
04/28 - 11.35
Carlos J. Camacho, Boston University, USA
I will discuss the feasibility of structural refinement in
the context of protein-protein docking, emphasizing the benefits/disadvantages
of backbone and side chain flexibility. I will mention the
biophysical motivation for addressing the side chain refinement
problem, and suggest how to solve it. Examples from blind
predictions in the CAPRI experiment will also be mentioned.
ICM
optimization of flexible interface side-chains in protein-protein
docking: successes and limitations
04/28 - 14.15
Juan Fernandez-Recio, University of Cambridge, UK
The ICM Docking and Interface Side-Chain Optimization (ICM-DISCO)
was benchmarked in 24 unbound pairs for protein-protein docking
[Fernandez-Recio et al. (2002) Protein Sci. 11, 280-291] and
successfully evaluated in the blind CAPRI experiment (http://capri.ebi.ac.uk)
[Fernandez-Recio et al. (2003) Proteins 52, 113-117]. The
rigid-body docking step is able to provide thousands of candidate
poses ranked by interacting energy, and gives important information
about the location of the putative protein-protein interaction
sites [Fernandez-Recio et al. (2004) J.Mol.Biol. 335, 843-865].
However, it is the global energy optimization of the flexible
ligand interface side-chains that ultimately helps to identify
the correct geometry of the complex. This flexible refinement
step is especially efficient in protease-inhibitors, and generally,
in cases where only a few steric clashes between the unbound
side-chains need to be resolved in order to achieve the final
complex structure. However, in some other cases the ligand
side-chain optimization protocol is not enough to achieve
the optimized fit of the interacting molecules. Both successful
and not-so-successful stories will be analyzed here,and we
will discuss new ways of improving flexible refinement of
the interfaces in internal coordinates.
Flexibility
as part of the geometric filtering problem.
04/28 - 14.50
Ludwig Krippahl, Universidade Nova de Lisboa, Portugal
When modelling protein complexes, geometric complementarity
is generally the most important criterion for filtering the
large set of possible models and reducing it to a manageable
sub-set. This necessary reliance on geometry makes protein
flexibility a major problem. Two solutions for this problem
are either to account for flexibility implicitly by relaxing
the stringency of the filter, or to retain a stringent filter
and model different conformations explicitly. Though the latter
approach has become more popular with the increase in computation
power, we propose that, in many cases, flexibility can be
considered as part of the general problem of the reliability
geometric complementarity as a predictor of protein interaction.
To this end, our docking algorithm BiGGER [1] uses an implicit
representation of protein flexibility that can distinguish
rigid and flexible regions, and can incorporate experimental
data as an additional filter to compensate the lower stringency
of the geometric complementarity filter [2].
[1] Palma, P.N., Krippahl, L.,Wampler, J.W., Moura, J.G.,
A New (Soft) Docking Algorithm for Predicting Protein Interactions.
Protein:Struc. Func. Gen. 2000 Jun 1;39(4):372-84.
[2] Krippahl, L., Moura, J.J., Palma, P.N., Modeling Protein
Complexes with BiGGER. Prot: Struc.Funct. Gen, V. 52(1):19-23.
Modelling
the structure of protein-protein complexes
04/28 - 17.00
Michael J E Sternberg, Patrick Aloy, Philip Carter, Henry
Gaab, Suhail Islam, Richard Jackson, Victor Lesk, Gidon Moont,
F. Pazos & Graham Smith.
Structural Bioinformatics Group, Dept of Biological Sciences,
Imperial College London, UK This talk will describe the current
status of the package 3D-DOCK that aims to predict the 3D
structure of a protein-protein complex starting from the coordinates
of the unbound components. Protein flexibility is introduced
in the final stage via a program MULTIDOCK that performs a
rigid-body refinement coupled with optimisation of side-chain
/ side-chain packing is performed The talk will describe the
status of the above approach on a test data set. Finally the
results of the recent blind trial of protein-protein docking
(CAPRI, www.capri.ebi.ac.uk)
will be reported.
The
talk will also report recent work to predict functional residues
from sequence and structure using a new approach that predicts
protein function and the responsible residues over a wide
range of functional specificities.
Simulating
induced fit in molecular docking.
04/29 - 9.00
Ruben Abagyan, Maxim Totrov, Juan-Fernandez Recio, Julio Kovacs,
Claudio Cavasotto.
The Scripps Research Institute, La Jolla, USA
The main complicating factor in molecular docking is receptor
rearrangement upon ligand binding (induced fit). It is the
induced fit that complicates cross-docking of ligands from
different ligand receptor complexes. To improve on discriminating
between binders and nonbinders in the virtual screening process
we developed a protocol which performs receptor-flexible docking
of known ligands in order to simulate possible pocket rearrangements.
This protocol was applied to a benchmark of kinases and was
demonstrated to improve both the cross-docking accuracy as
well as the "enrichment" in virtual ligand screening.
In protein-protein docking and peptide protein docking the
side-chain sampling may be sufficient to account for induced
fit. The induced changes of the backbone are more problematic.
We show how the slow modes of soft harmonic Ca-model can be
used to generate alternative conformations.
Modeling
large-scale hinge-bent motions in docking.
04/29 - 9.35
Dina Schneidman & Ruth Nussinov, Tel Aviv University,
Israel
Proteins are very flexible molecules. The flexibility may
range from small-scale side-chain motions to large-scale intra
and inter domain motions or even partial refolding. In my
talk I will focus on approaches to handle hinge-bent protein
flexibility in docking algorithms. Hinge detection strategies
will also be mentioned. I will discuss the problems of the
current methods and the challenges of the field. In addition,
I will present various examples, including CAPRI targets.
Accounting
for protein loop flexibility during macromolecular docking
04/29 - 10.50
Karine Bastard & Chantal Prévost, Laboratoire de
Biochimie Théorique, Paris, France
Upon macromolecular association, some proteins undergo large
conformational changes that can result in surface loop movements.
When the Met repressor binds to DNA, an eight residue loop
of Met repressor changes its hairpin conformation into a conformation
that wraps around the DNA phosphate backbone. Such an examples
confirm the necessity to account for induced surface remodeling
during the search for interacting surfaces, by allowing the
receptor to adapt to its partner in an induced fit process.
To address this problem, we have recently developped a new
docking method, termed MC2, which takes into account the loop
and side-chain movements at the protein surface during macromolecular
association. The objectives of MC2 are to precisely position
the ligand, predict the loop conformations that optimally
interact with the ligand and adjust the side-chain conformations,
in order to predict the atomic level interactions between
the two partners. The loop flexibility is artificially introduced
by using a multiple copy representation. Each loop copy results
from ab intio construction and represents one possible main-chain
conformation of the loop with rigid backbone and flexible
side-chains. The ligand position, the conformation of the
protein side-chains and of the loop copy side-chains are sampled
by a Monte-Carlo Simulated Annealing process. The multiple
copy representation and Monte Carlo simulation are coupled
via the copy weights which are recalculated at the end of
each Monte Carlo cycle, finally resulting in selecting a unique
loop copy at the end of MC2 process. Final loop adjustments,
via energy minimzation, is found to play an important role
in establishing the correct energy ranking. In a test-case
study, the method was able to predict the structure of the
complex at the atomic level and to unambiguously predict the
conformation of an interfacial loop.
Bastard K, Thureau A, Lavery R, Prevost C. Docking macromolecules
with flexible segments. J.Comput.Chem. 2003 Nov 30;24(15):1910-20.
http://www.ibpc.fr/~bastard/MC2/mc2.html
How
to efficiently account for side chain flexibility and global
motions during docking
04/29 - 11.25
Martin Zacharias, International University Bremen, Germany
Most current docking approaches to predict the binding geometry
of protein-protein complexes use rigid protein partner structures.
However, protein complex formation can involve both local
conformational changes of side chains and loops at the protein-protein
interface and global conformational relaxation of the protein
partners. We have developed a docking approach that is based
on energy minimization of translational and rotational degrees
of freedom of protein partners and on a reduced protein representation
allowing efficient search for docking minima. A multicopy
approach is used to select the most favourable side-chain
conformation at the protein-protein interface during the docking
process [1]. To approximately account for possible global
conformational adaptation a method has been developed that
allows to relax the protein structure in pre-calculated flexible
degrees of freedom (soft modes) during docking [2]. Such flexible
modes can for example be obtained from molecular dynamics
simulations or on the level of a reduced protein representation
by employing an energy function that depends on the local
protein density. Application of the approaches to test systems
will be presented.
[1] Zacharias, M. 2003. Protein-protein docking with a reduced
protein model accounting for side chain flexibility. Protein
Sci. 12, 1271.
[2] Zacharias, M. 2004. Rapid protein-ligand docking using
soft modes from molecular dynamics simulations to account
for protein deformability:binding of FK506 to FKBP. Proteins
54, 759.
HADDOCK:
an information-driven flexible docking approach
04/29 - 14.00
Alexandre M. J. J. Bonvin, Utrecht University, The Netherlands
In my talk, I will describe our recently developed information-driven
flexible docking approach HADDOCK (High Ambiguity Driven protein
protein DOCKing) (http://www.nmr.chem.uu.nl/haddock), that
makes use of biochemical and/or biophysical information. The
experimental information is introduced as highly ambiguous
interaction restraints (AIRs) to drive the docking process.
HADDOCK uses an all-atom representation of the system. Flexibility
is accounted for in different ways during the docking protocol:
i) in the initial rigid body energy minimization stage by
starting the docking from ensembles of conformations (e.g.
a NMR ensemble of structures, snapshots from a MD simulation)
ii) during the semi-flexible simulated annealing refinement
stage by allowing flexibility at the interface first, only
for side-chain atoms, and then, for both side-chain and backbone
atoms
iii) in the final SA refinement in explicit water by progressively
allowing flexibility in the remaining of the system in addition
to the defined, flexible interface.
Reference:
Dominguez, C. Boelens, R. and Bonvin, A.M.J.J. (2003). J.
Am. Chem. Soc. 113. 1731
How
may the use of MD and rigid-body docking algorithms overcome
the protein flexibility problem associated with complex formation?
04/29 - 14.35
Graham R. Smith, Cancer Research UK London Research Institute
Michael J. E. Sternberg, Imperial College London, UK.
Paul A. Bates, Cancer Research UK London Research Institute
The formation of a protein-protein complex is a key event
in an enormous number of cellular biochemical processes. However,
to predict a wild-type complex computationally given the structures
of the components (the "protein docking problem")
is still difficult in cases where there is any more than a
very small change in the conformation of the components upon
the formation of the complex. As a first step to addressing
this flexible docking problem, we have used Molecular Dynamics
(MD) simulations to investigate the extent to which the conformational
fluctuations undergone by proteins in solution reflect the
conformational changes that they undergo when they form protein-protein
complexes ("induced fit"). To do this, we study
a set of over thirty proteins that form such complexes and
whose 3-dimensional structures are known, both bound in the
complex and unbound. We carry out MD simulations of 5 ns duration
with Gromacs, starting from the unbound structures, and analyse
the resulting conformational fluctuations in comparison with
the structures in the complex.
We find that in some cases the conformational fluctuations
observed in MD correlate well with the regions of the proteins
that move on complex formation, and in some cases take the
protein towards its bound conformation.
Preliminary results are presented on how this information
may be used to improve protein-protein docking, both for the
test set described above and some targets from recent rounds
of CAPRI.
Combinatorial
docking for multi-molecular assembly and protein structure
prediction
04/30 - 9.00
Yuval Inbar & Haim J. Wolfson, Tel Aviv University, Israel
The majority of proteins function when associated in multimolecular
assemblies. Yet, prediction of the structures of multimolecular
complexes has largely not been addressed, probably due to
the magnitude of the combinatorial complexity of the problem.
Docking applications have traditionally been used to predict
pairwise interactions between molecules. We have developed
an algorithm that extends the application of docking to multi-molecular
assemblies.
We apply it to predict both quaternary structures of oligomers
and multi-proteins complexes. Moreover, adapting the algorithm
to consider backbone connectivity, we also show that it may
be useful in the prediction of protein tertiary structures
when the structures of the protein parts are available. This
application was tested both on domain assembly in order to
predict the spatial arrangement of domains in multi-domain
proteins, and on protein building blocks (substructures of
domains with relatively high population times) assembly to
predict their arrangement within a domain in the native protein.
Complementarity
of structure ensembles in protein-protein binding
04/30 - 9.35
Raik Grünberg*, Johan Leckner* & Michael Nilges.
Institut Pasteur, Paris, France
Our understanding of protein-protein interaction is caught
in a contradiction: on the one hand,experimental rates of
association suggest that, in many cases, practically every
collision between two partner proteins leads to the formation
of the complex. On the other hand, we often fail to predict
the correct orientation of a protein complex because the two
free partners simply don't sufficiently fit. This discrepancy
is commonly explained by a fuzzy notion of induced fit, or
by the assumption that the bound conformations is present
in the structure ensembles of the two unbound proteins. However,
both models appear to be inconsistent with our current knowledge
about the forces and time scales of recognition.
In this study, we try to incorporate the additional dimensions
of receptor and ligand variability into our picture of the
protein-protein binding process. We performed two sets of
molecular dynamics simulations for the unbound (free) structures
of 17 receptor and 16 ligand proteins and applied shape-driven
rigid body docking to all combinations of representative receptor
and ligand snapshots as well as the free structure. In total,
we analysed and compared 2,106,368 solutions from 4114 exhaustive
rigid body dockings between 693 conformations of 33 different
proteins. The cross-docking of ensemble snapshots increases
the chances to find near native orientations. Our results
suggest that there are complementary conformations within
the free receptor and ligand ensembles, which, however are
in general not necessarily related to the bound structure.
In addition, we also performed molecular dynamics simulations
on all 17 complexes and analysed the flexibility of free and
bound proteins. Our results indicate that binding may not
necessarily occur at the cost of entropy. We propose a refined
model of the protein-protein recognition process that is combining
the ideas of conformer selection and induced fit and is in
better aggreement with our current understanding of interaction
forces, time scales and kinetic data.
*these authors contributed equally to the work
Prediction
of interacting surfaces by the Evolutionary Trace method
04/29 - 10.50
Olivier Lichtarge, Baylor College of Medicine, Houston, USA
Protein-protein
interactions are the elementary units from which molecular
pathways and cellular networks are built. But the description
of the functional surfaces that determine protein binding
still elude us. The Evolutionary Trace (ET) approach to this
problem is to combine sequences, evolutionary trees, and structures
to reveal the canonical determinants of a protein¹s function.
Large-scale studies show that these determinants cluster spatially
in the structure and that they match functional sites on proteins
surfaces. Their discovery allows experimentalists to rationally
design activity through targeted mutagenesis, for example
along the G protein-signaling pathway. The scalability and
generality of ET further suggest that proteome-wide annotation
of functional sites is within reach. The activity of many
protein structures may then be traced to narrow sets of relevant
amino acids that form ³elementary units of function and
of interaction². From a practical viewpoint, these units
can be engineered to analyze and manipulate the molecular
basis of protein function.The majority of proteins function
when associated in multimolecular assemblies. Yet, prediction
of the structures of multimolecular complexes has largely
not been addressed, probably due to the magnitude of the combinatorial
complexity of the problem. Docking applications have traditionally
been used to predict pairwise interactions between molecules.
We have developed an algorithm that extends the application
of docking to multi-molecular assemblies.
We apply it to predict both quaternary structures of oligomers
and multi-proteins complexes. Moreover, adapting the algorithm
to consider backbone connectivity, we also show that it may
be useful in the prediction of protein tertiary structures
when the structures of the protein parts are available. This
application was tested both on domain assembly in order to
predict the spatial arrangement of domains in multi-domain
proteins, and on protein building blocks (substructures of
domains with relatively high population times) assembly to
predict their arrangement within a domain in the native protein.
Modeling
Correlated Protein Main-chain Motions in Proteins and their
Ligands
04/30 - 9.00
Leslie A. Kuhn(1), Maria I. Zavodszky(1), Sameer Arora(2),
Ming Lei(3), and Michael F. Thorpe(4)
(1) Department of Biochemistry & Molecular Biology and
Center for Biological Modeling, Michigan State University,
502C Biochemistry Building, East Lansing, MI 48824-1319; http://www.bch.msu.edu/labs/kuhn
(2) Departments of Biochemistry & Molecular Biology and
Computer Science & Engineering, Michigan State University,
(3)Department of Biochemistry, Brandeis University, and (4)Physics
& Astronomy Department, Arizona State University (KuhnL@msu.edu)
We describe a new method for modeling protein and ligand main-chain
flexibility in docking. The goal is to sample the full conformational
space, including conformations not yet observed by crystallography,
MD, or NMR. Flexibility analysis is performed using the graph-theoretic
algorithm FIRST, which identifies coupled networks of covalent
and non-covalent bonds within the protein. ROCK then explores
available conformations by only sampling dihedral angles that
preserve the coupled bond network in the protein. A representative
set of protein conformations can then be used as targets for
docking with SLIDE, which models protein and ligand side-chain
flexibility. This combined approach for incorporating main-chain
flexibility in docking is illustrated for cyclophilin A-cyclosporin
and estrogen receptor-zearalenol complexes. Very recent results
show that the maintenance of correlated motions between hydrogen-bonded
and hydrophobic side chains is also a key aspect of ligand
recognition across diverse protein-ligand complexes.
Protein
flexibility and drug design: How to hit a moving target
04/30 - 13.30
Heather A. Carlson, University of Michigan, USA (carlsonh@umich.edu)
The use of multiple protein structures (MPS) is a growing
trend in structure-based drug design. Different techniques
will be discussed, and our MPS method for developing receptor-based
pharmacophore models will be highlighted. By using MPS, we
are able to identify flexible and rigid regions within the
binding site and use that information to our advantage. An
additional advantage of the method is that an unbound protein
structure can be used sucessfully for structure-based inhibitor
design !
List
of Participants
| Name |
Postal
address |
Email
address |
| Ruben
Abagyan |
Ruben
Abagyan Research Group
Molecular Biology
The Scripps Research Institute
10550 North Torrey Pines Rd., TPC-28
La Jolla, CA 92037 |
abagyan@scripps.edu |
| Ludovic
Autin |
INSERM
U428 "Risque thrombotique et mécanisme de
l'hémostase"
Equipe Bioinformatique Structurale
4, av de l'Observatoire
75006 Paris
|
ludovic.autin@univ-paris5.fr
|
| Ranjit
Bahadur |
Department
of Biochemistry
Bose Institute
P-1/12 CIT Scheme VIIM
Calcutta 700 054
Kolkata, India |
b_ranjit@bic.boseinst.ernet.in |
| Karine
Bastard |
Laboratoire
de Biochimie Théorique - CNRS UPR 9080
Institut de Biologie Physico-Chimique
13, rue Pierre et Marie Curie,
75005 Paris, France |
karine.bastard@ibpc.fr
|
| Paul
A. Bates |
Cancer
Research UK London Research Institute
44 Lincoln's Inn Fields,
London WC2A 3PX, U.K. |
paul.bates@cancer.org.uk |
| Nora
Benhabilès |
Clinigenetics
1105, Avenue Pierre Mendes-France
30000 Nimes, France
|
n.benhabiles@clinigenetics.com |
| Efrat
Ben-Zeev |
The
Weizmann Institute of Science, Revohot, IL
Ullman Blg. Room 249
Weizmann Institute of Science, Rehovot, Israel |
efrat.ben-zeev@weizmann.ac.il
|
| Julie
Bernauer |
Equipe
de Génomique Structurale
Laboratoire d'Enzymologie et Biochimie Structurales
UPR 9063 - Bâtiment 34 - CNRS
Avenue de la Terrasse
91198 Gif-sur-Yvette Cedex, France |
bernauer@lebs.cnrs-gif.fr |
| A.M.J.J.
Bonvin |
Associate
Professor
Department of NMR Spectroscopy
Bijvoet Center for Biomolecular Research
Utrecht University
Padualaan 8, 3584 CH Utrecht
The Netherlands |
a.m.j.j.bonvin@chem.uu.nl |
| Daniel
Borgis |
Modélisation
des Systèmes Moléculaires Complexes et LAE,
UMR 8587
Université d'Evry-Val-d'Essonne
Equipe Membre de SIMU
Bâtiment Maupertuis
25 Rue du Père Jarland
91025 EVRY Cedex |
dborgis@univ-evry.fr
|
| Carlos
J. Camacho |
Department
of Biomedical Engineering
Boston University
44 Cummington Street
Boston MA 02215 US |
ccamacho@bu.edu
|
| Heather
A. Carlson |
The
University of Michigan
College of Pharmacy
428 Church St.
Ann Arbor, Michigan 48109-1065, US |
carlsonh@umich.edu
|
| Phil
Carter |
Structural
Bioinformatics Group
Biochemistry Building
Department of Biological Sciences
Imperial College
London SW7 2AY, U.K. |
philip.carter@ic.ac.uk |
| Frederic
Cazals |
Project
Geometrica
INRIA Sophia-Antipolis
F-06902 Sophia-Antipolis |
Frederic.Cazals@sophia.inria.fr
|
| Aalt-Jan
van Dijk |
Department
of NMR Spectroscopy
Bijvoet Center for Biomolecular Research
Utrecht University
Padualaan 8, 3584 CH Utrecht
The Netherlands
|
a.j.vandijk@chem.uu.nl
|
| Medhi
Djafari Rouhani |
Microsystèmes
et Intégration des Systèmes
LAAS-CNRS
7, avenue du Colonel Roche
31077 Toulouse Cedex 4
France |
djafari@laas.fr
|
| Miriam
Eisenstein |
Ullman
Blg. Room 249
Weizmann Institute of Science
Rehovot, Israel |
miriam.eisenstein@weizmann.ac.il
|
| Alain
Esteve |
Microsystèmes
et Intégration des Systèmes
LAAS-CNRS
7, avenue du Colonel Roche
31077 Toulouse Cedex 4
France
|
aesteve@laas.fr
|
| Juan
Fernández-Recio |
Marie
Curie Research Fellow
Department of Biochemistry
University of Cambridge
80 Tennis Court Road
Cambridge CB2 1GA, UK |
juan@cryst.bioc.cam.ac.uk |
| Delphine
Flatters |
Equipe
de Bioinformatique Génomique et Moléculaire
INSERM EMI 03-46
Université Denis Diderot (Paris 7)
Tour 53-54 , 1er étage, case 7113
2 place jussieu
75251 Paris Cedex 05 |
Delphine.Flatters@ebgm.jussieu.fr
|
| Raik
Gruenberg |
Institut
Pasteur
Unité de Bioinformatique Structurale
25-28 rue du Dr Roux
F-75724 Paris CEDEX 15, France |
raik@pasteur.fr
|
| Tap
Haduong |
Modélisation
des Systèmes Moléculaires Complexes et LAE,
UMR 8587
Université d'Evry-Val-d'Essonne
Equipe Membre de SIMU
Bâtiment Maupertuis
25 Rue du Père Jarland
91025 EVRY Cedex |
thaduong@univ-evry.fr
|
| Yuval
Inbar |
Tel
Aviv University
Schreiber Building, Room 010,
School of Computer Science
Tel-Aviv University
Tel-Aviv 69978, Israel |
inbaryuv@tau.ac.il |
| Joël
Janin |
Head
of the Laboratoire d'Enzymologie et Biochimie Structurales
UPR 9063 - Bâtiment 34 - CNRS
Avenue de la Terrasse
91198 Gif-sur-Yvette Cedex, France |
janin@lebs.cnrs-gif.fr |
| Quentin
Kaas |
CNRS,
IMGT, the international ImMunoGeneTics information system,
Laboratoire d'ImmunoGenetique Moleculaire, LIGM
Institut de Genetique Humaine IGH,
UPR CNRS 1142,
141 rue de la Cardonille,
F-34396 Montpellier |
kaas@ligm.igh.cnrs.fr |
| István
Kolossváry |
Novartis
Institute for Biomedical Research
556 Morris Ave
Summit, NJ 07901, USA. |
istvan.kolossvary@pharma.Novartis.com |
| Noga
Kowalsman |
Department
of Biological Chemistry,
Weizmann Institute of Science
76100 Rehovot, Israel |
noga.kowalsman@weizmann.ac.il |
| Ludwig
Kripphal |
Departamento
de Química
Centro Quimica Fina e Biotecnologia
Faculdade de Ciências e Tecnologia.
Universidade Nova de Lisboa
Caparica, Portugal. |
ludik@net.cabo.pt |
| Leslie
Kuhn |
Center
for Biological Modeling
Protein Structural Analysis and Design Laboratory
Department of Biochemistry
Michigan State University
East Lansing, MI 48824-1319, US |
KuhnL@msu.edu
|
Richard
Lavery
|
Head
of the Laboratoire de Biochimie Théorique
CNRS UPR 9080
Institut de Biologie Physico-Chimique
13, rue Pierre et Marie Curie,
75005 Paris, France |
richard.lavery@ibpc.fr |
| Wen
Hwa Lee |
INSERM
U428, Université Paris 5
4, av de l'Observatoire
75006 Paris |
wen.hwa.lee@univ-paris5.fr
|
| Victor
Lesk |
Postdoctoral
Researcher
Structural Bioinformatics Group
Biochemistry Building
Department of Biological Sciences
Imperial College
London SW7 2AY, U.K. |
v.lesk@ic.ac.uk
|
| Olivier
Lichtarge |
Department
of Molecular & Human Genetics
Baylor College of Medicine
Houston, Texas 77030, USA |
lichtarge@bcm.tmc.edu
|
| Kristin
L. Meagher |
The
University of Michigan
College of Pharmacy
428 Church St.
Ann Arbor, Michigan 48109-1065, US |
kmeagher@umich.edu
|
| Raul
Mendez |
Service
de Conformation des Macromolecules Biologiques et de Bioinformatique
Universite Libre de Bruxelles (ULB)
Campus Plaine, Bd du Triomphe - CP263
B-1050 Bruxelles, BELGIQUE |
raul@scmbb.ulb.ac.be
|
| Giacomo
de Mori |
SCMBB
- ULB
Service de Conformation des Macromolécules Biologiques
et de Bioinformatique
Université Libre de Bruxelles
(Campus Plaine, building BC, C6, 6st level)
bd du Triomphe - CP 263
B-1050 Bruxelles, Belgium |
giacomo@scmbb.ulb.ac.be
|
| Florence
Nosal |
Department
of Bioinformatics & IT
GenOdyssee S.A.
Parc d'Affaires Technopolis
3 avenue du Canada
Batiment Alpha, Porte 6
B.P. 810 Les Ulis
91974 Courtaboeuf Cedex, France |
nosal@genodyssee.com
|
| Chantal
Prévost |
Laboratoire
de Biochimie Théorique - CNRS UPR 9080
Institut de Biologie Physico-Chimique
13, rue Pierre et Marie Curie,
75005 Paris, France |
chantal.prevost@ibpc.fr
|
| Ralph
Nico Riemann |
School
of Engineering and Science
Campus Ring 1
28759 Bremen, Germany
Phone: +49 421 200-3579
Fax: +49 421 200-3249 |
r.riemanniu-bremen.de
|
| Sophie
Sacquin-Mora |
Laboratoire
de Biochimie Théorique - CNRS UPR 9080
Institut de Biologie Physico-Chimique
13, rue Pierre et Marie Curie,
75005 Paris, France |
sacquin@ibpc.fr
|
| Dina
Schneidman (Duhovny) |
Schreiber
Building,
Tel Aviv University, P.O.B. 39040,
Ramat Aviv, Tel Aviv 69978, Israel |
duhovka@tau.ac.il |
| David
Schwarz |
Department
of Computer Science MS 132
Rice University
Physical and Biological Computing Group
6100 Main Street
Houston, TX 77005 |
dschwarz@rice.edu |
| Thierry
Simeon |
LAAS/CNRS
Groupe Robotics and Artificial Intelligence
7 Av. du Colonel Roche
31077 Toulouse Cedex, France |
nic@laas.fr
|
| Graham
R. Smith |
Biomolecular
Modelling Laboratory,
Cancer Research UK London Research Institute,
(formerly Imperial Cancer Research Fund),
44 Lincoln's Inn Fields,
London WC2A 3PX, U.K. |
graham.smith@cancer.org.uk
|
| Michael
J. E. Sternberg |
Head
of Structural Bioinformatics Group &
Director of Imperial College Centre for Bioinformatics
Structural Bioinformatics Group
Biochemistry Building
Department of Biological Sciences
Imperial College
London SW7 2AY, U.K. |
m.sternberg@imperial.ac.uk
|
| Vinh
Tran |
Unite
de recherche sur la Biocatalyse (FRE-CNRS 2230)
Faculte des Sciences et Techniques
2 rue de la Houssiniere, BP 92208
44322 Nantes Cedex 03 |
Vinh.Tran@chimbio.univ-nantes.fr
|
| Shoshana
J. Wodak |
SCMBB
- ULB
Service de Conformation des Macromolécules Biologiques
et de Bioinformatique
Université Libre de Bruxelles
(Campus Plaine, building BC, C6, 6st level)
bd du Triomphe - CP 263
B-1050 Bruxelles
Belgium |
shosh@ucmb.ulb.ac.be |
| Martin
Zacharias |
School
of Engineering and Science
Campus Ring 1
28759 Bremen
Germany |
m.zacharias@iu-bremen.de
|
| Ziding
Zhang |
BioAnalytical
Science, Nestlé Research Center
Vers-chez-les-Blanc PO Box 44,
CH-1000 Lausanne 26, Switzerland |
Ziding.Zhang@rdls.nestle.com
|
|