Proteomics

 

 

Standardisation, benchmarking and comparision of different experimental systems

Co-ordinators:
Pierre Legrain Hybrigenics, Paris, France more
Chris Mundy Oxford BioMedica Ltd, Medawar Centre, Oxford, UK more

Large-scale functional genomics experiments are becoming more widespread to take advantage of sequence databases. These experiments rely mostly on new protocols or technologies that are still developed and practised by only a relatively small number of laboratories. However, similar biological questions may be addressed by various techniques and results may not be comparable without clear definitions, creating an urgent need for standardised criteria for different technological approaches. In addition, it is essential to provide a framework for comparison of results obtained with the same underlying technology in different laboratories. As an example, different methods have been developed for production of large scale cDNA microarrays for gene expression experiments. The use of different array systems has created a particular need for comparison of results and benchmarking between different laboratories. A publicly available database for storage and analysis of microarray data is now needed. This subject will become increasingly important, especially as the scientific community plans to have biological databases dealing with heterogeneous data to be exploited as a guide to define new biological experiments.

Similarly, analysis of the complex heterogeneous data needs to be done in a way which allows results to be related meaningfully. This can only be achieved in a multidisciplinary approach in cooperation with all the major laboratories involved in the different levels of functional genomics studies. Bioinformatics and database developments have to be in close collaboration with the experimental research teams in order to produce databases which can collect genomic data and make them accessible to the scientific community, in a similar way that sequence databases do today for sequence data. The success of functional genomics efforts will depend to a great extent on parallel approaches of standardisation both on the bioinformatics as well on the biological side.

The objectives of standardisation and benchmarking can only be achieved through extensive discussions between relevant experts and by distributing specialised tasks among the members of the programme network. Materials such as high density DNA filters or glass slides containing minimal clone sets, protein expression libraries or two-hybrid systems are very costly and resource intensive to produce, requiring in most cases expensive equipment and experienced staff. It would therefore be advantageous to have resource centres to produce such materials in a standardised way. This would produce the benefits that, firstly, all laboratories would be able to use high quality and affordable material for their research and, secondly, it would be possible to compare experimental results more easily due to the common source materials used. This could be applied similarly to the successful use of common high quality reference libraries for mapping and sequencing projects (e.g. the CEPH YAC library, Pieter de Jong´s BAC and PAC libraries, the chromosome specific cosmid libraries from LLNL and LANL, and the IMAGE clone collection) and specialised but linked databases (e.g. GDB, RZPD, EMBL, Genbank, Swissprot, dbest and others) to store data produced with these materials. The agreement to standardise and benchmark experimental techniques and analysis tools, as well as the access to standardised reference material, are the prerequisites for development of major functional genomics databases. Standardisation of mutational analysis is particularly important for the delivery of the new paradigm in healthcare, to enable reliable definition of human genotypes.

An ultimate goal of the programme would be to standardise all relevant items involved in data production and to develop databases which allow groups working in similar areas to store their data in a common 'virtual centre' and thus make them accessible to other scientists. It is essential to ensure that all data can be traced to the originating laboratory; in this way it will be possible for individual scientists to resolve conflicting data by direct comparison of methods and experimental systems. It is intended that the programme will act as a forum for standardisation of functional genomics technologies in Europe. This includes the development of database schemas as well as user-friendly graphical interfaces (preferably using the WWW) to submit, update, search, display and analyse the data and to link them to existing databases storing mapping, sequence and other structural information. The ambition would be to link hundreds of European laboratories that would agree on standard formalised presentation of results. Global integration of standards will be a major objective.

Contacts within the programme

Pierre Legrain
Chris Mundy