pad.png

  Menu

 

   E4.2 Project title: Quantitation, processing and classification algorithms for mass spectrometry

PDF Print E-mail
Project leader      
Péter Horvatovich
Address



Analytical Biochemistry, Department of Pharmacy
Faculty of Mathematics and Natural Sciences    
Antonius Deusinglaan 1
9713 AV  Groningen
Phone +31 (0)50 3633341
Fax +31-(0)50-363 7582
E-mail This e-mail address is being protected from spambots. You need JavaScript enabled to view it


Summary
Accurate data processing and statistical analysis of label-free and stable isotope labeled LC-MS data used in comparative profiling is of primary importance for proper interpretation of proteomics experiences and to suitably answer the underlying biological questions. In addition to the need of the development of specific algorithms solving particular problems related to the data properties, the performance of data processing algorithms and statistical analysis have to be evaluated in a systematic manner.
The project proposal intends to develop modularization of existing open-source data processing pipelines within a framework enabling to interconnect those modules combined with a mechanism to select the best combination of data processing modules for given datasets using spiking experiences.
Development of time alignment algorithms enabling proper analysis of charge state and isotopic distribution shifted LC-MS data and dataset obtained from other laboratories with different sample preparation techniques having a low fraction of compound overlap, will enable to expand accurate quantitative proteomics for a wider range of proteomics applications and will increase knowledge extraction for life science research. This will be supported by developing data integration techniques for data obtained from multidimensional techniques. Finally specific bioinformatics tools for quality control of data processing steps to increase the measured dynamic concentration range of quantitative proteomics techniques will be developed.
Statistical simulation assisted through spiked datasets will enable us to asses the global and individual performance of data processing, statistical analysis and validation modules. Statistical simulation using analytical and biological variance, measured dynamic concentration range and a noise model obtained from real datasets will enable us to select the best statistical strategy to detect class compound concentration differences.