CPTC Reagents Data Portal

Discovery Stage

Data Analysis/Acquisition Discovery Proteomics Pipeline

Figure captions for software pipelines:  discovery

One of the major outputs from the Clinical Proteomic Technologies for Cancer initiative has been the development of software analysis tools.  Analysis of mass spectrometry data for protein identification includes a number of steps, as depicted above.  Briefly, the raw mass spectra is first processed to improve the quality of the spectra.  Poor spectra are discarded.  Next, peptides are identified from the spectra.  If one is searching for post-translational modifications, those would also be identified at this point.  After that, protein identities are inferred from the identified peptides.  Finally, a number of quantitative and semi-quantitative methods are available to differentiate proteins upregulated in specific disease states.  These disease-linked proteins may then comprise a biomarker candidate list.  See below for further descriptions of these tools.

Data Pre-processing

   Peptide ID

   PTM Assignment

   Protein ID

   ID-based Differentiation

   Intensity-based Differentiation