A New Strategy of Protein Identification in Proteomics

Xinmin Yin, CS Dept., Ball State Univ.

Contents • Background. • Problems. • Proposal. • Conclusion.

What’s Proteomics? The use of quantitative protein-level measurements of gene expression to characterize biological processes and to decipher the mechanisms of gene expression control.

What’s Proteomics? DNA → (transcription) → RNA → (translation) → Protein

Current Research in Proteomics Quantitative measurement of proteins expressed in a cell • Protein identification. • Protein quantification.

Experimental Flow Chart of Protein Identification cell → proteins → peptides → mass & sequence → data processing

Schematic of MALDI MS/MS (MALDI: Matrix-Assisted Laser Desorption/Ionization) Components shown: Laser, Peptides, E1, MS1, Detector1, E2, MS2, Detector2

An Example of MALDI MS/MS Spectrum

Problem of Scaling up MALDI MS/MS to Proteomics The number of peptides is huge. Human genome (~60,000 proteins): 2.6 × 10^6 peptides. It will take THREE MONTHS to identify all proteins in a human cell.

My Proposal To develop a new data analysis method • Avoid using MS2 if a peptide can be identified in MS1. • Reduce the size of the peptide database during data analysis.

Facts of MALDI MS/MS Experiment • 40% of peptides are unique in mass when the mass accuracy of MALDI is better than 10 ppm. • Analyzing an MS1 spectrum is much easier than analyzing an MS2 spectrum. • It takes about 1 to 2 min to analyze a single MS2 spectrum against the database.

My Proposal • Use a local protein database for data analysis. • Develop a peptide filter. • Automate data transfer from MS1 to MS2.

Local Protein Database PURPOSE: 1. Keep the peptide database small by tailoring it to the experimental methods. 2. Do online data processing. • Download the protein database from public sites. • Digest the proteins in the computer. • Select peptides according to the experimental methods, and store them in memory. • Calculate peptide chemical modifications.
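The mass bookkeeping behind these steps can be sketched in C++ as follows. This is a minimal illustration, not the author's program: the function name `peptideMass`, the `fixedMods` parameter, and the choice of monoisotopic masses are assumptions. A chemical modification is modeled as a fixed mass shift added to every occurrence of a residue.

```cpp
#include <cassert>
#include <map>
#include <stdexcept>
#include <string>

// Monoisotopic residue masses (Da) of the 20 standard amino acids.
inline const std::map<char, double>& residueMasses() {
    static const std::map<char, double> masses = {
        {'G', 57.02146},  {'A', 71.03711},  {'S', 87.03203},  {'P', 97.05276},
        {'V', 99.06841},  {'T', 101.04768}, {'C', 103.00919}, {'L', 113.08406},
        {'I', 113.08406}, {'N', 114.04293}, {'D', 115.02694}, {'Q', 128.05858},
        {'K', 128.09496}, {'E', 129.04259}, {'M', 131.04049}, {'H', 137.05891},
        {'F', 147.06841}, {'R', 156.10111}, {'Y', 163.06333}, {'W', 186.07931}};
    return masses;
}

// Hypothetical helper: neutral monoisotopic mass of a peptide.
// fixedMods maps a residue letter to a mass shift (Da) applied to every
// occurrence of that residue; it is empty by default.
double peptideMass(const std::string& sequence,
                   const std::map<char, double>& fixedMods = {}) {
    assert(!sequence.empty());
    const double kWater = 18.01056;  // one H2O for the free N- and C-termini
    double mass = kWater;
    for (char residue : sequence) {
        auto it = residueMasses().find(residue);
        if (it == residueMasses().end()) {
            throw std::invalid_argument("unknown residue in peptide sequence");
        }
        mass += it->second;
        auto mod = fixedMods.find(residue);
        if (mod != fixedMods.end()) {
            mass += mod->second;  // add the modification's mass shift
        }
    }
    return mass;
}
```

For example, `peptideMass("GG")` sums two glycine residues plus one water, and `peptideMass("C", {{'C', 57.02146}})` adds a carbamidomethyl shift to cysteine. The choice of modification here is purely illustrative.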

Generating Peptide Database Protein Database → (enzyme digestion) → Peptide Database. Each protein can produce 30 to 100 peptides.
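A minimal sketch of in-computer digestion is given below. The slides do not name the enzyme, so trypsin is assumed here, with the usual rule of cleaving after K or R unless the next residue is P. Missed cleavages are ignored, and the function name `digestTrypsin` is hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Split a protein sequence into peptides using the assumed trypsin rule:
// cleave after K or R, except when the following residue is P.
std::vector<std::string> digestTrypsin(const std::string& protein) {
    std::vector<std::string> peptides;
    std::string current;
    for (std::size_t i = 0; i < protein.size(); ++i) {
        current += protein[i];
        const bool cleavageSite = (protein[i] == 'K' || protein[i] == 'R');
        const bool blockedByProline =
            (i + 1 < protein.size() && protein[i + 1] == 'P');
        if (cleavageSite && !blockedByProline) {
            peptides.push_back(current);
            current.clear();
        }
    }
    if (!current.empty()) {
        peptides.push_back(current);  // C-terminal peptide
    }
    // Sanity check: digestion must neither drop nor duplicate residues.
    std::size_t totalLength = 0;
    for (const std::string& p : peptides) {
        totalLength += p.size();
    }
    assert(totalLength == protein.size());
    return peptides;
}
```

With this rule, `digestTrypsin("AAKGGRPLKMM")` yields the three peptides `AAK`, `GGRPLK`, and `MM`, because the R followed by P is not cleaved.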

Peptide Filter PURPOSE: 1. Do online data processing. 2. Reduce the size of the peptide database. 3. Reduce the number of peptides passed to MS2. • Identify peptides by mass alone. • Sort identified peptides according to their proteins. • Delete the remaining peptides of an identified protein from the peptide database. • Pass unidentified mass information to MS2.

Identifying Peptide by Mass Alone ΔM = |Mth − Mexp|, where Mth is the theoretical peptide mass and Mexp is the experimental mass. If (ΔM < mass accuracy of MS1) Npep++. Npep: number of mass-matched peptides.
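The counting rule above can be sketched as a short C++ function. The name `countMassMatches` and the decision to express the MS1 mass accuracy in ppm (as in the 10 ppm figure quoted earlier) are assumptions made for illustration.

```cpp
#include <cassert>
#include <cmath>
#include <vector>

// Count the theoretical peptide masses that lie within the MS1 mass
// accuracy (given in ppm) of one experimental mass.
int countMassMatches(const std::vector<double>& theoreticalMasses,
                     double experimentalMass, double tolerancePpm) {
    assert(experimentalMass > 0.0 && tolerancePpm >= 0.0);
    // Convert the relative tolerance (ppm) into an absolute window in Da.
    const double toleranceDa = experimentalMass * tolerancePpm * 1e-6;
    int nPep = 0;
    for (double mTh : theoreticalMasses) {
        const double deltaM = mTh - experimentalMass;
        if (std::fabs(deltaM) < toleranceDa) {
            ++nPep;
        }
    }
    return nPep;
}
```

At 10 ppm, an experimental mass of 1000 Da gives a window of ±0.01 Da, so theoretical masses of 1000.000 and 1000.005 both match while 1000.020 does not.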

Reducing Size of Peptide Database If (Npep == 1): the peptide is unique, so delete the remaining peptides of the same protein from the database. Else: send the peptide to MS2 for sequence information.
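One way to sketch this decision in C++ is shown below, using `std::list` as the linked list. The `Peptide` fields, the function name `filterByMass`, the ppm tolerance, and the `-1` return value meaning "send to MS2" are all illustrative assumptions. The sketch removes every peptide of the identified protein, including the matched one.

```cpp
#include <cassert>
#include <cmath>
#include <list>
#include <string>

// Hypothetical peptide record: sequence, theoretical mass, parent protein.
struct Peptide {
    std::string sequence;
    double mass;
    int proteinId;
};

// Returns the id of the identified protein when exactly one peptide matches
// the experimental mass, after erasing all peptides of that protein.
// Returns -1 when the mass is not unique and must be passed on to MS2.
int filterByMass(std::list<Peptide>& database, double experimentalMass,
                 double tolerancePpm) {
    assert(experimentalMass > 0.0 && tolerancePpm >= 0.0);
    const double toleranceDa = experimentalMass * tolerancePpm * 1e-6;
    int nPep = 0;
    int matchedProtein = -1;
    for (const Peptide& p : database) {
        if (std::fabs(p.mass - experimentalMass) < toleranceDa) {
            ++nPep;
            matchedProtein = p.proteinId;
        }
    }
    if (nPep != 1) {
        return -1;  // zero or several matches: MS2 sequence data is needed
    }
    // Unique match: the protein is identified, so drop all of its peptides.
    database.remove_if([matchedProtein](const Peptide& p) {
        return p.proteinId == matchedProtein;
    });
    return matchedProtein;
}
```

Each identification shrinks the database, so later experimental masses are compared against fewer candidates.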

Flowchart

Program Design Computer language: C++. Classes: Protein, Peptide, Linked list.

Conclusion 1. My program will generate a list of peptides from the protein database. 2. Peptide identification is carried out while the experiment is running. 3. Data analysis time will be significantly reduced. 4. This program will make the identification of proteins from a cell possible and reliable.
