학술논문

Defining intact protein primary structures from saliva: a step toward the human proteome project
Document Type
Author abstract
Report
Source
Analytical Chemistry. May 15, 2012, Vol. 84 Issue 10, p4383, 13 p.
Subject
Usage
Research
Composition
Mass spectrometry -- Usage
Protein structure -- Research
Proteomics -- Research
Saliva -- Composition
Post-translational modifications
Proteins -- Structure
Proteins -- Research
Salivary glands -- secretions
Salivary glands -- Composition
Post-translational modification
Language
English
ISSN
0003-2700
Abstract
Top-down mass spectrometry has been used to investigate structural diversity within some abundant salivary protein families. In this study, we report the identification of two isoforms of protein II-2 which differed in mass by less than 1 Da, the determination of a sequence for protein IB8a that was best satisfied by including a mutation and a covalent modification in the C-terminal part, and the assignment of a sequence of a previously unreported protein of mass 10433 Da. The final characterization of Peptide P-J was achieved, and the discovery of a truncated form of this peptide was reported. The first sequence assignment was done at low resolution using a hybrid quadrupole time-of-flight instrument to quickly identify and characterize proteins, and data acquisition was switched to Fourier-transform ion cyclotron resonance (FTICR) for proteins that required additional sequence coverage and certainty of assignment. High-resolution and high mass accuracy mass spectrometry on a FTICR-mass spectrometry (MS) instrument combined with electron-capture dissociation (ECD) provided the most informative data sets, with the more frequent presence of "unique" ions that unambiguously define the primary structure. A mixture of predictable and unusual post-translational modifications in the protein sequence precluded the use of shotgun-annotated databases at this stage, requiring manual iterations of sequence refinement in many cases. This led us to propose guidelines for an iterative processing workflow of MS and MSMS data sets that allow researchers to completely assign the identity and the structure of a protein.