|
|
||||||||
Proteomics and Protein Markers |
Departments of1
Pathology and 4
Oncology, Johns Hopkins Medical Institutions, Baltimore, MD.
2 National Cancer Institute of Milan, Milan, Italy.
3 University of Texas MD Anderson Cancer Center, Houston, TX.
5 Ciphergen Biosystems, Inc., Fremont, CA.
aAddress correspondence to this author at: Department of Pathology, Johns Hopkins Medical Institutions, 422 North Bond St., Baltimore, MD 21231. Fax 410-502-7882.
| Abstract |
|---|
|
|
|---|
Methods: Using a casecontrol study design, we performed protein expression profiling on serum samples from the National Cancer Institute (Milan, Italy). The validation sample cohort consisted of 61 women with locally invasive breast cancer, 32 with DCIS, 37 with various benign breast diseases (including 13 atypical), and 46 age-matched apparently healthy women (age range, 4468 years). Validated biomarkers were purified and identified with serial chromatography, 1-dimensional gel electrophoresis, in-gel ASP-N digestion, peptide mass fingerprinting, and tandem mass peptide sequencing.
Results: The BC3 and BC2 expression patterns in this sample set were consistent with the first study sample set. BC3 and BC2 were identified to be complement component C3adesArg and a C-terminaltruncated form of C3adesArg, respectively.
Conclusions: Evaluation of biomarkers in independent sample sets can help determine the broader utility of candidate markers, and protein identification permits understanding of their molecular basis. C3adesArg appears to lack specificity among patients with benign diseases, limiting its utility as a stand-alone tumor marker, but it may still be useful in a multimarker panel for early detection of breast cancer.
| Introduction |
|---|
|
|
|---|
Most molecular-based approaches that have been investigated for the early detection of breast cancer have specific targets, such as oncogenes, tumor suppressor genes, growth factors, tumor antigens, or other gene products, but because none of these alone can account for a large majority of the breast cancers and some are not specific to cancer or to breast tissues, such approaches have low sensitivity and specificity. To date, no molecular biomarkers have been recommended for the early detection of breast cancer (3). Tumor markers cleared by the Food and Drug Administration, such as CA 15-3 and CA27.29, are recommended only for monitoring treatment of advanced breast cancer or recurrence (4).
Rather than targeting a specific abnormality that may be present in only a small subgroup of patients, common "patterns of changes" that are associated with disease status are being identified by genomics- or proteomics-based approaches. Both genomics and proteomics approaches yield high-dimensional data that can be analyzed with multivariate statistics and powerful pattern recognition algorithms. The potential drawback of these approaches is their tendency to discover patterns among many variables that may not be a direct result of the pathology of interest but rather a result of preanalytical characteristics of a given sample set. It is therefore often possible to obtain a very high classification success rate in a single-site study. Independent validation in a separate sample set, collected from a different patient population and hospital, is one way to determine the true performance of a set of markers.
Using the high-throughput surface-enhanced laser desorption/ionization time-of-flight mass spectrometry (SELDI-TOF MS) proteomics approach, we previously analyzed protein expression profiles of serum samples from 169 women with or without breast cancer and identified a panel of 3 serum proteins that could detect early-stage breast cancer (stage 0-I) with high sensitivity (93%) and specificity (91%) (5). However, that study included only 4 cases of DCIS; therefore, the utility of these markers for detecting in situ breast carcinoma could not be properly assessed. In addition, because that study examined samples taken entirely from one institution, the general validity of these markers could not be ascertained. In the present study, we performed SELDI-TOF MS protein expression profiling on samples obtained from a second hospital and included a greater number of DCIS cases. In addition, we performed purification and identification of the 2 confirmed markers.
| Material and Methods |
|---|
|
|
|---|
seldi protein profiling
Protein profiles were generated by use of Immobilized Metal Affinity Capture (IMAC-Ni) ProteinChip® arrays under binding and washing conditions as described previously (5).
bioinformatics and biostatistics
The data analysis process used in this study involved the following steps.
Peak detection.
ProteinChip Software 3.0 (Ciphergen Biosystems) was used to collect and evaluate the raw spectra. Results for each set of 196 specimens, including 176 study sera and 20 quality-control sera (pooled human sera obtained from Serologicals Corp.), were compiled, baseline-subtracted, and externally calibrated with All-In-1 Protein Standard (Ciphergen Biosystems). Qualified mass peaks (visual examination) of m/z 2000150 000 were selected manually. The peak intensities were normalized to the total ion current between m/z 2000 and 150 000 with the same external coefficient, and the data were exported to an Excel spreadsheet.
Evaluation of reproducibility.
We estimated the reproducibility of replicates by calculating the correlation of each pair of replicates and calculating the CV of the 3 reported peaks as calculated from pooled human sera. If no systematic bias was observed, the peak intensities identified in replicate analysis were averaged and then log-transformed.
Marker evaluation.
Nonparametric 2-sample comparisons were performed with the MannWhitney test. ROC curve analysis (in-house software implemented in MATLAB, Ver. 6.0) was performed for evaluation of the selected biomarkers.
protein identification
Protein purification was carried out with 100 µL of serum and a series of protein separation procedures, including IMAC, anion-exchange, and reversed-phase chromatography using sorbents available from Ciphergen-BioSepra. A schematic of the purification process is shown in Fig. 1A of the Data Supplement that accompanies the online version of this article at http://www.clinchem.org/content/vol51/issue12/. During each of the iterations, the fractions were profiled on IMAC arrays to monitor the presence or absence of the biomarkers of interest. Protein fractions that were enriched for BC2 and BC3, respectively, were separated on a 16% Tris-tricine sodium dodecyl sulfate gel. All visible bands in the correct molecular mass range were excised. The protein in each gel band (one half of the gel material) was eluted according to the passive elution protocol (6) and profiled on NP20 arrays. The identified gel bands (the other half of the gel material) containing the 8.1- and 8.9-kDa peaks were first reduced with dithiothreitol and then digested with ASP-N (Roche Diagnostics) for peptide mass fingerprinting. For confirmation of fragment sequence, the NP20 arrays containing the proteolytic fragments were analyzed by collision-induced dissociation with a Q-STAR® tandem mass spectrometry (MS/MS) instrument (Applied Biosystems/MDS Sciex) equipped with a PCI 1000 ProteinChip Interface (Ciphergen). A sequence database search was carried out with the Matrix Science Mascot program (http://www.matrixscience.com).
seldi-tof ms immunoassays for bc2 and bc3
We coupled 1.5 µg of mouse monoclonal antibody against human C3a (Chemicon) to 3 µL of Protein A HyperD® beads (Ciphergen-BioSepra). The uncoupled antibody was removed by washing 3 times with phosphate-buffered saline (PBS; 9 g/L NaCl, 0.165 g/L KH2PO4, 0.775 g/L Na2HPO4), and the beads were resuspended in 150 µL of PBS. We then added 0.5 µL of serum in 10 µL of PBS containing 0.1 g/L bovine serum albumin to the antibody beads and incubated them at 4 °C overnight. After incubation, the beads were washed 3 times with 150 µL of PBS containing 1 mL/L Triton followed by 3 washes with 150 µL of PBS and a brief rinse with water. The captured proteins were eluted twice with 30 µL of 500 mL/L acetonitrile containing 5 mL/L trifluoroacetic acid, and the pooled eluates were added to 100 µL of PBS. To achieve the optimal binding pH for IMAC30 chips (pH 7.4), 8 µL of 1 mol/L Tris (pH 10) was added to the mixture before our previously described standard binding and washing procedures.
ca 15-3
The CA 15-3 concentration was measured by IRMA-mat CA 15-3 (Byk-Sangtec Diagnostica).
| Results |
|---|
|
|
|---|
|
|
protein identification for bc2 and bc3
A schematic of the purification process for BC2 and BC3 is shown in Fig. 1A of the online Data Supplement. BC2 and BC3 coeluted in the pH 3 fraction of the IMAC-Cu column, whereas the organic fraction contained mainly BC2 (data not shown). The organic fraction was concentrated and applied directly on a sodium dodecyl sulfate gel for purification of BC2, whereas further fractionation was performed on the pH 3 fraction to obtain partially purified BC3 without interference from BC2. Gel images of the final fractions that were enriched for BC2 and BC3, respectively, and the gel bands containing each marker identified by protein profiling of the eluted proteins on NP20 arrays are shown in Fig. 1B of the online Data Supplement. BC3 was subsequently determined to be the anaphylatoxin C3a lacking the C-terminal arginine (C3adesArg; sequence SVQLTEKRMDKVGKYPKELRKCCEDGMRENPMRFSCQRRTRFISLGEACKKVFLDCCNYITELRRQHARASHLGLA). The underlined regions are sequence covered by MS/MS analysis of peptides 1091 Da (DKVGKYPKELRKCCE), 1909 Da (SVQLTEKRMDKVGKYPKELRKCCE), and 2641 Da (DCCNYITELRRQHARASHLGLA). The MS/MS analysis of peptide 1909 Da is shown in Fig. 1C of the online Data Supplement (data not shown for peptides 1091 and 2641 because of page limitations). The predicted mass of C3adesArg is 8923 Da, consistent with the measured mass of BC3 (8926 Da), and the predicted pI is 9.54, consistent with the inability of BC3 to bind anion-exchange resin at pH 9.0.
Similarly, BC2 (m/z 8116) was identified as a truncated form of C3adesArg by peptide mass fingerprinting and MS/MS analysis (data not shown). It was a C-terminal truncation of C3adesArg, lacking the C-terminal sequence RASHLGLA; we refer to this marker as C3adesArg
8.
immunoassays for bc2 and bc3
The identities of BC2 and BC3 as C3adesArg
8 and C3adesArg were further verified by SELDI-TOF MS immunodepletion using a monoclonal antibody against C3a (Fig. 3
). A subgroup of the serum samples (10 healthy, 9 benign, 10 DCIS, and 10 invasive cases) was randomly selected for an immunodepletion pull-down experiment using the same antibody. Distribution of the captured C3adesArg and C3adesArg
8 in the cancer and noncancer groups was consistent with the SELDI profiling results (Fig. 4
). The correlation coefficients between the SELDI immunoassay and profiling for BC2 and BC3 were 0.65 and 0.56, respectively.
|
|
diagnostic performance of bc2, bc3, and ca 15-3
To investigate whether CA 15-3 has any discriminatory power in this study cohort, we measured the serum CA 15-3 concentrations with IRMA-mat CA 15-3. Of 93 cancer sera tested, only 5 (all from patients with invasive cancer) would have been called positive according to the standard 30 units/mL cutoff (sensitivity, 5%). We observed no significant difference between the healthy control, benign, DCIS, and invasive cancer groups (data not shown). The diagnostic performances of BC2, BC3, and CA 15-3 in terms of ROC analysis are presented in Fig. 5
. The areas under the curves for C3adesArg and C3adesArg
8 in the validation data were 0.65 and 0.71, respectively. In contrast, the area under the curve for CA 15-3 was 0.52.
|
| Discussion |
|---|
|
|
|---|
In a previous study we selected 3 protein/peptides, BC1, BC2, and BC3, as potential biomarkers for breast cancer early detection and used several statistical measures to minimize the impact of analytical variability (5), but because the study did not have a completely independent test set, these markers required further validation. In this study, we evaluated these markers on samples obtained from a second hospital and included a greater number of DCIS cases. We were able to validate the expression patterns of 2 of the markers, BC2 and BC3, and identified them as complement component C3adesArg
8 and C3adesArg, respectively.
The human complement (C) system protects an individual against substances of nonself origin, including xenografts and microbial pathogens (7). There are 25 known soluble complement proteins, as well as 10 or more cell-surface complement receptors and regulatory proteins present on a wide range of host cells. Among these complement proteins, C3 is the most versatile and multifunctional molecule identified to date, having evolved structural features that allow it to interact in a specific manner with at least 25 different proteins (8). C3 is also the most abundant complement protein in serum (1.2 g/L) and supports the activation of all 3 pathways of complement activation, the classic, alternative, and lectin pathways (9)(10). On activation by molecules on the surfaces of the microorganisms or other nonself signals, native C3 (185 kDa) is cleaved into C3b (176 kDa) and C3a (9 kDa). C3a is very short lived and in serum is cleaved immediately into the more stable C3adesArg (8.9 kDa) by carboxypeptidases (9). Cleavage of C3 is functionally important because it causes a series of events that lead to complement-mediated lysis (10)(11)(12). C3a, together with C4a and C5a, the small fragments of C3, C4, and C5, respectively, are called the complement anaphylatoxins. In addition to anaphylaxis and chemotaxis, they also have demonstrated immunomodulatory effects on several different cell types (13).
Complements have long been thought to function in immunosurveillance against tumors, but the molecular mechanisms by which malignant cells are tagged by homologous C3 and the molecules responsible for homologous C activation are largely unknown. Whatever mechanism may be involved, increases in C3adesArg as well as C3adesArg
8 (presumably derived from further truncation of C3adesArg) in serum of women with breast cancer or benign breast diseases indicates complement activation in these patients.
Complement abnormalities have been described in various types of cancer. Higher complement hemolytic activity and C3 concentrations were observed in serum samples from children with neuroblastoma (14). Similarly, increased complement concentrations were reported in patients with lung (15)(16), digestive tract (17), and brain(18) tumors. It is likely that complement activation is an early event during tumorigenesis because the opposite phenomenon was observed in patients with advanced-stage metastatic brain tumors (18) and in patients of breast, gastric, and colorectal carcinomas who developed metastases after the surgery (19). Interestingly, we have also observed down-regulation of C3adesArg and C3adesArg
8 in pretreatment sera of breast cancer patients with distant metastasis (unpublished result), in contrast to their up-regulation in sera of patients with DCIS or locally invasive breast cancer shown in this and previous (5) work.
For BC1 (4.3 kDa), we observed an inconsistent expression pattern in the 2 studies. BC1 has been identified as a truncated form of interalpha-trypsin inhibitor heavy chain H4; Li et al., unpublished result). Our preliminary result indicates that BC1 is unstable and that further truncation occurs during prolonged storage. The instability of BC1 and variations on sample storage time are most likely the causes of the discrepancies we observed in the 2 data sets. In the original discovery study, serum samples from cancer patients were collected during a longer time interval than the control samples, whereas samples from the validation set were all collected within the same 2-year window. Although BC1 may still be a valid marker if serum were collected prospectively and stored under the same conditions, we are currently evaluating other fragments of the same protein for a more stable alternative to BC1 (unpublished result).
In summary, we used independent test data to evaluate 3 previously discovered potential serum biomarkers for breast cancer. Although several biomarker panels that use SELDI and ProteinChip arrays have been reported for various diseases (5)(20)(21)(22)(23)(24)(25)(26)(27)(28)(29), this report highlights one of the first validation studies that uses an independent test set. We identified 2 markers that showed consistent expression patterns in both cohorts, complement C3adesArg
8 and C3adesArg. Because C3a is increased in the sera of patients with breast cancer as well as benign breast diseases, it is likely that our results reflect activation of the complement system in response to a general disease process rather than proteins released by the tumor cells themselves. The apparent lack of specificity of C3adesArg among patients with benign diseases may limit its utility as a stand-alone tumor marker, but it may still be useful in a multimarker panel for the early detection of breast cancer. The applications and possible shortcomings of MS for clinical proteomics and biomarker discovery have been discussed extensively (30). It is true that with limited mass range coverage, it is unlikely that existing tumor markers such as CA125 could be directly detected by the current SEDLI platform. However, in this report we demonstrate that with proper study design and well-controlled sample-preprocessing protocols, it is possible to discover and validate biomarkers with consistent performance across multiple sites.
| Acknowledgments |
|---|
| Footnotes |
|---|
| References |
|---|
|
|
|---|
, IL-6, and the polyclonal immune response. J Immunol 1997;159:4279-4286.[Abstract]
The following articles in journals at HighWire Press have cited this article:
![]() |
T. Hori, Y. Naishiro, H. Sohma, N. Suzuki, N. Hatakeyama, M. Yamamoto, T. Sonoda, Y. Mizue, K. Imai, H. Tsutsumi, et al. CCL8 is a potential molecular candidate for the diagnosis of graft-versus-host disease Blood, April 15, 2008; 111(8): 4403 - 4412. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Zhang and D. W. Chan Cancer Biomarker Discovery in Plasma Using a Tissue-targeted Proteomic Approach Cancer Epidemiol. Biomarkers Prev., October 1, 2007; 16(10): 1915 - 1917. [Full Text] [PDF] |
||||
![]() |
J. F. Timms, E. Arslan-Low, A. Gentry-Maharaj, Z. Luo, D. T'Jampens, V. N. Podust, J. Ford, E. T. Fung, A. Gammerman, I. Jacobs, et al. Preanalytic Influence of Sample Handling on SELDI-TOF Serum Protein Profiles Clin. Chem., April 1, 2007; 53(4): 645 - 656. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Nakagawa, S. K. Huang, S. R. Martinez, A. N. Tran, D. Elashoff, X. Ye, R. R. Turner, A. E. Giuliano, and D. S.B. Hoon Proteomic Profiling of Primary Breast Cancer Predicts Axillary Lymph Node Metastasis Cancer Res., December 15, 2006; 66(24): 11825 - 11830. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Nedelkov, U. A. Kiernan, E. E. Niederkofler, K. A. Tubbs, and R. W. Nelson Population Proteomics: The Concept, Attributes, and Potential for Cancer Biomarker Research Mol. Cell. Proteomics, October 1, 2006; 5(10): 1811 - 1818. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Du, W. A. Kibbe, and S. M. Lin Improved peak detection in mass spectrum by incorporating continuous wavelet transform-based pattern matching Bioinformatics, September 1, 2006; 22(17): 2059 - 2065. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. L. Hortin The MALDI-TOF Mass Spectrometric View of the Plasma Proteome and Peptidome Clin. Chem., July 1, 2006; 52(7): 1223 - 1237. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. L. Hortin, S. A. Jortani, J. C. Ritchie Jr, R. Valdes Jr, and D. W. Chan Proteomics: A New Diagnostic Frontier Clin. Chem., July 1, 2006; 52(7): 1218 - 1222. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Kepper, R. Reinhardt, A. Dahl, H. Lehrach, and S. Sauer Matrix-Assisted Laser Desorption/Ionization Mass Spectrometric Analysis of DNA on Microarrays Clin. Chem., July 1, 2006; 52(7): 1303 - 1310. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. P. Diamandis Serum Proteomic Profiling by Matrix-Assisted Laser Desorption-Ionization Time-of-Flight Mass Spectrometry for Cancer Diagnosis: Next Steps Cancer Res., June 1, 2006; 66(11): 5540 - 5541. [Full Text] [PDF] |
||||
![]() |
E. P. Diamandis Validation of breast cancer biomarkers identified by mass spectrometry. Clin. Chem., April 1, 2006; 52(4): 771 - 772. [Full Text] [PDF] |
||||
![]() |
J. Li, Z. Zhang, and D. W. Chan Validation of Breast Cancer Biomarkers Identified by Mass Spectrometry: Some of the authors of the article cited above respond: Clin. Chem., April 1, 2006; 52(4): 772 - 772. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |