|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Proteomics and Protein Markers |
1 National Center for Glycomics and Glycoproteomics, Department of Chemistry, Indiana University, Bloomington, IN; 2 Department of Medicine, Indiana University School of Medicine, Indianapolis, IN; 3 Breast Cancer Center, Greater Baltimore Medical Center, Baltimore, MD; 4 Indiana University Cancer Center, Indianapolis, IN.
aAddress correspondence to these authors at: Department of Chemistry, Indiana University, 800 E. Kirkwood Ave., Bloomington, IN 47405. Fax 812-855-8300; e-mail novotny{at}indiana.edu, ymechref{at}indiana.edu.
| Abstract |
|---|
|
|
|---|
Methods: We performed specific MALDI mass spectrometry (MS)-based glycomic profile analyses of permethylated glycans in sera from breast cancer patients (12, stage I; 11, stage II; 9, stage III; and 50, stage IV) along with sera from 27 disease-free women. The serum glycoproteins were enzymatically deglycosylated, and the released glycans were purified and quantitatively permethylated before their MALDI-MS analyses. We applied various statistical analysis tools, including ANOVA and principal component analysis, to evaluate the MS profiles.
Results: Two statistical procedures implicated several sialylated and fucosylated N-glycan structures as highly probable biomarkers. Quantitative changes according to a cancer stage resulted when we categorized the glycans according to molecular size, number of oligomer branches, and abundance of sugar residues. Increases in sialylation and fucosylation of glycan structures appeared to be indicative of cancer progression. Different statistical evaluations confirmed independently that changes in the relative intensities of 8 N-glycans are characteristic of breast cancer (P < 0.001), whereas other glycan structures might contribute additionally to distinctions in the statistically recognizable patterns (different stages).
Conclusions: MS-based N-glycomic profiling of serum-derived constituents appears promising as a highly sensitive and informative approach for staging the progression of cancer.
| Introduction |
|---|
|
|
|---|
Certain links between cancer diseases and altered protein glycosylation have been noted for several years (12)(13)(14)(15)(16) for both N-linked and O-linked glycoconjugates, whereas the most advanced knowledge of abnormal glycosylation in this set of diseases has primarily come from the investigations of tumor tissues and cell cultures. Detecting cancer-related changes in patients blood has been a distinct goal of more recent studies but has been less commonly pursued because of methodological difficulties with high-sensitivity measurement of glycans. Coincidentally, some recently identified cancer biomarkers in human serum are glycoproteins (17)(18)(19)(20), usually the mucin-type, large molecules.
In this study, we report that the profiles of N-glycans released from glycoproteins of human serum are highly indicative of the different conditions of breast cancer. Using small (10 µL) serum aliquots, we performed sensitive and quantitative mass spectrometry (MS)1 measurements of the constituent profiles of N-glycans originating from circulating proteins. These profiles could be further evaluated statistically through pattern recognition techniques [principal component analysis (PCA)] in terms of breast cancer disease stage. The N-glycan data cluster remarkably well for different stages, which further differ from a data set recorded from individuals apparently free of the disease. The clusters readily distribute into groups that correlate with the stage of breast disease as determined by pathological assessment. This type of clinically useful information could be obtained from a small volume of blood serum, without biopsy or tumor removal. To implicate certain oligosaccharide structures as biomarkers, we carried out additional statistical evaluations (data mining) through nonparametric ROC and ANOVA analyses for approximately 50 individual N-glycans. Finally, for a comparison, we also examined the glycomic profiles of breast cancer cell lines, both invasive and noninvasive, to determine whether they resemble the glycan patterns derived from patient specimens.
| Materials and Methods |
|---|
|
|
|---|
blood serum samples and clinical diagnosis
Blood serum collections from disease-free female volunteers and women diagnosed with different breast cancer stages were performed by a clinical team according to an institutional review board–approved clinical trial. Venous blood samples were taken during the morning fasting state, with minimal stasis in evacuated tubes. After at least 30 min, but within 2 h, the tubes were centrifuged at 20 °C for 12 min at 1200g. Sera were stored frozen in plastic vials at –80 °C before use in consecutive measurements.
We generated glycomic profiles for samples derived from women in one of two categories: (1) disease-free woman at low risk for developing breast cancer and (2) postmenopausal women with confirmed disease stratified according to their having noninvasive or invasive breast cancer. We divided breast cancer patients into 4 subgroups (I–IV) based on severity of disease (21). We collected samples from 27 healthy individuals and from 12 patients in stage I, 11 in stage II, 9 in stage III, and 50 in stage IV. A detailed characterization of the healthy volunteers and patients with their clinical diagnoses can be found in Tables 1–2 in the Data Supplement that accompanies the online version of this article at http://www.clinchem.org/content/vol54/issue7.
cancer cell lines
We lysed cancer cell lines (normal mammary epithelial, MCF10A; invasive, MDA-MB-231, MDA-MB-435; noninvasive, 578T, ADR-RES, BT549, and T47D) in a CHAPS-based buffer (150 mmol/L NaCl, 0.5% CHAPS, 50 mmol/L Tris (pH 7.5), 10 mmol/L sodium pyrophosphate, 1 mmol/L EDTA, 1 mmol/L EGTA). We then subjected lysates to ultracentrifugation at 16 400g, separating the samples into cytosolic and membrane-associated proteins, and 200-µg protein aliquots (determined by Bradford assay) underwent the same procedures described for blood serum.
trypsin digestion
Human serum samples were reduced and alkylated before the addition of trypsin as described (22). Briefly, 10 µL of human serum was dried and then resuspended in 25 µL of 25 mmol/L ammonium bicarbonate, 25 µL trifluoroethanol, and 2.5 µL of 200 mmol/L DTT. We incubated the samples for 45 min at 60 °C, added 10 µL of 200 mmol/L IAA, and left the samples at room temperature for 1 h in the dark. We added 2.5 µL DTT to remove excess IAA and again left the samples for 1 h in the dark. Finally, we added 300 µL water to dilute samples and 100 µL ammonium bicarbonate stock solution to adjust pH. Proteolytic digestion was performed using 1 µg/µL [or 1:50 (wt/wt) ratio] proteomics-grade trypsin dissolved in 1 mmol/L HCl and incubated at 37 °C overnight (at least 18 h). Afterward, enzyme activity was quenched by incubation at 95 °C for 10 min and allowed to cool at room temperature.
release of n-glycans from glycoproteins and purification
The N-glycans were enzymatically released from human serum samples as previously described (23). Briefly, we added 5 mU PNGase F to the reaction mixture, which was subsequently incubated overnight (18–22 h) at 37 °C. We removed deglycosylated peptides from the reaction mixture using C18 Sep-Pak® cartridges (Waters) that were preconditioned with ethanol and deionized water. We further purified the aqueous eluent containing released N-glycans using activated charcoal microcolumns (Harvard Apparatus). The columns were conditioned with acetonitrile and equilibrated with 0.1% trifluoroacetic acid (TFA). After trapping of the diluted sample, the microcolumns were washed with 0.1% TFA, and the samples were eluted in a minimum volume of 50% acetonitrile containing 0.1% TFA.
capillary permethylation
The N-glycans derived from human blood serum samples, as described above, were finally dried under vacuum and subsequently permethylated using a capillary approach, which involves the use of fused silica capillary reactor (500 µm i.d.; Polymicro Technologies) packed with sodium hydroxide beads (22). We used a Hamilton 100-µL syringe and a syringe pump from KD Scientific to introduce the sample into the reactor. Dried samples were resuspended in 50 µL DMSO to which 0.3 µL water and 22 µL methyl iodide were added before infusion through reactors. This permethylation procedure minimizes oxidative degradation and "peeling" reactions and prevents the need for excessive clean-up (22). Finally, permethylated N-glycans were extracted with chloroform and washed repeatedly with water.
maldi-tof/tof ms instrumentation
We resuspended dried permethylated samples in 2 µL methanol:water solution (50:50) containing 1 mmol/L sodium acetate and spotted 0.5-µL sample aliquots directly on the MALDI plate, mixed with an equal volume of 2,5-DHB matrix [prepared by suspending 10 mg of DHB in 1 mL water:methanol (50:50)], and dried under vacuum.
Mass spectra were acquired in the positive-ion mode on the Applied Biosystems 4700 Proteomic Analyzer. This instrument is equipped with Nd:YAG (355 nm) laser. Argon was used as a collision gas in the tandem MS measurements, and the collision cell pressure was set to 6.5 x 10–6 torr. The acquired spectra were the average of 1000 laser shots.
data evaluation
We further processed MS data using DataExplorer 4.0 (Applied Biosystems) and an software tool developed in-house (PeakCalc 2.0). We performed PCA using MarkerView (ABI), allowing the visualization of multivariate information. We used supervised PCA methods with a prior knowledge of the sample groups, such as healthy vs diseased. MS data were weighted using the natural logarithm of the peak intensities. The peak intensities were also scaled using the Pareto option, in which the average value is subtracted from each value and the difference divided by the square root of the standard deviation. This option is suited for MS data, since it prevents intense peaks from completely dominating PCA. Variation within groups was expressed as SE (24).
We also used ROC curve analysis AccuROC 2.5 (Accumetric Corporation) to assess the sensitivity and selectivity of the potential diagnostic variables, and data were statistically analyzed using a single-factor ANOVA test. The difference between the 2 groups of data was considered statistically significant when P values were <0.05.
| Results |
|---|
|
|
|---|
We were able to identify and quantify approximately 50 different N-glycan structures, which cover all typical structural types, including high-mannose, hybrid, and complex-type entities. As shown previously (26), the high-energy collision process used in this type of tandem mass spectrometry (MS/MS) yields reliable structural identification of each recorded glycan. MS/MS was employed here to identify the majority of the structures (data not shown). In comparing visually the glycomic profiles of the healthy individuals with those suffering from breast cancer, we were able to note that, within the molecular mass range of 1500– 5000 m/z, there was an overall decrease in abundance of smaller N-glycans (m/z: 1500–2700) compensated by an increase in the abundance of larger structures. Representative profiles are illustrated in Fig. 1
, A and B, with a healthy profile compared to stage I and IV, respectively. These relative ion intensity changes in a profile (small N-glycans changing into larger structures due to different addition of sugar residues) were enhanced from stage I to stage IV. This general observation is consistent with a published report (13) by Rye and Walker as well as more recent studies on glycosylation in tumor tissues (6)(8)(9). Obviously, the trend of adding fucosyl and sialyl residues to certain glycoprotein structures in tumor tissues, observed in these investigations, was also observed here.
|
pca of n-glycan profiles
In assessing more precisely the patterns of N-glycans that could be characteristic for specific cancer conditions, we turned to PCA, which is a chemometric tool designed to reduce the dimensionality in a data set by defining a reduced set of projection axes that allow a maximum amount of the variance information originally in the data set to be retained (27). The acquired profile data, measured for all subjects (healthy and breast cancer patients) were subjected to PCA as described above (Fig. 2
) followed by cluster analysis. The numbers of subjects in the early stages of breast cancer (stage I, 12; stage II, 11; stage III, 9) were lower than for stage IV (50) and individuals free of the disease (27). The final clustering of our data indicates that there are significant differences in the glycomic profiles of healthy individuals in comparison to those of individuals in the early stages of breast cancer. This observation supports the notion of the diagnostic potential for recognizing different stages of breast cancer and perhaps even detecting early onset through glycomic analysis of serum specimens. Data clustering was achieved despite the heterogeneity of the sample population studied here. This aspect is believed to be an advantage, since such heterogeneity reflects a true representation of a breast cancer population. However, the presence of such heterogeneity might also prompt challenging data interpretation.
|
glycan potential biomarker identification through data mining
We performed a further statistical evaluation of putative biomarkers through nonparametric ROC procedure (28)(29)(30). Fig. 3
shows some examples of ROC analysis of N-glycan markers, comparing healthy individuals and stage IV breast cancer patients. Thus, the N-glycan with m/z 3864 had an area under the curve (AUC) value of 0.97, indicating its high diagnostic accuracy (Fig. 3A
), whereas another structure (m/z 2111) had an AUC value of only 0.49, similar to random variation (Fig. 3C
). An AUC value of 0.88 was calculated for the N-glycan with m/z 1835, making it only a moderately accurate test (Fig. 3B
).
|
In Table 1
, we list the AUCs for the N-glycans that were evaluated by ROC analysis and their P values from single-factor ANOVA. We have taken into consideration only the N-glycans for which comparison of nondiseased and diseased experimental groups yielded P values <0.05.
|
A further correlation of the favorable AUC values with P values from ANOVA implicates 8 N-glycans whose relative intensity changes profoundly during breast cancer development (Fig. 4A
). These particular N-glycans thus represent strong biomarker candidates, as the occurrence of their specific structures has been confirmed by 2 independent statistical approaches, both accepted widely in the biomedical literature.
|
Whereas data on glycan modifications in cancer cell lines and tumors, including fucosylation and sialylation, are available (31)(32)(33)(34)(35), it seemed prudent to supplement our investigations on serum glycan levels by the parallel analyses of a human mammary epithelium cell (control) and cancer cell lines (invasive and noninvasive) using the same analytical procedure. Examining the 8 N-glycan structures implicated in the blood serum as pertinent to cancer, we found similar trends for 6 of these structures in the extracts prepared from invasive (MDA-MB-231, MDA-MB-435) and noninvasive (578T, ADR-RES, BT549, T47D) breast cancer cells (Fig. 4B
). The invasive cell lines, capable of forming tumor metastases distant from their original site of transformation, were found to have a more dramatic change in the glycomic pattern compared with the noninvasive cells. Moreover, N-glycans associated with m/z 3951 and 4226 are not observed in normal cell lines, suggesting their potential role during the tumorigenesis process.
structural correlations
Grouping N-glycan profile constituents into categories according to molecular size, number of antennas, and sugar residue abundance (Fig. 5
) can be indicative of certain general trends of disease progression. The overall trend in increased N-glycan size due to the addition of sialic acid residues and enhanced fucosylation is clearly evident. A noticeable fact is that all 8 N-glycans selected by AUC values and ANOVA are sialylated to a different degree (mono-, di-, tri-, and tetrasialylated). Moreover, 5 of these structures are fucosylated (2 of them difucosylated), supporting the general notion of fucosylation involvement during progression of cancer in a different organ (33).
|
| Discussion |
|---|
|
|
|---|
Additional validation for clinically relevant glycomic mapping from serum samples comes from the correspondence of implicated N-glycan structures with those found in cancer cell lines in association with membrane glycoproteins. As shown in Fig. 4
and Fig. 5
, and verified through statistical criteria, a more intense glycosylation is associated with cancerous cells, in agreement with recent observations made elsewhere (6)(8)(9) on glycosylation due to cancer progression. In our measurements, the invasive cell lines capable of forming tumors away from their original site of transformation represented a more extreme change in glycomic pattern compared with the noninvasive cells.
Six N-glycan structures from the cancer cell lines are shared with those implicated as significant in our serum measurements (8 major N-glycans). The corresponding differences in glycomic maps revealed a substantial increase of fucosylation (both within the core component and the branched segments) with malignant transformation. Interestingly, increased fucosylation has also been associated with pancreatic cancer (33), colorectal cancer (40), human leukocyte cancer (34), and renal carcinomas (35). At this point in time, measurements of various sialylated structures in serum present a less clear picture. Since various sialylated oligosaccharides were implicated in different malignant cells as both O- and N-linked structures (7)(8)(9)(31)(32)(33)(34)(35)(40), our future studies will concentrate on these structural types.
In conclusion, the MS-based glycomic profiling of serum-derived constituents described here provides a highly sensitive and informative approach to differential evaluation of cancer conditions. Such an approach can be based on the sound knowledge of aberrant glycobiology of cancerous cells that shed their surface glycoproteins into the surrounding biofluids. Although it is not yet clear which serum glycoproteins are primarily responsible for the diagnostically distinct N-glycans, statistical analyses of selected N-glycans (or their patterns) provide potential for developing diagnostic and prognostic procedures. Whereas MS instrumentation is relatively expensive, the method provides nearly absolute structural information at a moderate level of measurement throughput. PCA, ROC, and ANOVA statistical analyses of the MS data independently confirmed 8 N-glycans (P < 0.001) to be significantly different between disease-free and diseased subjects. Importantly, the glycan profiling analyses require only minute volumes of serum sample and represent essentially a noninvasive methodology for diagnosis.
| Acknowledgments |
|---|
Financial Disclosures: None declared.
Acknowledgments: The authors thank Milan Madera for the development of PeakCalc 2.0.
| Footnotes |
|---|
| References |
|---|
|
|
|---|
The following articles in journals at HighWire Press have cited this article:
![]() |
T. Satomaa, A. Heiskanen, I. Leonardsson, J. Angstrom, A. Olonen, M. Blomqvist, N. Salovuori, C. Haglund, S. Teneberg, J. Natunen, et al. Analysis of the Human Cancer Glycome Identifies a Novel Group of Tumor-Associated N-Acetylglucosamine Glycan Antigens Cancer Res., July 15, 2009; 69(14): 5811 - 5819. [Abstract] [Full Text] [PDF] |
||||
![]() |
U. M Abd Hamid, L. Royle, R. Saldova, C. M Radcliffe, D. J Harvey, S. J Storr, M. Pardo, R. Antrobus, C. J Chapman, N. Zitzmann, et al. A strategy to reveal potential glycan markers from serum glycoproteins associated with breast cancer progression Glycobiology, December 1, 2008; 18(12): 1105 - 1118. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |