|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Proteomics and Protein Markers |
1 Li Ka Shing Institute of Health Sciences, 2 Department of Medicine and Therapeutics, 3 Centre for Emerging Infectious Diseases, and 4 Department of Anatomical and Cellular Pathology, The Chinese University of Hong Kong, Prince of Wales Hospital, Shatin, Hong Kong SAR.
aAddress correspondence to this author at: Li Ka Shing Institute of Health Sciences, Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Prince of Wales Hospital, Shatin, N.T., Hong Kong SAR. Fax 852-2648-8842; e-mail tcwpoon{at}cuhk.edu.hk.
| Abstract |
|---|
|
|
|---|
Methods: N-glycans from whole serum proteins in 2 µL serum were released by enzymatic digestion, cleaned up by hydrophilic chromatography, and subsequently quantitatively profiled with a linear MALDI-TOF MS system, which was originally designed for quantitative proteomic profiling. Serum N-glycome profiles from 46 patients with chronic hepatitis B infection and with different degrees of liver fibrosis were examined.
Results: The intra- and interassay CVs of peak intensities of the standard N-glycans were <8% and <17%, respectively. When the assay was applied to the analysis of serum N-glycome profiles, 17 peaks were found to be potential biomarkers for detection of liver fibrosis/cirrhosis. Linear regression analysis revealed that 4 peaks of 1341.5, 1829.7, 1933.3, and 2130.3 m/z (all P <0.005) had complementary value in detecting liver fibrosis and included them, but not any serological markers, in the diagnostic model. Leave-one-out cross-validation showed the diagnostic model could identify significant fibrosis (Ishak score
3) and cirrhosis (Ishak score
5), both at 85% accuracy.
Conclusion: This is the first study to illustrate the quantitative aspect of MALDI-TOF MS in N-glycome profiling and the first study to reveal the potential value of the serum N-glycan profile for identifying liver fibrosis.
| Introduction |
|---|
|
|
|---|
Glycomics studies at the whole-tissue level provide a general overview on the glycome, the total glycosylation pattern of glycoproteins, lipids, or other types of biomolecules. Some laboratories have attempted to release all glycans attached to whole serum proteins (the serum glycans/glycome) and compare the profiles of disease and control cases (11)(12)(13)(14). Potential diagnostic serum glycan markers have been identified for liver cirrhosis (11)(12), ovarian cancer (13), and breast cancer (14).
Capillary electrophoresis has been applied to profile the serum N-glycome (11) and can quantify carbohydrate molecules with high reproducibility at a CV <5% (15). Compared to mass spectrometry (MS), however, capillary electrophoresis has relatively lower throughput. Recently, Fourier transform ion cyclotron resonance MS has been used (13)(14), but information about its quantification performance has not been available. The MALDI-TOF mass spectrometer, a relatively cheaper instrument, has been used for characterization of carbohydrate structure for many years (12)(16). In the past 5 years, MALDI-TOF MSbased instruments, such as SELDI ProteinChip system (17)(18) and the ClinProt system (19), have been used for quantitative profiling of the serum proteome.
The feasibility of using MALDI-TOF MS for quantitative profiling of the glycome remained uncertain. In this study, we attempted to establish a high-throughput quantitative assay for profiling the serum N-glycome by using the MALDI-TOF MS function and spectrum analysis programs of the SELDI ProteinChip system. We also used the new assay to search for serum N-glycans, the concentrations of which were altered in patients with liver fibrosis, to serve as an example of the clinical potential of the assay.
| Materials and Methods |
|---|
|
|
|---|
-fetoprotein (AFP)] were retrieved from the database of our previous study (18). Ten patients had minimal fibrosis (Ishak score = 1), 9 had mild fibrosis (Ishak score = 2), 10 had moderate fibrosis (Ishak score = 3 or 4), 8 had severe fibrosis and incomplete cirrhosis (Ishak score = 5), and 9 had probable/definite cirrhosis (Ishak score = 6).
design of an assay for quantitative profiling of serum n-glycome
N-glycans from whole serum proteins in 2 µL serum were released by enzymatic digestion, cleaned up by hydrophilic chromatography, and then quantitatively profiled with a linear MALDI-TOF MS system, which was originally designed for quantitative proteomic profiling. The mass spectra were analyzed with the existing computer programs designed for quantitative proteomic profiling. Fig. 1
(see the Data Supplement that accompanies the online version of this article at http://www.clinchem.org/content/vol53/issue7 ) is the schematic diagram showing the design of the assay.
|
detachment of n-glycans from whole serum proteins
We denatured 2 µL of serum at 56 °C in 20 µL of 12.5 mmol/L sodium phosphate buffer (pH 8.3) containing 2.5 mL/L ß-mercaptoethanol (Sigma-Aldrich) and 1.25 g/L sodium dodecyl sulfate (Sigma-Aldrich) for 1 h and then added 5 µL of 100 mL/L Triton X-100 (Sigma-Aldrich) to neutralize the denaturing effect of the sodium dodecyl sulfate. We then added 25 µL of 50 mmol/L sodium phosphate buffer (pH 8.3) containing 0.5 U of peptide:N-glycosidase F (PNGase F; EC 3.5.1.52; Sigma-Aldrich). The digestion mixture was incubated at 37 °C for 24 h.
microscale purification of serum n-glycome by hydrophilic chromatography
A method originally developed for purification of glycopeptides (20) was modified in this study to purify N-glycans. We equilibrated 10 mg of preswollen Sepharose 4B gel (GE Healthcare Life Sciences) twice with 0.5 mL of binding solution (butanol:ethanol:7 mmol/L manganese chloride solution = 5:1:1, vol/vol/vol). We diluted 8 µL of the enzymatic digest with 200 µL of binding solution, mixed this with the gel in a 0.5-mL microcentrifuge tube, and incubated the mixture on a blood-tube rotator at room temperature for 30 min. The whole content was then transferred to a Spin-X column (Corning) and washed 4 times with 0.5 mL of washing solution (butanol:ethanol:water = 5:1:1, vol/vol/vol). Finally the N-glycans were eluted with 200 µL of 100 mL/L ethanol, followed by centrifugal vacuum drying. The N-glycans were redissolved with 5 µL of 100 mL/L ethanol as the purified N-glycan preparation and stored at 30 °C before MS analysis.
maldi-tof msbased n-glycome profiling assay
N-glycan samples and matrix chemical were added on an 8-spot gold ProteinChip array (Ciphergen Biosystems) by undertaking a sandwich spotting approach. Super-2,5-dihydroxybenzoic acid (DHB) in the presence of NaCl was used as the matrix chemical. A stock solution of super-DHB (1.35 g/L DHB, 0.15 g/L 2-hydroxy-5-methoxybenzoic acid, 10 mmol/L NaCl in 100 mL/L ethanol) was prepared, stored at 30 °C, and 6.7-fold diluted with MilliQ water on the day of MS analysis. First, 0.5 µL of super-DHB was spotted and vacuum-dried. Then 0.5 µL of the purified N-glycan preparation was spotted and vacuum-dried, followed by sequential addition and vacuum drying of 0.5 µL of 500 mL/L acetonitrile and 0.5 µL of super-DHB. Finally, 0.5 µL of 500 mL/L ethanol was spotted over the spotting area and vacuum-dried to achieve a homogeneous layer of analyte-matrix cocrystals. The gold ProteinChip arrays were read over the 9004000 m/z interval on the ProteinChip PBS II reader of a ProteinChip Biomarker System. The ProteinChip reader was equipped with a nitrogen laser operating at 337 nm wavelength and at 5 microJoule pulse. The mass spectra were acquired at a positive ion mode and an optimized interval of 12003000 m/z. Mass spectrum of each glycan sample was a mean of 610 mass spectra obtained over the sample spot at 61 acquisition positions. The mass spectrum of each sample was smoothed, baseline-subtracted, externally calibrated with the standard N-glycans (see Table 1 in the online Data Supplement), and finally internally calibrated with peaks at 1626.5, 2269.0, and 2947.6 m/z. The glycan peaks with signal-to-noise ratio >3 among the mass spectra were identified and quantified by use of Biomarker Wizard software (Ciphergen Biosystems). The peak intensities were normalized with the total ion current and subsequently with the total peak intensity. On the mass spectra, each nonsialylated glycan appeared as a single peak with an m/z value equivalent to the average molecular mass plus 1.0 mass unit (mass of a proton). For the sialylated glycans, sodium salts were formed and detected as a group of consecutive peaks with a step-wise increment of 22 m/z (see Fig. 2 in the online Data Supplement). For a sialylated glycan with n sialic acid residues, n + 1 consecutive peaks were detected. The normalized peak intensities of all consecutive peaks were summed to obtain the normalized peak intensity for the sodium-free parent glycan. Each sample was analyzed in duplicate. Mean values of the duplicates were used in subsequent data mining.
evaluation of quantitative performance of the maldi-tof msbased n-glycome profiling assay
Four standard N-glycans (CalBiochem) were used during evaluation. Structural information of the standard N-glycans is listed in Table 1 in the online Data Supplement. When the reproducibility of the normalized peak intensities was examined, 3 (M3N2, MAN8 and A2) of the standard N-glycans was fixed at 10 pmol. Intraassay CV was determined from 32 identical sample-matrix spots in one single day by use of the same preparations of matrix chemical and reagents. For the interassay CV, the standard glycan mixture was measured in duplicate for 12 consecutive days by using different batches of matrix chemical and reagents, which were prepared freshly on the day of experiment. Linearity relationships between the quantity and normalized peak intensity of each of 4 standard N-glycans were evaluated by varying amounts from 5 to 20 pmol while fixing the amount of the other 3 N-glycans at 10 pmol.
identification of potential diagnostic glycans for liver fibrosis/cirrhosis
As in our previous study (18), we used 2 criteria to identify the glycans associated only with disease, (a) the normalized peak intensities must be significantly higher/lower in patients with typical fibrosis/cirrhosis than in individuals with minimal fibrosis, and (b) the normalized peak intensities must correlate with the degree of fibrosis. The significance analysis of microarray algorithm (Stanford University) (17)(18)(21) was used to identify the glycans that were significantly higher/lower in patients with fibrosis/cirrhosis by comparing the N-glycome profiles of the cases with minimal fibrosis (Ishak score = 1) with those for cases with typical fibrosis/cirrhosis (Ishak score >3) at a median false discovery rate of 2.5%. In the significance analysis of microarray analysis, "2-classed, unpaired data" were selected as the data type, and 5000 permutations were performed. Correlations between the degree of fibrosis (Ishak scores) and the peak intensities of the significant glycans were analyzed by the Spearman rank order correlation test.
construction of glycan-based diagnostic model
Log2 values of the age, potential diagnostic glycans, and serological markers [total protein, albumin concentration, international normalized ratio, activated partial thromboplastin time (APTT), alanine transaminase, total bilirubin, ALP, AFP, hemoglobin, leukocyte count, and platelet count] were included as independent variables, and Ishak score was set as the dependent output variable. Forward stepwise linear regression analysis was performed using SPSS (SPSS Inc.) to select the variables with independent prediction values for constructing a diagnostic model to calculate the Fibro-Glyco index. Leave-one-out cross-validation was performed to evaluate the performance of the diagnostic model in detecting significant liver fibrosis (Ishak score
3) and cirrhosis (Ishak score
5), as we previously described (18).
statistical analyses
The Spearman rank order correlation test was used to examine correlation relationships between the N-glycan peak intensities and the clinical/biochemical variables. The sensitivity and specificity were calculated according to the standard formulas. ROC curves were constructed for differentiating patients with or without significant liver fibrosis (Ishak score
3) and for differentiating patients with or without liver cirrhosis (Ishak score
5). The likelihood ratios (LRs) were calculated by use of the standard formulas: positive LR (LR+) = sensitivity/(100% specificity); negative LR (LR) = (100% sensitivity)/specificity.
| Results |
|---|
|
|
|---|
0.95 for all 4 standard glycans (see Fig. 4 in the online Data Supplement).
|
serum n-glycome profile and liver fibrosisassociated fingerprint
The representative serum N-glycome mass spectra of serum samples from a healthy individual, a patient with minimal liver fibrosis, and a patient with significant fibrosis are shown in Fig. 1
, AC. Fig. 1D
shows that when heat-inactivated PNGase F was used to release the N-glycans, no significant peaks were detected. This finding indicates that the majority of the peaks in the mass spectra were contributed by the N-glycans released from the whole serum proteins. We observed that 5 freeze-thaw cycles did not significantly affect the serum N-glycome profiles (see Fig. 5 in the online Data Supplement).
We found that 55 glycans were matched among the serum samples. At a median false-discovery rate of 2.5%, we identified 15 N-glycans that were significantly higher and 8 N-glycans that were significantly lower in the patient group of typical fibrosis/cirrhosis. Among these 23 significant N-glycans, 13 were positively correlated with the degree of fibrosis, and 8 were negatively correlated (all P values <0.05; Fig. 2
). Therefore, only 21 glycans were significantly associated with liver fibrosis. ROC curve analyses showed that 17 of these liver fibrosisassociated glycans had potential value in the detection of either liver fibrosis and/or liver cirrhosis (all P values <0.05). The 1829.7 m/z glycan [area under the curve (AUC) = 0.90] was the best marker for detecting significant liver fibrosis, and the 1444.7 m/z glycan (AUC = 0.83) was the best marker for detecting liver cirrhosis. The results for the 17 potential diagnostic glycans are listed in Table 2
.
|
|
correlations between the n-glycan peak intensities and serological markers
Among the 17 potential diagnostic glycans, 5 were correlated with albumin concentration (r = 0.417 to 0.490 and 0.292 to 0.348; all P values <0.05); 5 were correlated with total protein (r = 0.329 to 0.351 and 0.292 to 0.343; all P values <0.05); 1 was positively correlated with APTT (r = 0.326; P = 0.027), and 3 were negatively correlated (r = 0.322 to 0.356; all P values <0.05); 4 were correlated with international normalized ratio (r = 0.304 to 0.403 and 0.380 to 0.450; all P values <0.05), resulting in 10 glycans significantly correlated with liver function. Concerning liver inflammation, only the 2606.9 m/z glycan was negatively correlated with alanine transaminase concentration (r = 0.366; P = 0.012). Concerning liver damage, 1 glycan was positively correlated with total bilirubin (r = 0.319; P = 0.031), whereas 1 glycan was negatively correlated (r = 0.316; P = 0.032). None of the glycans were correlated with ALP. Three glycans were positively correlated with AFP (r = 0.327 to 0.431; all P values <0.05). As a result, 76% (13 of 17) of the liver-fibrosisassociated glycans potentially reflected the physiological state of the liver. Finally, it is well known that platelet count is negatively correlated with degrees of fibrosis. We found that 8 glycans were correlated with the platelet count (all P values <0.05). Therefore, 82% (14 of 17) of the glycans forming the liver fibrosis fingerprint were significantly correlated with the serologic markers that directly reflect the liver physiology and/or with those that are well known to be altered in patients with liver fibrosis (Table 2
).
linear regression model for detecting liver fibrosis and cirrhosis
Among all the dependent variables (potential diagnostic glycans, serological markers, and age), only N-glycans of 1341.5, 1829.7, 1933.3, or 2130.3 m/z (all P <0.005) were selected as biomarkers with independent diagnostic values and included in the linear regression model for calculating the Fibro-Glyco index. Leave-one-out cross-validation showed that the Fibro-Glyco index significantly correlated with degrees of liver fibrosis (r = 0.784; P = 0.01). Furthermore, the index was useful in the detection of liver fibrosis and cirrhosis. The ROC curve areas for the Fibro-Glyco index in detection of liver fibrosis and in detection of liver cirrhosis both were 0.91 (Fig. 3
). At 84% specificity, the sensitivity for detecting liver fibrosis was 85%, the overall accuracy was 85%, and the LR+ and LR were 5.4 and 0.18, respectively. At 83% specificity, the sensitivity for detecting liver cirrhosis was 88%, the overall accuracy was 85%, and the LR+ and LR were 5.1 and 0.14, respectively.
|
| Discussion |
|---|
|
|
|---|
In the present study, using the serum N-glycome profiling assay, we successfully identified a panel of serum N-glycans as potential markers for detecting liver fibrosis and liver cirrhosis; 82% (14 of 17) of the glycans with potential diagnostic values were significantly correlated with the serological markers that directly reflect liver physiology and/or with those that are well known to be altered in patients with liver fibrosis. These findings also serve as indirect evidence indicating that identified diagnostic glycans have clinical meanings associated with the disease being studied.
Callewaert et al. (11) applied the DNA sequencer to profile sialidase-treated N-glycans from the whole serum proteins in patients with and without liver cirrhosis. The present study strongly suggested that the serum N-glycome profile is useful in predicting not only liver cirrhosis but also liver fibrosis. As far as we know, this is the first study demonstrating such clinical importance in humans. Because only 46 patients were examined in this study, we are undertaking a similar study with a much larger sample size to confirm the clinical value of the present diagnostic model. Further studies are also needed to determine whether the same serum N-glycanbased model can be used to detect liver fibrosis/cirrhosis with other underlying causes, such as chronic hepatitis C infection and chronic alcohol abuse.
N-glycans are different from peptides or proteins in terms of sequence-structure analysis. Because human N-linked glycans all contain the same chitobiose core [2 N-acetyl-glucosamines (GlcNAcs) attached to a trimannose] and are synthesized by a well-defined pathway, an accurate mass measurement will generally permit prediction of structure (23)(24). We attempted to predict the structures of the 4 major diagnostic glycans (1341.5, 1829.7, 1933.3, and 2130.3 m/z) from their mean m/z values by searching the Consortium for Functional Glycomics database and using Expasy GlycoMod (23) with an acceptable error tolerance
0.05%. The predicted structures of the 4 glycans were successfully matched to those that had been observed in human (Fig. 4
).
|
With the predicted structures, we found that 3 diagnostic N-glycans (1341.5, 1829.7, and 2130.3 m/z) showing positive correlations with fibrosis stages contain a proximal fucose and/or a bisecting GlcNAc. Because liver cirrhosis is the severe stage of liver fibrosis, our prediction was consistent with the observations that degrees of fucosylation on certain serum glycoproteins (25)(26)(27) were increased in liver cirrhosis. Recently Morelle et al. (12) also observed that certain serum N-glycan species containing a proximal fucose and/or a bisecting GlcNAc were increased in liver cirrhosis. One of these (1829.7 m/z) was found in the present study. Callewaert et al. (11) also found that a biantennary N-glycan with a proximal fucose and a bisecting GlcNAc was increased in liver cirrhosis and was a major diagnostic marker of liver cirrhosis. Because the m/z values allow us only to predict the structures of the glycans, further experiments are needed to confirm the predicted structures, such as tandem MS analysis (12) and glycosidase sequencing (28).
Our serum N-glycome profiling assay has several advantages. The assay can easily be performed by laboratories already equipped with the SELDI ProteinChip system, and only 2 µL of serum is needed for profiling the serum N-glycome. The assay is high-throughput, so a maximum of 96 serum samples can be examined in duplicate within 2 working days. As with the serum proteomic profiling assay, it is not necessary to know the structure of a diagnostic glycan, because a diagnostic N-glycan can be reproducibly recognized and quantified through its unique m/z value. When a MALDI TOF/TOF MS or MALDI Q-TOF MS instrument is used instead of the MALDI TOF MS instrument, the structures of the glycans can be further obtained by tandem MS analysis.
In addition to liver fibrosis, our assay has the potential to be applied to the diagnoses of other diseases, such as CDG. Multiple serum glycoproteins with abnormal glycosylation have been observed in patients with CDG (29)(30). Furthermore, our assay can be modified for other applications, such as quantitative profiling of glycans on a single protein. For example, quantitative profiling of the N-glycans detached from transferrin should be useful in detecting alcohol abuse. Quantitative analysis of specific N-glycans on haptoglobin may be useful in the diagnosis of liver cancer (31).
In conclusion, we have developed a high-throughput assay for quantitative profiling of serum N-glycome using a system originally designed for serum proteomic profiling. Using diagnosis of liver fibrosis as an example, we have demonstrated the potential of applying our assay to biomarker discovery.
| Acknowledgments |
|---|
Financial disclosures: None declared.
| Footnotes |
|---|
-fetoprotein; DHB, 2,5-dihydroxybenzoic acid; LR, likelihood ratio; AUC, area under the curve; GlcNAc, N-acetyl-glucosamine; APTT, activated partial thromboplastin time. | References |
|---|
|
|
|---|
-fetoprotein by sequential glycosidase digestion and lectin affinity electrophoresis. Clin Chim Acta 1996;254:23-40.[CrossRef][Web of Science][Medline]
[Order article via Infotrieve]The following articles in journals at HighWire Press have cited this article:
![]() |
K. D. Yang, W.-C. Chang, H. Chuang, P.-W. Wang, R.-T. Liu, and S.-H. Yeh Increased Complement Factor H with Decreased Factor B Determined by Proteomic Differential Displays as a Biomarker of Tai Chi Chuan Exercise Clin. Chem., January 1, 2010; 56(1): 127 - 131. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Bennett Untargeted Metabolomic Analysis Hits the Target Clin. Chem., December 1, 2007; 53(12): 2037 - 2039. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |