|
|
||||||||
Molecular Diagnostics and Genetics |
1 Institute of Pathology, Medizinische Hochschule Hannover, Hannover, Germany.
2 Department of Disease and Stress Biology, John Innes Centre, Norwich, England.
aAddress correspondence to this author at: Institute of Pathology, Medizinische Hochschule Hannover, Carl-Neuberg-Str. 1, D-30625 Hannover, Germany. Fax 49-511-532-5799; e-mail Lehmann.Ulrich{at}MH-Hannover.de.
| Abstract |
|---|
|
|
|---|
Methods: We used PyrosequencingTM technology to determine the methylation status of 68 CpG sites in the CpG island of the CDKN2B gene [cyclin-dependent kinase inhibitor 2B (p15, inhibits CDK4)], frequently hypermethylated in myeloid malignancies, in a series of bone marrow samples from patients with myelodysplasia and myeloid leukemia (n = 82) and from 32 controls. A total of 7762 individual methylation sites were quantitatively evaluated. Precision and reproducibility of the quantification was evaluated with several overlapping primers.
Results: The use of optimized sequencing primers and the new Pyro Q-CpGTM software enabled precise and reproducible quantification with a single sequencing primer of up to 15 CpG sites distributed over
100 bp. Extensive statistical analyses of the whole CpG island revealed for the first time disease-specific methylation patterns of the CDKN2B gene in myeloid malignancies and small regions of differential methylation with high discriminatory power that enabled differentiation of even low-grade myelodysplastic syndrome samples from the controls, a result that was confirmed in an independent group of 9 control and 36 patient samples.
Conclusion: The precise quantitative methylation mapping of whole CpG islands is now possible with Pyrosequencing software in combination with optimized sequencing primers. This method reveals disease-specific methylation patterns and enables the development of specific diagnostic assays.
| Introduction |
|---|
|
|
|---|
The most commonly used methods are qualitative and quantitative methylation-specific PCR (4), which are very sensitive and easy to use but can analyze only a very limited number of dinucleotide cytosine-guanosine (CpG)1
dinucleotides (
310, depending on the primers/probes used). Furthermore, these methods do not provide precise information about the methylation status of single CpG sites. Bisulfite genomic sequencing, which provides single-CpG resolution over several hundred bps, is the method of choice for comprehensive methylation mapping. This method is not practical in a diagnostic setting, however, because detection and quantification of low methylation levels require analysis of very large numbers of clones, a very tedious and labor-intensive procedure.
Recently, a new real-time DNA sequencing technology called PyrosequencingTM was developed for the analysis of single-base variations. This method employs a sequencing-by-synthesis principle and enables the precise quantification of incorporated nucleotides at polymorphic positions (5)(6). An indirect bioluminometric assay measures quantitatively the amount of pyrophosphate (PPi) that is released from each incorporated deoxynucleotide (dNTP). Through an enzyme cascade, the released PPi is converted into a light signal that is directly proportional to the amount of incorporated dNTP. The measured light signals are sequentially displayed as peaks in a "PyrogramTM". Treatment of the DNA with sodium bisulfite converts the epigenetic difference between methylated and unmethylated cytosine into a single-base variation of the C/T type (5-methylcytosine remains unaltered, whereas cytosine is selectively deaminated to uracil and amplified as thymidine). Therefore, Pyrosequencing is a very suitable tool for methylation analysis, and it is not hampered by the limitations of the techniques mentioned above. Pyrosequencing was shown to be a reliable technique for the precise quantification of methylation at single CpG sites (7)(8)(9)(10) with a very good correlation with other methods (11)(12)(13). If more than 5 variable positions have to be taken into account, however, the current single-base variation software provided by the manufacturer cannot calculate all possible sequence combinations reproducibly and within a reasonable time frame (i.e., less than a few minutes).
The tumor suppressor gene CDKN2B (cyclin-dependent kinase inhibitor 2B (p15, inhibits CDK4)2
is well known to be inactivated by hypermethylation in a wide variety of hematological malignancies (2). The epigenetic inactivation of the CDKN2B gene is one of the most frequent molecular alterations in myelodysplastic syndrome (MDS), a group of clonal stem cell disorders characterized by inefficient hematopoiesis and a considerable risk (up to 30%) of transformation into secondary acute myeloid leukemia (AML) (14). Methylation is heterogeneous in most cases, thus identification of disease-specific methylation pattern is challenging, requiring quantitative analysis with single base pair resolution. We used Pyrosequencing with a set of only 7 sequencing primers to investigate 68 CpG sites (see Fig. 1
) in patients with myelodysplasia and myeloid leukemia and in controls.
|
| Materials and Methods |
|---|
|
|
|---|
dna isolation and sodium bisulfite conversion
Genomic DNA was isolated essentially as previously described (15). DNA samples were treated with sodium bisulfite with the EZ DNA Methylation KitTM (Zymo Research, HiSS Diagnostics) according to the manufacturers instructions and then eluted in 40 µL elution buffer.
generation of the pcr product
PCR products were generated in a 50-µL reaction volume with 400 nmol/L of forward and reverse PCR primers (see Table S1 in the Data Supplement that accompanies the online version of this article at http:www.clinchem.org/content/vol53/issue1), 200 µmol/L of each dNTP, 1.5 mmol/L or 2.5 mmol/L MgCl2 (see Table S1 in the online Data Supplement), 1 xPlatinum-Taq reaction buffer, and 1.25 units PlatinumTaqTM (Invitrogen). PCR conditions were 95 °C for 5 min, followed by 50 cycles with denaturation at 95 °C for 30 s, annealing at 55 °C or 60 °C for 45 s (see Table S1 in the online Data Supplement), and elongation at 72 °C for 30 s, finished with 1 cycle final elongation at 72 °C for 5 min.
pyrosequencing
PCR products (4550 µL) were added to a mix consisting of 3 µL Streptavidin Sepharose HPTM (Amersham Biosciences) and 37 µL binding buffer (Biotage) and mixed at 1200 rpm for 5 min at room temperature.
We used the Vacuum Prep ToolTM (Biotage) to prepare single-stranded PCR products according to the manufacturers instructions. The Sepharose beads with the single-stranded templates attached were released into a PSQ 96 Plate LowTM (Biotage) containing a mix of 45 µL annealing buffer (Biotage) and 500 nmol/L of the corresponding sequencing primer (see Table S1 in the online Data Supplement). Pyrosequencing reactions were performed in a PSQ 96MATM System (Biotage) according to the manufacturers instructions, with the PyroGold SQATM Reagent Kit (Biotage). CpG site quantification was performed with the new methylation Software PyroQ-CpGTM.
Criteria for Pyrogram selection were as follows: sufficient peak height of >15 units (arbitrary units for light emission calculated by the software), symmetric peaks without any irregularities or side-peaks, wide reading length with a high reliability until the end of the sequence, and absence of any significant signals at the positions where a bisulfite treatment control was included or where control nucleotides were dispensed to check for unspecific background signals (Fig. 2
).
|
statistical analysis
We performed statistical analyses with the R software package (version 2.1.0). We used the Wilcoxon rank-sum test for pairwise comparison of medium methylation levels per site within each sample group.
For the identification of differentially methylated regions with highest discriminatory power, we used different window sizes (3 to 8 CpG sites) to compare methylation levels over the whole sequence. The mean methylation levels of the 3 to 8 CpG sites within each window were compared among the different sample groups, and the resulting P values were plotted against the start point of the window. P values <0.05 were considered statistically significant.
| Results |
|---|
|
|
|---|
300 bp (see Table S1 in the online Data Supplement). To analyze every CpG site within these fragments, 7 sequencing primers resulting in high-quality PyrogramsTM were selected (Fig. 2In a first step, to validate the newly developed Pyrosequencing assays using the new Pyro Q-CpGTM software with long reading lengths for multiple methylation sites, we analyzed placental DNA, which is known to be unmethylated for most genes, and DNA from the cell line KG1a, which is known to be nearly fully methylated at the CDKN2B CpG island. As expected, results indicated that placental DNA was completely unmethylated whereas KG1a DNA was nearly completely methylated at all 68 CpG sites; mean 94%, range 74%100% [mean (SD) for all CpG sites: 1.1%, data not shown].
Sequencing primers with slightly staggered binding sites gave nearly identical results for every CpG site analyzed, demonstrating that the quantification is independent of the sequencing primer (compare primers no. 1 and 2 in Fig. 3
.). Several sequencing primer combinations were positioned in such a way that the last CpG sites reached by 1 primer (CpG no. 1315 of primer no. 2 in Fig. 3
, 6090 bp away from the primer binding site) were the first CpG sites covered by the 2nd primer (CpG no. 13 of primer no. 4 in Fig. 3
). All of these measurements gave highly concordant results, demonstrating the reliability of Pyrosequencing for the quantitative evaluation of sequences up to 100 bp in length.
|
analysis of patient samples
To demonstrate the feasibility and power of this approach, we analyzed the very heterogeneous methylation pattern of the CpG islands of the tumor suppressor gene CDKN2B in bone marrow samples from patients with different MDS subtypes (RA, RARS, and RAEB) and from patients with CMML and AML, and in 32 control samples displaying only mild reactive changes. A total of 7762 individual CpG sites were analyzed successfully (for only 2.5% of potential methylation sites, for technical reasons reproducible results were not obtained). The high frequency of aberrant methylation and the marked intra- and interindividual heterogeneity reported in the literature for the CDKN2B gene (16)(17) were clearly confirmed by the quantitative high resolution mapping obtained in this study. For a subset of samples (21 CMML cases), a comparison between direct bisulfite sequencing and Pyrosequencing was possible. Overall, the concordance was quite high (623 of 893 CpG sites showed very similar methylation levels). However, in our hands Pyrosequencing produced much more reliable results in replicates, and direct sequencing showed greater sample-to-sample variation in the overall quality of the data. The discrepancies clustered at certain sites (difficult sequence context) or in certain samples (DNA quality).
A detailed overview displaying all methylation data for each sample is available as an online supplement (Table S2 in the online Data Supplement). Comparison of different measurements of the same patient sample showed a deviation of <5% in methylation levels at each single CpG site. The mean methylation level within each group for every CpG site under investigation is shown in Fig. 4
. The control group showed considerable methylation around CpG no. 25 and beyond CpG no. 50, with a mean (SD) methylation level at all 68 CpG sites of 5% (5%). In the patient samples, mean (SD) methylation levels were 6% (5%), 6% (6%), 11% (8%), 11% (9%), and 21% (13%), for RA, RARS, RAEB, CMML, and AML, respectively. Although the average methylation levels were similar for most subgroups, a clear correlation existed between different risk groups (control, RA+RARS, RAEB, and AML) and average methylation level (Spearmans correlation r2 = 1.00, P <0.01)). More important, the distribution of methylated cytosine residues displayed pronounced regional differences (see Fig. 4
). Classification of all cases according to WHO guidelines (18) gave a very similar picture (see Fig. S1 in the online Data Supplement).
|
identification of small regions with high discriminatory significance
To identify the regions showing the largest differences in methylation, we used the Wilcoxon rank-sum test to systematically compare mean methylation levels of all CpG sites within a small window (38 CpG sites in size) across the whole CpG island between all groups. The obtained P values were plotted against the start point of the window. In a comparison of the control group with low-risk MDS (RA and RARS) (Fig. 5
) regions displaying statistically highly significant differences are clearly discernible. All statistical data are available as an online supplement (Table S3 in the online Data Supplement). Even for comparison of RA and RARS cases, which are sometimes difficult to differentiate on clinical and morphological grounds alone, statistical differences were found (P values <0.02, see Table S3 in the online Data Supplement). Only the comparison of RAEB and CMML did not show significant differences in any region. Very similar regions with high discriminatory significance were also found if all cases were classified according to the new WHO guidelines (see Table S4 in the online Data Supplement).
|
We confirmed these results in an independent patient cohort with 8 RA, 9 RAEB, 8 RARS, and 11 AML cases as well as 9 controls. In this independent series, we analyzed a subregion of CpG no. 17 to no. 41 and found statistically significant differences for all groups for a window from CpG no. 31 to 38 used for pairwise comparison. Only the comparison of RA and RARS did not reach statistical significance (P = 0.07), most probably because of the smaller sample size (see Table S5 in the online Data Supplement for all statistical data). For a hierarchical cluster analysis of all methylation data see Fig. S2 in the online Data Supplement.
| Discussion |
|---|
|
|
|---|
100 nucleotides with one single-sequencing primer. The reliability until the end of the sequencing reaction was confirmed by 1 or 2 additional overlapping sequencing primers. Two alternative tools for the analysis of complex methylation patterns are base-specific cleavage and subsequent matrix-assisted laser desorption ionization time-of-flight mass spectrometry of cleavage fragments (MALDI-TOF-MS; (20)(21) and array-based analysis of DNA methylation patterns (22)(23)(24). MALDI-TOF-MS is a very promising approach that enables high-throughput analysis of a large number of CpG sites in many samples, but its usefulness for analyzing individual CpG sites in regions with very high CpG density is often limited by the absence of suitable cleavage fragments or the generation of identical fragments from different regions. The success of an array-based methylation analysis is heavily dependent on the composition of the sequence of interest. Even with a comprehensive assay design and the application of very stringent filter criteria for the selection of reliable oligonucleotides, Mund et al. (24) and Kimura et al. (23) were unable to accurately differentiate between individual CpG sites in difficult regions.
A systematic comparison of the methylation patterns revealed region-specific differences in the CDKN2B CpG island concerning distribution and level of CpG methylation. The identification of small regions with high discriminatory relevance implies that large areas of the CpG island do not have discriminatory relevance, a finding that demonstrates the necessity for a comprehensive quantitative methylation analysis before high-throughput assays targeting only a few potential methylation sites (e.g., methylation-specific PCR) can be developed and implemented into routine diagnostics. The functional consequences of this heterogeneity in molecular terms (e.g., accessibility for transcription factors) have to be addressed in future studies.
Occasionally very weak methylation signals were found at positions where control nucleotides were dispensed or where a conversion control was implemented. Consequently, we consider methylation values
5% as potential background signals with questionable significance, in concordance with other publications. For example, Shaw et al. (10) regard values of 0%5% methylation as background noise and Jones et al. (25) defined a threshold of detection of 5% for their JAK2 genotyping assay because they found residual wild-type signals of
5% in nearly all of their homozygous cases.
In conclusion, we demonstrate that quantitative long-read Pyrosequencing is a reliable quantitative approach that enables the high-resolution mapping of a whole CpG island in a very efficient semi-highthroughput fashion. Employing this technology, we showed that only a few quite small regions of the CDKN2B CpG island provide relevant information for differential methylation analysis. Our results show, again, the advantage of quantitative compared to a merely qualitative methylation analysis (10)(26)(27) as well as demonstrating that analyzing the methylation of every individual CpG site is the only procedure that allows a detailed characterization of methylation patterns and identification of relevant regions with high discriminatory power, thus forming the basis for the development of clinically useful assays.
| Acknowledgments |
|---|
This study was supported by Deutsche Krebshilfe, grant number 10-1842-Le I and Deutsche Forschungsgemeinschaft, KFO 119/TP2.
| Footnotes |
|---|
2 Human gene: CDKN2B, cyclin-dependent kinase inhibitor 2B (p15, inhibits CDK4). ![]()
| References |
|---|
|
|
|---|
The following articles in journals at HighWire Press have cited this article:
![]() |
T. Ripperger, N. von Neuhoff, K. Kamphues, M. Emura, U. Lehmann, M. Tauscher, M. Schraders, P. Groenen, B. Skawran, C. Rudolph, et al. Promoter methylation of PARG1, a novel candidate tumor suppressor gene in mantle cell lymphomas Haematologica, April 1, 2007; 92(4): 460 - 468. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |