|
|
||||||||
Molecular Diagnostics and Genetics |
1 Biotage AB, Uppsala, Sweden.
2 National Board of Forensic Medicine, Department of Forensic Genetics, Linköping, Sweden.
aAddress correspondence to this author at: Biotage AB, Kungsgatan 76, SE-753 18 Uppsala, Sweden. Fax 46-18-591922; e-mail anders.alderborn{at}telia.com.
| Abstract |
|---|
|
|
|---|
Methods: We developed a method to determine the copy number of the CYP2D6*5 and CYP2D6*2xN alleles by use of PyrosequencingTM technology. A single set of PCR and sequencing primers was used to coamplify and sequence a region in the CYP2D6 gene and the equivalent region in the CYP2D8P pseudogene, and relative quantification between these fragments was performed. The CYP2D8P-specific Pyrosequencing peak heights were used as references for the CYP2D6-specific peak heights.
Results: Analysis of 200 pregenotyped samples showed that this approach reliably resolved 04 genome copies of the CYP2D6 gene. In 15 of these samples, the peak pattern from one analyzed position was unexpected but could be solved by conclusive results from a second position. The method was verified on 270 other samples, of which 267 gave results that corresponded to the expected genotype. One of the samples could not be interpreted. The reproducibility of the method was high.
Conclusions: CYP2D6 gene copy determination by Pyrosequencing is a reliable and rapid alternative to other methods. The use of an internal CYP2D8P control as well as generation of a sequence context ensures a robust method and hence facilitates method validation.
| Introduction |
|---|
|
|
|---|
One of the most important CYP enzymes with respect to drug degradation is encoded by the CYP2D6 gene [GenBank accession no. M33388 (2)], which has been shown to be involved in the degradation of
100 drug compounds (3). Approximately 80 different alleles of the CYP2D6 gene have been identified (4), and their presence varies between ethnic populations. The enzymatic activities in persons carrying these allelic differences vary from total absence to different degrees of ultrarapid metabolism, which causes considerable variability in the response to certain drug treatments.
The CYP2D6 gene is located on chromosome 22, together with the two pseudogenes CYP2D7P [GenBank accession nos. X58467 and X58468 (5)] and CYP2D8P [GenBank accession no. M33387 (2)], which are localized in tandem. Low or eliminated CYP2D6 enzyme activity is usually caused by point mutations, small deletions, or insertions. The CYP2D6*5 allele, however, carries a deletion of the complete gene. This allele is one of the most frequent CYP2D6 alleles, and homozygous carriers totally lack metabolic activity. On the other hand, rapid metabolizers often carry a duplication, or even multiplication (up to 12 copies have been found), of the CYP2D6 gene, most commonly the CYP2D6*2 allele [CYP2D6*2xN (1)(6)(7)]. The increased enzymatic activity in individuals carrying duplications enhances the degradation of drugs, which may cause nontherapeutic drug concentrations at typical dosages. Many antidepressants, for example, are metabolized by CYP2D6, and there is a distinct relationship between the number of CYP2D6 genes and the rate of metabolism of antidepressants (8). The presence of the CYP2D6 duplication shows an interesting northsouth gradient in the European population, with a frequency of
1% in Scandinavia (9) to 710% in Italy (10) and as high as 29% in Ethiopia in northern Africa (11).
Several methods for determination of CYP2D6 gene copy number have been reported. Although considered as relatively laborious and unreliable, the most commonly used method is a long-range PCR spanning the CYP-REP regions flanking the CYP2D6 gene, followed by gel-based size analysis of the PCR products (12). Other methods are based on quantitative amplification of a small PCR fragment from the CYP2D6 gene, in which the amount of PCR product reflects the number of CYP2D6 genes in the analyzed genome (13)(14). The amount of CYP2D6-specific PCR product is assessed in relation to a coamplified region from an unrelated housekeeping gene, which is assumed to consistently be a single-copy gene.
PyrosequencingTM is a nonelectrophoretic, real-time DNA sequencing technology (15)(16). A primer is hybridized to a single-stranded PCR template, and the sequencing analysis is started by addition of nucleotides. The nucleotides are added sequentially, and through coupled enzymatic reactions, the polymerase-catalyzed incorporation of nucleotides can be monitored as light peaks in a PyrogramTM. Pyrosequencing has been used for many applications, such as genotyping of single-nucleotide polymorphisms (17)(18), mutation detection (19)(20), and sequence detection (21)(22). The peak heights in the Pyrogram are directly proportional to the emitted light from the Pyrosequencing reaction. This nature of the light peaks enables quantitative analyses such as assessment of allele frequency in a mixed sample population (23)(24)(25) and degree of DNA methylation (26)(27).
Several assay protocols have been developed for analysis of CYP alleles by Pyrosequencing (28)(29)(30)(31)(32). In this report, we describe an easy single-well assay for determination of CYP2D6 gene copy number, based on relative quantifications from Pyrosequencing reactions using the pseudogene CYP2D8P as reference.
| Materials and Methods |
|---|
|
|
|---|
An additional set of DNA samples from 270 Swedish blood donors were analyzed to verify the assay. The Regional Ethics Committee had given permission for the analysis, and all donors had given their consent. These DNA samples were extracted by the dodecyltrimethylammonium bromide/cetyltrimethylammonium bromide (DTAB/CTAB) method (33) and pregenotyped for the CYP2D6*2, *3, *4, and *6 alleles by a specific Pyrosequencing method (29). Individuals hypothetically homozygous for CYP2D6*1, *2, *3, *4, or *6 were tested for CYP2D6*5 by a multiplex long PCR (7) with modifications (29). Analysis for gene duplication was performed in 26 of the 270 samples with an established long-fragment PCR method (34) with several modifications: Primers were manufactured (Invitrogen AB) according to published primer sequences (34). A 25-µL PCR reaction was carried out with the Expand Long Template PCR System (Roche Molecular Biochemicals). The reaction mixture contained 2.5 µL of 10x PCR buffer 3 (22.5 mM MgCl2), 5 µL of deoxynucleotide triphosphate mixture (2.5 mM each nucleotide), 0.36 µL of enzyme mixture (3.5 U/µL), 0.4 µL of each primer (cyp-17F, cyp-32R, and cyp-207 F; 20 µM), 11.3 µL of water, and 5 µL of genomic DNA (10 ng/µL). The conditions for amplification with the primer pairs cyp-17F/cyp-32R and cyp-207F/cyp-32R were as follows: an initial denaturing step of 94 °C for 2 min, followed by 35 cycles of 94 °C for 15 s, 64 °C for 30 s, and 68 °C for 6 min, with a final elongation step of 68 °C for 7 min. The PCR products were separated and detected in ethidium bromide-containing 0.8% agarose gels. The number of tandem copies of the CYP2D6 gene could not be determined by the long-PCR method used.
assay design
The three genes CYP2D6, CYP2D7P, and CYP2D8P were aligned to find suitable regions for PCR primer positioning (to amplify CYP2D6 and CYP2D8P and to avoid amplification of CYP2D7P). PCR designs in which only the CYP2D6 and the CYP2D8P genes were amplified were further analyzed for optimization of Pyrosequencing specificity. One or several sequencing primers for each PCR fragment were located upstream of regions with nucleotide differences between the CYP2D6 and CYP2D8P genes. In the Pyrosequencing analysis, gene-specific signals were achieved by design of an assay-specific nucleotide addition scheme. The CYP2D6*5/CYP2D6*2xN assay was thus based on relative quantification of Pyrosequencing peak signals between a CYP2D6 gene-specific amplicon and a CYP2D8P amplicon. The Pyrosequencing reactions were designed so that possible unwanted amplification of the highly similar CYP2D7P gene did not interfere with any of the CYP2D6/CYP2D8P-specific peaks.
primer sequences
Primers from three different vendors were successfully tested in the assay. The upstream PCR primer used was A1058FP (5'-GGTGGCTGACCTGTTCTYTG-3'). The sequence includes three internal sites where the common D6/D8 sequence diverges from the D7 sequence. Keeping the cycling annealing temperature above 58 °C excludes amplification of the CYP2D7P gene. The degeneration at the third position from the 3' end of the primer (bolded Y) is attributable to the variation between the CYP2D6 and CYP2D8P sequences (C/T). The downstream PCR primer was biotinylated A1051RPB (5'-GGGCTCACGCTGCACATC-3'). One of the sequencing primers used was A685FP (5'-CGGGATGGTGACCACCTC-3'). The C at the 3' end of the primer is homologous in the D6 and the D8 sequences but differs from the D7 sequence. The other sequencing primer used was A1050FP (5'-GGCCTCCTGCTCATGATCC-3').
pcr conditions
The PCR reagents used were all from the Qiagen HotStarTaq reagent set (cat. no. 203205; Qiagen) except for the deoxynucleotide triphosphates (100 mM solution; cat. no. 27-2035-02; Amersham Biosciences), and autoclaved milliQ water was used for diluting the reagents. In the final PCR mixture, the concentrations were 1.5 mM MgCl2, 2.5 mM each nucleotide, 200 nM each primer, and 0.2 ng/µL template DNA. Addition of Q solution, added according to the Qiagen manual, was necessary for PCR product specificity. The PCR cycling was initiated with 15 min of enzyme activation at 95 °C (according to the Qiagen instructions) followed by 45 cycles at 95 °C for 30 s, 59 °C for 30 s, and 72 °C for 30 s. The reaction was finished with a 10-min final extension at 72 °C. The quality of the PCR products was checked on agarose gels.
PCR instruments from Hybaid (Thermo-Hybaid Model MBS 0.26) and Applied Biosystems (Models 6700 and 9600) generated PCR products successfully used in the subsequent Pyrosequencing analysis. To test the influence of PCR annealing temperature on Pyrosequencing peak heights, measured as relative light units, we performed a temperature gradient PCR on a Mastercycler gradient (Eppendorf) with annealing temperatures ranging from 50 to 65 °C over the entire PCR block.
analysis
The PCR products were prepared for Pyrosequencing analysis by use of a Vacuum Prep Workstation (Biotage AB), and the sequencing reactions were performed on either a PSQ 96MA System or a PSQ 96HS System as described by the manufacturer (Biotage AB). Nucleotide dispensation orders were chosen manually to generate specific peaks for the targeted CYP2D6 and CYP2D8P PCR products and to reveal any false amplification from the CYP2D7P pseudogene. For the sequencing primer A685FP, the dispensation order CGACACTC generated CYP2D6-specific peaks at dispensations 5 (A) and 6 (C), which were compared with the CYP2D8P-specific peaks at dispensations 7 (T) and 8 (C) (see Fig. 2
). For sequencing primer A1050FP, the dispensation order CTACATGCT generated CYP2D6-specific peaks at dispensations 5 (A) and 6 (T), which were compared with the CYP2D8P-specific peak at dispensation 7 (G). The Sequence Analysis software (Biotage AB) was used for measurement of peak heights, and the results were exported to MS Excel for further calculations and analysis. In the A685FP primer assay, the CYP2D6-specific peak at dispensation 6 (C) and the CYP2D8P-specific peak at dispensation 8 (C) were used for the calculations. In the A1050FP primer assay, the CYP2D6-specific peak at dispensation 6 (T) and the CYP2D8P-specific peak at dispensation 7 (G) were used.
|
| Results |
|---|
|
|
|---|
95% homologous to CYP2D6 at the nucleotide level), which qualified this gene as the reference. The relative amounts of amplification products were determined by use of the Pyrosequencing technology to sequence through short regions that differed between the CYP2D6 target gene and the reference gene. The signal peak pattern, presented in a Pyrogram, was thus a product of two sequencing reactions that were performed simultaneously. Three types of peaks occurred in the Pyrogram: peaks specific for CYP2D6 (i.e., free from interference by CYP2D8P); peaks specific for CYP2D8P (i.e., free from interference by CYP2D6); and peaks consisting of a combined light signal from both genes. We determined the number of CYP2D6 genes by calculating the ratio between CYP2D6-specific peaks and peaks that were specific for the CYP2D8P gene. All peaks in the Pyrograms were, however, used to ensure the specificity of the assay.
Several potential target regions in the CYP2D6 and CYP2D8P genes were amplified by PCR and analyzed. The majority of regions investigated were unstable and caused various problems when tested on the DNA sample panel, most probably because of variability within the pseudogene sequence. One region obtained with PCR primers A1058FP and A1051RPB at the end of exon 6, however, was robust for the majority of DNA samples tested (Fig. 1
); the ratio was found to be proportional to the number of CYP2D6 genes in the sample. Typical results of genotyping using the sequencing primers A685FP and A1050FP on this PCR product are shown in Fig. 2
. The designation "nxD6" indicates the number of CYP2D6 genome copies. 0xD6 was a genotype that was homozygous for the CYP2D6*5 allele, i.e., CYP2D6 was deleted on both chromosomes and only the sequence from CYP2D8P was seen. 1xD6 indicates a sample with one CYP2D6 gene copy (heterozygous for CYP2D6*5), and the expected peak ratio was
0.5. 2xD6 indicates a wild-type sample with two copies of CYP2D6, i.e., one on each chromosome and a ratio of 1.
|
pcr annealing temperature
The effect of PCR annealing temperature on the specific peak heights from CYP2D6 and CYP2D8P in a wild-type genome (CYP2D6*1/*2) was tested in a gradient PCR with annealing temperatures in the range 5065 °C. The PCR products were analyzed with the two sequencing primers, and specific peak heights from the CYP2D6 and CYP2D8P genes as well as the ratio between them were plotted as a function of temperature (Fig. 3
). Skewed peak heights indicated differences in PCR efficiency between CYP2D6 and CYP2D8P at all PCR annealing temperatures except for a relatively narrow window of 5860 °C. At annealing temperatures below this window, the CYP2D8P-specific peak heights were higher than those for CYP2D6, whereas the CYP2D6-specific peak heights were higher than those for CYP2D8P at higher temperatures. The results were as expected based on the PCR primer A1058FP sequence, which has a C/T degeneracy between the CYP2D6 and CYP2D8P sequences. An annealing temperature of 59 °C was chosen to obtain a ratio close to 1.
|
pyrosequencing analysis
Peak heights exported from the Pyrosequencing software were further analyzed in Microsoft Excel, where the relative peak heights between CYP2D6- and CYP2D8P-specific peaks were calculated. Panels A1 and A2 in Fig. 4
are scatter plots of the peak-height ratios for the respective sites for 40 selected samples of different genotypes, showing conclusive centering of ratios for both positions around values of 0 (0xD6), 0.5 (1xD6), 1 (2xD6), 1.5 (3xD6), and higher ratios for three samples. According to our analysis, two of these samples carried 4 gene copies, but that has not been possible to verify by any other technique. The third sample carried even more than 4 gene copies, indicated by the even higher peak-height ratio. This is visualized more clearly if the ratios are sorted from minimum to maximum (see panels B1 and B2 in Fig. 4
). Note that samples exhibiting any of the variants described in the section below were excluded from these plots. The peak-height ratios of these samples were not relevant, and the two peak ratio values from such samples would segregate drastically, indicating the variation and the requirement for visual examination of the Pyrograms.
|
reproducibility
To study the reproducibility of the assays, we repeatedly analyzed four samples with known CYP2D6 gene copy numbers (1xD6, 2xD6, 3xD6, and 4xD6) on nine separate occasions (on different days) and plotted the D6/D8P ratio against the expected number of CYP2D6 copies (Fig. 5
). Data points from samples analyzed on a specific occasion (day) are connected by a line, and the linear regression coefficients (R2) were calculated for each such series. For the A685FP assay, the linear regression coefficients varied from 0.9731 to 0.9994, and for A1050FP, they varied between 0.9632 and 0.9979. The lines confirmed a good linearity in the analysis of copy numbers, and the comparison between lines indicated high reproducibility. This analysis also indicated that the assay using sequencing primer A685FP gave a somewhat greater day-to-day variation than the assay using the A1050FP primer.
|
samples generating atypical pyrosequencing data
The CYP2D8P gene deviated from the expected sequence in 13 of the
200 test samples. One relatively common allelic type with a C
T transition at the equivalent of position 2933 in CYP2D6 was found downstream of the A1050FP sequencing primer. This was readily revealed, however, by the unexpected Pyrogram pattern (compare panel A2 in Fig. 6
with panel C2 in Fig. 2
), which was affected by the signals from the variants of the CYP2D8P gene. All of these samples exhibited a typical 2xD6 pattern when analyzed with the A685FP sequencing primer (compare panel A1 in Fig. 6
with panel C1 in Fig. 2
). One 1xD6 sample with a base transition in one of the alleles, rendering an equivalent effect for the A685FP primer, was also found among the 200 test samples (compare panel B1 in Fig. 6
with panel B1 in Fig. 2
). This sample exhibited a typical 1xD6 pattern when analyzed with the A1050FP sequencing primer (compare panel B2 in Fig. 6
with panel B2 in Fig. 2
).
|
Another allelic variant was found in one 3xD6 sample when analysis was performed with the A685FP sequencing primer. This variant was more difficult to interpret. The Pyroseqencing data (Pyrogram) from the A685FP assay exhibited a pattern indicating multiple copies (6xD6) (panel C1 in Fig. 6
). In contrast, the Pyrogram from the A1050FP assay exhibited a typical 3xD6 pattern (panel C2 in Fig. 6
). The only plausible explanation is that the sequencing primer A685FP did not anneal to one of the CYP2D8P alleles in this sample, leading to CYP2D8P-specific peaks being generated from only one of the two CYP2D8P alleles, giving reference peaks one half the height of those obtained from the same sample analyzed with the A1050FP primer.
assay verification
The two sequencing primers A685FP and A1050FP gave interpretable results in 269 of the 270 pregenotyped DNA samples. A few samples had to be reanalyzed because of discrepancies between the A685FP and A1050FP assays. Thirteen samples (4.8%) were identified as carrying one allele with a duplication of CYP2D6 as indicated by the gene copy number, 3xD6 (Table 1
). Long-range PCR analysis verified these findings.
|
The gene copy number 0xD6 was not found in any of the samples from blood donors. Pyrograms corresponding to a gene copy number of 1xD6 were generated by 23 of 24 samples from individuals previously genotyped as heterozygous carriers of the CYP2D6*5 allele. The aberrant sample had a gene copy number of 2xD6, which did not compare with the previously found genotype, CYP2D6*4/*5. The sample was reanalyzed with the same result.
A typical allelic distribution of the CYP2D6 gene (2xD6) was found in 231 samples. In the first set of analyses, three samples generated results that deviated between the two primers. The A685FP primer indicated 3xD6, whereas the A1050FP primer indicated 2xD6 in two of these samples. The third sample generated the opposite aberrant result. After reanalysis, all three samples gave the result 2xD6. This was also verified by the long-range PCR for multiple genes. We could find no explanation for the aberrant results in the first analysis.
One sample pregenotyped as wild-type, CYP2D6*2/*2, was identified as 1xD6 by the Pyrosequencing assay. However, reinvestigation using long-range PCR did not verify the 1xD6 gene copy number, i.e., no gene deletion was observed. Of the samples from 269 blood donors, four 2xD6 samples and one 3xD6 sample were found to carry one allele with the variant CYP2D8P sequence identified with primer A1050FP (see the Pyrogram in Fig. 6
). A typical peak pattern was obtained with the sequencing primer A685FP.
| Discussion |
|---|
|
|
|---|
DNA samples prepared from blood in four different laboratories were analyzed in this study. Interestingly, small differences in the relative peak patterns of the sequence were observed from DNA samples delivered from the different laboratories and purified by different methods (data not shown). Our assumption is that the method for preparation of genomic DNA may influence the first cycles of the PCR reaction, perhaps via remaining material (e.g., histone complexes or regulatory proteins) bound to the DNA after the preparation. This difference in relative peak pattern was more pronounced when we used primer batches from a certain vendor (data not shown). The conclusion is that care must be taken when interpreting variability that may be attributable to low quality of the primers or the DNA preparation used.
A few allelic variations found within the CYP2D8P gene generated atypical Pyrogram patterns. A simple solution for these problems would be to use a pattern-recognition approach for the genotyping analysis. Because stable patterns were generated for the different genotypes as well as for the atypical samples, this assay is suitable for software-based recognition of Pyrosequencing peak patterns. This would facilitate the analysis step considerably because export of peak heights and Excel-based analysis would be unnecessary. A relatively simple prototype software for such purposes was tested with very good results.
Blind tests of DNA from blood donors showed rare and unexplained discrepancies in results generated with the different techniques used. Reanalysis of the samples clarified most of these discrepancies. However, two samples remained divergent. One sample previously typed as CYP2D6*4/*5 gave a gene copy number of 2xD6, which is not usually found for the actual genotype. A possible explanation is that the *4 allele in this case contained a gene duplication. This is also supported by the long-PCR analysis for multiple genes, which showed a duplication of one of the alleles. In the other sample, the genotype CYP2D6*2/*2 predicted a wild-type 2xD6 gene copy number, but instead we found a 1xD6 profile. This was the only sample of the 269 blood samples tested for which a discrepancy remained between the found copy number and the genotype in combination with long-PCR methods for the CYP2D6*5 allele and multiple CYP2D6 genes. One sample from the blood donors was excluded because of recurrently unclear results from both Pyrosequencing and long-PCR reactions. Considering the relatively high probability for variation in the CYP2D8P pseudogene, it could be assumed that other allelic variants will appear in larger sample groups. However, the variations identified here all indicate the strength of the assay, which generates sequence background from two positions. It is possible to determine the reason for unexpected (atypical) patterns in one position by having as a key the typical appearance of peaks generated at the other position.
A possible problem with the assay would be total or partial deletion of one of the two CYP2D8P alleles in a particular genome. To our knowledge, a sample carrying such a deletion has not been reported. Such a sample would generate Pyrograms for both positions with CYP2D8P-specific reference peaks that are one half of the expected height relative the CYP2D6-specific peaks. The gene quantification analysis would suggest an erroneous 4xD6 genotype for wild-type (2xD6) samples. The same type of problem would appear if the variation within CYP2D8P was so pronounced that the PCR reaction does not work for one of the CYP2D8P alleles. As described above, such a nonfunctional primer sequence most probably contributed to the aberrant result obtained during sequencing of one of the tested samples. If both CYP2D8P alleles are deleted in a sample, the specific CYP2D8P peaks will be absent from the Pyrogram, indicating the necessity to use other techniques (i.e., long-range PCR) for the analysis of CYP2D6 gene copy number. No such samples were found among those tested.
Except for long-range PCR, all other methods for the determination of CYP2D6 gene copy number that have been reported in the literature use an unrelated gene as reference. To our knowledge, this is the first report describing an analysis using a highly related gene as reference. There are several advantages associated with the use of a related gene and coamplification in a method such as the one described here. Only a single set of primers is used (the same for both target and reference gene), which increases the probability of an accurate determination of gene copy number. The generation of amplicons from unrelated genes may lead to variations in amplification efficiencies between the different genetic regions. The use of a single set of primers also enables more straightforward method validation. Both of these advantages should make the approach described here attractive for pharmacogenetic studies.
| Acknowledgments |
|---|
| Footnotes |
|---|
| References |
|---|
|
|
|---|
The following articles in journals at HighWire Press have cited this article:
![]() |
H.-K. Lee, L. D. Lewis, G. J. Tsongalis, B. C. Schur, P. J. Jannetto, S. H. Wong, and K.-T. J. Yeo Validation of a CYP2D6 Genotyping Panel on the NanoChip Molecular Biology Workstation Clin. Chem., May 1, 2007; 53(5): 823 - 828. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. C. Ledesma and J. A.G. Agundez Identification of Subtypes of CYP2D Gene Rearrangements among Carriers of CYP2D6 Gene Deletion and Duplication Clin. Chem., June 1, 2005; 51(6): 939 - 943. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |