Molecular Diagnostics and Genetics
1 Department of Pathology and Laboratory Medicine, Mount Sinai Hospital, Toronto, ON, Canada.
2 Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada.
3 Discipline of Pathology, Memorial University, St. John's, NL, Canada.
4 Urology, University Hospital Charité, Humboldt University, Berlin, Germany.
Address correspondence to this author at: Department of Pathology and Laboratory Medicine, Mount Sinai Hospital, 600 University Ave., Toronto, ON, Canada M5G 1X5. Fax 416-586-8628; e-mail ediamandis{at}mtsinai.on.ca.
| Abstract |
|---|
Methods: Variant-specific reverse transcription-PCRs (RT-PCRs) for KLK1, KLK2, KLK5, and KLK15 were used to identify and clone the full coding sequence of intron III-containing splice variants. In addition, variant-specific RT-PCRs for the cloned KLK3 and KLK4 variants as well as for the "classical" forms of the six genes were used to determine their expression profiles in healthy tissues, their regulation by steroids, and their differential expression in prostate cancer.
Results: KLK1, KLK2, KLK3, KLK4, KLK5, and KLK15 showed a common type of splice variant in which intron III is retained. Expression profiling of these splice variants revealed expression profiles similar to those of the classical mRNA forms, although the pattern of hormonal regulation was different. The KLK15 splice variant was up-regulated in 8 of 12 cancerous prostate tissues. All encoded variant proteins were predicted to be truncated and catalytically inactive because of a lack of the serine residue of the catalytic triad.
Conclusions: The first six centromeric members of the KLK gene family have splice variants that retain intron III. Some variants show tissue-specific expression. The KLK15 splice variant appears to be a candidate biomarker for prostate cancer.
| Introduction |
|---|
The human kallikrein (KLK) genes are a family of 15 serine protease genes that map to chromosome 19q13.4 (11)(12)(13). Kallikreins represent the largest cluster of protease genes of any kind within the human genome. Although the function of many of these proteases is currently unknown, their clinical utilities have been successfully exploited. For example, human kallikrein 3 [prostate-specific antigen (PSA)] is the best biomarker for prostate cancer screening, diagnosis, staging, and monitoring (14). Other members of this family are promising biomarkers for prostate, breast, and ovarian cancer (15)(16)(17)(18). The association of this gene family with other disorders, such as Alzheimer disease (19), and physiologic processes, such as skin desquamation (20), has also been suggested.
Alternative pre-mRNA splicing is a common event among members of the KLK gene family. To date, 70 KLK splice variants have been reported, and each KLK gene possesses at least one variant (13). Splicing events such as exon skipping, extension, truncation, cryptic exons, and intron retention have been observed in both the coding and noncoding regions of the genes. Furthermore, alternative transcriptional start sites and polyadenylation sites have also been reported. Most of these variant transcripts are predicted to encode truncated proteins lacking one or more residues of the catalytic triad because of frameshifts. An exception is one KLK4 isoform, which retains all of the residues of the catalytic triad but lacks the signal peptide and is thought to act intracellularly (21). Although alternative splicing substantially increases the diversity of this locus, most of these putative protein isoforms have not been isolated, with the exception of a few proteins encoded by KLK2 and KLK3 variants (22). The association of some of these splice variants with cancer has been examined (23)(24)(25)(26)(27)(28)(29). Some of them are tissue, developmental stage, stimulus, or disease specific (our unpublished data).
Common patterns of alternative splicing have been observed within several gene families. For example, a study examining the effects of alternative splicing on transcripts encoding membrane proteins revealed that a common splice form leads to the removal of the transmembrane domain of single-pass transmembrane proteins, producing a soluble protein isoform (30). Another study has shown that 50 protein domain types were selectively removed by alternative splicing (31).
In the present study, we examined the frequency of a common type of splicing (intron retention) within the KLK gene family. Intron retention has been reported for several other genes. Recently, a study examining intron retention in a set of 21 106 known human genes revealed that 14.8% retained at least one intron (32). The probability of intron retention increases as intron length decreases because introns <100 bp in length are retained in 95% of cases. Among human kallikrein splice variants reported to date, two of them, Psa-rp2 for the KLK3 gene (GenBank accession no. AJ310938) and KLK4 variant 1 for the KLK4 gene (GenBank accession no. AF148532), retain intron III, which is relatively short (143 and 83 bp, respectively). Because other kallikrein genes, such as KLK1, -2, -5, and -15, also possess a short intron III, we speculated that retention of intron III might be a common splicing event among members of the KLK family.
| Materials and Methods |
|---|
matched noncancerous/cancerous tissues from prostate cancer patients
We obtained 12 pairs of matched tissue samples (noncancerous/cancerous) from patients with a median age of 63 years who underwent radical prostatectomy for prostatic adenocarcinoma at the University Hospital Charité, Berlin, Germany. Fresh prostate tissue samples were obtained from the cancerous and noncancerous parts of prostatectomy specimens. Small pieces of tissue were gross-dissected by a pathologist immediately after prostate removal, snap-frozen, and stored in liquid nitrogen until analysis. To ensure that the tissue was either malignant or benign, histologic analysis was performed by the same pathologist, as described previously (33). Only samples that were fully surrounded by malignant tissue were used. Tissue characterized as noncancerous was usually taken from the inner zone of the contralateral lobe.
The samples were collected with informed consent, and the study was approved by the Ethics Committee of the Charité Hospital.
hormonal regulation experiments with cancer cell lines
Cells were cultured to near confluency in RPMI medium (Life Technologies, Inc.) supplemented with glutamine (200 mmol/L), bovine insulin (10 mg/L), fetal bovine serum (100 mL/L), antibiotics, and antimycotics. The cells were then aliquoted into 24-well tissue culture plates and cultured to 50% confluency. Twenty-four hours before the experiments, the culture media were replaced with phenol red-free medium containing 100 mL/L charcoal-stripped fetal bovine serum. For stimulation experiments, the steroid hormones estradiol (estrogen), dihydrotestosterone (androgen), norgestrel (synthetic progestin), aldosterone (mineralocorticoid), and dexamethasone (synthetic glucocorticoid) dissolved in absolute ethanol were added to the culture medium at a final concentration of 10⁻⁸ mol/L. Cells stimulated with ethanol alone were included as controls. In all cases, the final ethanol concentration was 1 mL/L. The cells were cultured for 24 h and then harvested for mRNA extraction. All experiments were performed in triplicate.
RNA extraction
Prostate tissues and other healthy human tissues (esophagus, fallopian tube, hippocampus, ovary, pituitary, and tonsil) were pulverized with a hammer under liquid nitrogen. Total RNA from these tissues, as well as from cell line pellets, was extracted with TRIzol reagent (Life Technologies) and treated with DNase I (Invitrogen) according to the manufacturers' instructions. The RNA concentration and purity were determined spectrophotometrically.
reverse transcription
We reverse-transcribed 2 µg of total RNA into first-strand cDNA, using the SuperScript™ First-Strand Synthesis System for reverse transcription-PCR (RT-PCR; Invitrogen). The final volume was 20 µL. We diluted 1 µL of the cDNA 100-fold and performed a PCR reaction for the housekeeping gene β-actin, as described below, to check the quality of the first-strand cDNA synthesis.
PCR
Three PCR reactions were performed for each splice variant. In the first reaction, we used the primers F1 and R1 to simultaneously amplify both the "classical" form and the splice variant form of interest (Fig. 1). Using primer sets F2/R2 and F3/R3, we amplified and characterized the whole coding region of each splice variant (Fig. 1). Furthermore, using the primers identified by asterisks in Table 1 of the Data Supplement that accompanies the online version of this article at http://www.clinchem.org/content/vol51/issue3/, we were able to achieve specific amplification of each splice variant alone (one of the two primers binds within intron III). To examine the tissue expression of each splice variant, we used both splice variant-specific primers (F2/R2 or F3/R3) and the F1/R1 pair of primers in different reactions. For the steroid hormone regulation experiments, we used the F1/R1 pair of primers, whereas for the prostate cancer/noncancer pairs, we used the splice variant-specific primers.
Each PCR reaction was carried out in a reaction mixture containing 1 µL of cDNA, 10 mM Tris-HCl (pH 8.3), 50 mM KCl, 1.5 mM MgCl₂, 200 µM deoxynucleoside triphosphates, 100 ng of primers, and 2.5 U of Hot Star Taq DNA polymerase (Qiagen Inc.) on an Eppendorf thermocycler. The cycling conditions were 95 °C for 15 min to activate the Taq polymerase, followed by 35 or 40 cycles (Table 1 in the online Data Supplement) of 94 °C for 30 s, the annealing temperature (Ta, in °C; see Table 1 in the online Data Supplement) for 30 s, and 72 °C for 30 s, and then a final extension at 72 °C for 10 min. Equal amounts of PCR products were electrophoresed on 1.5% agarose gels and visualized by ethidium bromide staining.
To verify the identity of the PCR products, we purified them, using gel extraction reagents (Qiagen), and cloned them into the TOPO TA cloning vector (Invitrogen) according to the manufacturer's instructions. The inserts were sequenced from both directions by use of vector-specific primers with an automated DNA sequencer.
in silico analysis
Sequence homology searching was performed with the Basic Local Alignment Search Tool (BLAST), available from the National Center for Biotechnology Information, against the human expressed sequence tag (EST) database (dbEST). Sequences with >95% homology were considered as putative ESTs.
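The >95% cutoff can be illustrated with a minimal sketch. It assumes that "homology" here means percent identity over the aligned region, which the text does not state explicitly; the function names and the toy sequences are hypothetical and are not taken from the study or from BLAST itself.

```python
def percent_identity(aligned_query: str, aligned_subject: str) -> float:
    """Percent of alignment columns in which both sequences carry the same base.

    Both inputs are rows of a pairwise alignment (equal length, '-' for gaps).
    Gap columns count toward the length but never count as matches.
    """
    if len(aligned_query) != len(aligned_subject):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_query:
        raise ValueError("alignment is empty")
    matches = sum(
        q == s and q != "-"
        for q, s in zip(aligned_query.upper(), aligned_subject.upper())
    )
    return 100.0 * matches / len(aligned_query)


def is_putative_est(aligned_query: str, aligned_subject: str,
                    cutoff: float = 95.0) -> bool:
    """Apply a strict 'greater than cutoff' rule, mirroring the >95% criterion."""
    return percent_identity(aligned_query, aligned_subject) > cutoff


# Toy alignment (invented sequences): one mismatch in 20 columns gives 95.0%,
# which does not pass a strict >95% cutoff.
toy_query = "ACGTACGTACGTACGTACGT"
toy_subject = "ACGTACGTACGTACGTACGA"
print(percent_identity(toy_query, toy_subject), is_putative_est(toy_query, toy_subject))
```

In practice the identity values would come from the BLAST report rather than being recomputed; the sketch only shows how the threshold is applied.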
| Results |
|---|
genomic organization
Because all of these splice variants retain intron III, they have four instead of five coding exons (Fig. 2). A new exon is created by merging coding exons 3 and 4 of the classical form and the intron between them.
We defined the 5' and 3' splice sites [branch point, polypyrimidine (pY) tract, and acceptor site] of the classical form of the corresponding splice variant (Fig. 3) to explore the possible mechanism that gives rise to this splicing pattern. Knowing that the optimal donor and acceptor sites are denoted by the sequences AG/gtRagt and (Y)ₙcag/GT, respectively, we observed that none of the classical forms, with the exception of KLK5, had an optimal donor or acceptor site (Fig. 3). KLK5 was the only gene that had an optimal donor site. Regarding the possible branch point (YNY YRA Y), again the KLK5 gene was the only one with an optimal branch point. Finally, all of the genes have a pY tract located 18 to 40 bp before the acceptor site (Fig. 3).
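The comparison of a splice site with its consensus can be sketched as follows. This is an illustration only: it assumes the donor consensus AG/gtRagt and the branch point consensus YNY YRA Y are read as IUPAC nucleotide codes (R = A or G, Y = C or T, N = any base) with the exon/intron slash and spaces removed. The helper names and the example sites are hypothetical and are not KLK sequences.

```python
import re

# IUPAC codes needed for the consensus sequences quoted in the text.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "R": "[AG]", "Y": "[CT]", "N": "[ACGT]"}

DONOR_CONSENSUS = "AGGTRAGT"   # AG/gtRagt with the exon/intron slash removed
BRANCH_CONSENSUS = "YNYYRAY"   # YNY YRA Y with the spaces removed


def matches_consensus(site: str, consensus: str) -> bool:
    """True if every position of the site is allowed by the consensus."""
    pattern = "".join(IUPAC[code] for code in consensus.upper())
    return re.fullmatch(pattern, site.upper()) is not None


def count_mismatches(site: str, consensus: str) -> int:
    """Number of positions at which the site departs from the consensus."""
    if len(site) != len(consensus):
        raise ValueError("site and consensus must have the same length")
    return sum(
        re.fullmatch(IUPAC[code], base) is None
        for base, code in zip(site.upper(), consensus.upper())
    )


# Invented example sites, for illustration only.
print(matches_consensus("AGGTGAGT", DONOR_CONSENSUS))   # R position holds G
print(count_mismatches("CGGTCAGT", DONOR_CONSENSUS))    # two departures
print(matches_consensus("CTCTAAC", BRANCH_CONSENSUS))
```

A site with zero mismatches corresponds to what the text calls an optimal site; any nonzero count corresponds to a suboptimal one.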
Lander et al. (3) showed that shorter introns have a higher GC content in humans. On average, the GC content (60.8%) of intron III of kallikrein genes is higher than that of all other introns, and intron III is the only intron with a higher GC content than its flanking exons (data not shown). Furthermore, the GC content for sequences upstream and downstream from the premature termination codon is 66.8% and 58.2%, respectively (data not shown).
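GC content, as used above, is the percentage of G and C bases in a sequence. A minimal sketch of the calculation follows; the function name is hypothetical, ambiguous bases such as N are assumed to be excluded from the denominator, and the example strings are invented rather than taken from the kallikrein locus.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the unambiguous bases (A, C, G, T) of seq."""
    counted = [base for base in seq.upper() if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no unambiguous bases")
    gc = sum(base in "GC" for base in counted)
    return 100.0 * gc / len(counted)


# Invented toy sequences: compare an "intron" with its flanking "exons".
toy_intron = "GGCATGCCGC"
toy_exons = "ATGCATTAGC"
print(gc_content(toy_intron), gc_content(toy_exons))
```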
predicted protein sequence
Using the open reading frame finder program (ORF Finder) from the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/gorf/gorf.html), we identified the predicted protein sequence for each splice variant (Table 1; also see Fig. 1 in the online Data Supplement). Comparison with KLK3-IRIII and KLK4-IRIII revealed the following features: (a) they retain the signal peptide, suggesting that they will be secreted proteins; (b) they have a premature termination codon (PTC), and thus encode truncated proteins (Fig. 2); (c) they lack the "Ser" residue of the catalytic triad, and thus will not function as serine proteases; and (d) they contain a unique C-terminal sequence encoded from the intronic sequence (Fig. 4 in the online Data Supplement).
in silico analysis
Using BLAST, we identified ESTs that correspond to our experimentally defined splice variants (Table 2). We found 2 ESTs for KLK1-IRIII, 10 for KLK2-IRIII, and 1 for KLK15-IRIII. No EST was found for KLK5-IRIII. For the previously identified splice variants, we found five ESTs for KLK3-IRIII and one EST for KLK4-IRIII.
tissue expression
The tissue expression profile of the six splice variants was elucidated by RT-PCR using total RNA from 36 healthy human tissues. By performing two PCR reactions for each splice variant (as described in the Materials and Methods), we were able to compare the expression of each splice variant with its corresponding classical form. Generally, the concentrations of the splice variant mRNAs were similar to the concentrations of the corresponding classical forms (Table 3). Thus, KLK1-IRIII was highly expressed in kidney, pancreas, salivary gland, and thyroid. The splice variants KLK2-IRIII, KLK3-IRIII, and KLK4-IRIII were highly expressed in the prostate. The expression of these variants appears to be more prostate specific (particularly for KLK3-IRIII) compared with that of the corresponding classical forms. KLK15-IRIII was more highly expressed in prostate, salivary gland, testis, and thyroid. Finally, KLK5-IRIII appears to have a broader expression pattern than that of the classical form (Fig. 2 in the online Data Supplement). High expression was seen in the fallopian tube, esophagus, and pituitary. We observed no expression of the classical form in these tissues. On the other hand, in the cervix, salivary gland, spinal cord, stomach, and thyroid, in which the classical form was expressed in high concentrations, we observed no expression of the splice variant.
hormonal regulation
We examined the hormonal regulation patterns of the splice variants and classical forms of the six kallikrein genes by RT-PCR. Splice variants KLK2-IRIII, KLK3-IRIII, KLK4-IRIII, and KLK15-IRIII followed the same pattern as the corresponding classical forms; i.e., they were regulated mainly by the androgen dihydrotestosterone (DHT) and the androgenic progestin norgestrel. The splice variants KLK4-IRIII, KLK5-IRIII, and KLK15-IRIII were down-regulated by dexamethasone (Fig. 4).
Of interest is the pattern of regulation observed for KLK5-IRIII. Previous studies indicated that the classical form, KLK5, is up-regulated by estradiol and norgestrel in the cell line BT474 (34). In the present study, we observed that in the cell line BT474, KLK5-IRIII was down-regulated by estradiol, whereas KLK5 was up-regulated; in the cell lines PA-1 and HTB12, estradiol up-regulated KLK5-IRIII. In the cell line ES-2, only KLK5-IRIII was expressed and was down-regulated by norgestrel. Down-regulation by norgestrel was observed in the BG-1 cell line and by dexamethasone in the cell line HTB12. On the other hand, DHT up-regulated KLK5-IRIII in the cell line BG-1 (Fig. 4).
differential expression in cancerous vs noncancerous matched prostate samples
We examined the expression of the six splice variants in 12 pairs of matched noncancerous and cancerous prostate tissues. We observed no differential expression for the splice variants KLK2-IRIII, KLK3-IRIII, and KLK4-IRIII. The KLK1-IRIII splice variant was down-regulated in cancer in four pairs, whereas it was up-regulated in two (data not shown). KLK15-IRIII was up-regulated in 8 of 12 pairs (pairs 1–5 and 9–11), whereas we observed no difference for the remaining pairs (Fig. 5).
| Discussion |
|---|
Intron retention is a relatively common event. For example, in Drosophila P elements, an intron may be either spliced or retained, depending on the cell type in which it is expressed (35). In mammals, intron retention has been observed in many genes, including human tumor necrosis factor-β (36), human growth hormone (37), bovine growth hormone (38), and rat γ-fibrinogen (39). Dirksen et al. (40) have shown that both suboptimal 5' and 3' splice sites are required for intron retention of bovine growth hormone. In the present study, examination of 5' and 3' splice sites of the third intron of KLK1 through -5 and KLK15 indicated that all of them, except for KLK5, have suboptimal sites. Furthermore, mutational analysis of the pY tract of introns has shown that a minimum of five uninterrupted thymidines are required for strong binding of the heterogeneous nuclear ribonucleoprotein C (hnRNP C) and for correct splicing to take place (41). Only KLK1, -2, and -3 have an uninterrupted sequence of five thymidines, whereas KLK5 and -15 seem to have a very weak pY tract. The branch point was also shown to be important for effective splicing (41). Of the genes examined here, only KLK5 has a fully conserved branch point. In conclusion, it seems that a suboptimal 5' splice site (donor site) and 3' splice site (branch point, pY tract, and acceptor site) play a major role in intron III retention of these genes.
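The pY tract criterion mentioned above, a run of at least five uninterrupted thymidines, can be checked with a short sketch. The function names and the example tracts are hypothetical; the threshold of five is the only value taken from the text, and RNA-style input (U) is assumed to be equivalent to T.

```python
import re


def longest_t_run(tract: str) -> int:
    """Length of the longest uninterrupted run of T (or U) in the tract."""
    runs = re.findall(r"T+", tract.upper().replace("U", "T"))
    return max((len(run) for run in runs), default=0)


def has_uninterrupted_t_run(tract: str, min_run: int = 5) -> bool:
    """True if the tract contains at least min_run consecutive thymidines."""
    return longest_t_run(tract) >= min_run


# Invented example tracts, for illustration only.
print(longest_t_run("CCTTTTTCCTC"), has_uninterrupted_t_run("CCTTTTTCCTC"))
print(longest_t_run("CTCTCCTTCTC"), has_uninterrupted_t_run("CTCTCCTTCTC"))
```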
The higher GC content of intron III compared with the rest of the introns is in agreement with the results reported by Goodall and Filipowicz (42), who demonstrated in plants that introns with a higher GC content might have a lower excision rate, and with the results of a global analysis of the human transcriptome for intron retention (32). High GC content is a characteristic feature of active euchromatin and transcriptional activity. This feature of intron III might indicate that this area of the gene is more susceptible to transcription factor binding and higher transcriptional activity.
Nonsense-mediated mRNA decay (NMD) is a posttranscriptional surveillance mechanism that controls mRNA quality by degrading abnormal transcripts that contain a PTC (43). The retention of intron III changes the reading frame and creates a PTC. It seems that the splice variant transcripts reported here are immune to NMD because high concentrations of cytoplasmic mRNAs were detected. NMD-resistant mRNAs for other genes, such as β-globin (44), von Willebrand factor (45), cystic fibrosis transmembrane conductance regulator (46), LDL receptor (47), and apolipoprotein B (48), have been reported. It is possible that some genes contain cis-acting sequences that confer resistance to NMD. Such sequences have been found in yeast (49)(50).
Many proteins encoded by intron-retaining splice variants have been reported. These include isoforms of cofactor proteins CD44 and CD46 (51)(52), human growth hormone (53), human gonadotropin-related hormone gene (54), effector cell protease receptor-1 (55), and murine vitamin D receptor (VDR0) (56). Interestingly, in the latter case, the vitamin D receptor isoform (VDR1) was shown to act as a dominant-negative receptor against VDR0 transactivation. The predicted proteins of the kallikrein splice variants described here will be truncated and lack the serine residue of the catalytic triad. These proteins may act in a dominant-negative manner, regulating the function of the corresponding classical forms, or may display an as yet unknown nonprotease function.
Tissue-specific splicing has been reported for 10–30% of the human genes (6). We have shown here that the splice variant of the KLK3 gene seems to be expressed exclusively in the prostate. Very high expression in the prostate was also observed for the KLK2 gene splice variant. These data agree with our in silico analysis, according to which the KLK2 and KLK3 splice variants are expressed predominantly in the prostate. The finding of ESTs for these variants in sciatic nerve libraries is interesting because the classical KLK2 and KLK3 genes are known to be expressed almost exclusively in the prostate. The KLK5 splice variant seems to have a pattern of expression different from that of the classical form because in some tissues, either the splice variant or the classical form is expressed. The tissue-specific expression of the KLK2, KLK3, and KLK5 splice variants might be valuable if these genes find application as diagnostic and/or prognostic biomarkers in cancer. The differential tissue expression and hormonal regulation between the splice variants and the corresponding classical forms suggest that different cis- and/or trans-acting elements might regulate their transcription.
The connection between human kallikreins and cancer has been reported in many studies (11). Numerous kallikreins are established or emerging biomarkers for the diagnosis, prognosis, and monitoring of cancer at the mRNA and/or protein level. Splice variants of these genes display cancer-specific expression, or they are differentially expressed in cancer (22)(23)(24)(25)(57). In this study, we found that the splice variant of the KLK15 gene is up-regulated in 8 of 12 prostate cancer tissues compared with the corresponding healthy tissue samples. The classical form of the KLK15 gene has also been shown to be up-regulated at the mRNA level in prostate cancer and was associated with more aggressive forms (58).
A relationship between splice variants that retain introns and human diseases, including malignancies, has been revealed by global analysis of the human transcriptome (32). Among 88 genes that generate putative truncated proteins, there are genes associated with Williams–Beuren syndrome and Batten–Spielmeyer–Vogt disease. Furthermore, several genes are related to the tumorigenic process, including the p19A, tumor necrosis factor receptor, BCL2-like 11, and CDC2-like 10 genes.
Our unpublished data indicate that 3' extension of coding exon 3 in the rest of the genes may produce similar isoforms. For example, we found that KLK6 and KLK13 have a splice variant with 3'-coding exon 3 extension by 100 bp (close to the length of the retained intron III in the genes examined here). The extension also leads to a frameshift and creation of a PTC. The predicted encoded proteins will be truncated and lack the catalytic serine residue. Bioinformatic analysis of ESTs revealed that 3'-coding exon 3 extension or complete retention is universal among all kallikrein genes.
In this study, we showed a common type of alternative splicing, characterized by complete retention of intron III, in 6 (KLK1 through -5 and KLK15) of the 15 human kallikrein genes. Some of these variants may have diagnostic and/or prognostic value because they show tissue-specific expression or differential expression in comparison with the classical form. The up-regulation of the KLK15 splice variant in prostate cancer warrants examination of the encoded protein as a biomarker of prostatic cancer.
| References |
|---|
γA and γB (γ′) chains of fibrinogen. Cell 1982;31:159-166.