|
|
||||||||
Reviews |
Department of Epidemiology, German Centre for Research on Ageing, Bergheimer Strasse 20, D-69115 Heidelberg, Germany.
aAuthor for correspondence. Fax 49-6221-548142; e-mail h.brenner{at}dkfz-heidelberg.de.
| Abstract |
|---|
|
|
|---|
Method: Relevant articles published up to and including May 2005 were identified in the PubMed database. At least 10 cases and 10 controls had to be analyzed for a study to be included in the review. Data concerning the study population, performance characteristics, and the collection and processing of urine samples were extracted from the reviewed articles.
Results: In all, 34 retrospective studies evaluating 21 different markers complied with the inclusion criteria. Most of the studies were rather small and included heterogeneous clinical study populations. Promising results were reported for a few markers in single studies, but they have often not been replicated in subsequent larger studies. Some of the more promising results were obtained with 24-h urines or with specimen-handling procedures that might be difficult to perform under screening conditions.
Conclusions: Larger studies with a prospective design are required to confirm promising findings regarding performance characteristics of some novel markers recently reported in mostly small studies. Future studies should also pay particular attention to the practicality of the markers under screening conditions.
| Introduction |
|---|
|
|
|---|
The purpose of urine-based screening tests for prostate cancer is to find cancer cells from which markers can be extracted or to find released proteins or nucleic acids that are modified compared with the forms in healthy men. Three different groups of markers can thus be considered for the detection of prostate cancer: DNA-, RNA-, and protein-based markers. The main challenge is to find a marker that has good performance characteristics and at the same time allows easy urine collection and processing.
The primary aim of this review was to summarize the current evidence regarding performance characteristics of tests proposed for urine-based prostate cancer detection. Another important aspect was to examine the practicality of these markers under screening conditions.
| Materials and Methods |
|---|
|
|
|---|
Only human studies published in English were considered for the review. We used sensitivity and specificity to describe the performance characteristics of the tests; therefore, we included only studies that contained both prostate cancer cases and controls. To ensure a minimum precision of estimates of sensitivity and specificity, we included only studies with at least 10 cases and 10 controls. We calculated the 95% confidence intervals of those parameters based on the exact binomial distribution.
Among the studies complying with the inclusion criteria, 3 major different groups were distinguished according to the type of marker used: DNA-, RNA-, or protein-based markers. Among the DNA-based markers, a further distinction was made between genetic and epigenetic markers. Among protein-based markers, a further distinction was made between qualitative and quantitative markers. For quantitative markers, the cutoff point used in each study is given in one of the tables in this review. In a few articles, the cutoff point was not explicitly mentioned but could be derived from a figure. Information concerning the study population, performance characteristics, and collection and processing of urine markers was extracted for the review.
The numbers of cases and controls as well as mean or median ages were extracted to describe the study population. If possible, Gleason scores and tumor stage or grade were included. If not otherwise mentioned, cases were mostly confirmed by biopsy, which is the current gold standard for confirmation of prostate cancer. In addition, whenever possible, the criteria used for selection of controls were recorded. It was not clear in all reports whether the controls were confirmed to be free of prostate cancer by biopsy. If this information was reported, it is given one of the tables in this review. In some studies, women were included as controls. Whenever possible, specificity was calculated after exclusion of women from the controls. If this was not possible, the original specificity including women was used for the review and the number of women is given in the table.
Information concerning collection and processing of urine samples was extracted to assess the practicality and suitability of the marker for possible use as a screening tool. For example, a few studies in which urine was collected by urethral washing or milking or by catheter were included in the review, and this is listed in the tables.
Finally, we looked at potential sources of biases, e.g., whether the authors of the report explicitly indicated that those who performed the analyses were blinded to the diagnosis, whether cases and controls were comparable with respect to age and other factors that might affect test performance, and whether urine sampling procedures were similar for cases and controls.
| Results |
|---|
|
|
|---|
Studies evaluating genetic alterations are listed in Table 1
. Two studies investigated loss of heterozygosity (LOH) at defined locations as a tumor marker (6)(7). LOH might be the most common deletion event in prostate cancer (8). Cussenot et al.(6) assessed 4 locations and obtained a sensitivity of 73% and a specificity of 67% for LOH at one or more of the locations. Including 2 additional locations, Thuret et al. (7) obtained a sensitivity of 87% and a specificity of 44%. Urine was collected after prostatic massage in both studies.
|
A genetic biomarker of cellular oxidative stress that might be related to cancer is 8-hydroxydeoxyguanosine. Using a cutoff of 100 µg/g of creatinine, Chiou et al. (9) calculated a sensitivity of 31% and a specificity of 100% for the detection of prostate cancer by use of this marker.
Studies assessing epigenetic alterations are summarized in Table 2
. All 5 studies (10)(11)(12)(13)(14) evaluated the performance characteristics of promoter hypermethylation of the glutathione S-transferase P1 gene as a tumor marker. This DNA alteration appears in >90% of prostatic carcinoma tissues. Sensitivity was between 19% and 76%, and specificity ranged from 56% to 100%. Sensitivity was lowest (19%30%) in the only study in which urine was collected without previous prostatic massage or previous biopsy. Where assessed, no significant association with either Gleason score (12)(13) or tumor stage (10)(11) was found.
|
Studies evaluating RNA-based urine markers are summarized in Table 3
. The highest number of specimens analyzed for a particular marker was for DD3PCA3 after prostate massage, which was analyzed in 3 different studies (15)(16)(17). A noncoding messenger RNA is expressed by the DD3PCA3 gene in epithelial prostate cells and is overexpressed in prostate cancer tissue samples compared with nonmalignant tissue (16). The sensitivity in the 3 studies ranged from 66% to 82% and the specificity from 76% to 89%. Unfortunately, comparison of age between cases and controls was not possible with the data reported in the 3 studies. Furthermore, only samples expressing enough PSA were included in the studies; therefore, probably only persons at higher risk were evaluated in these studies.
|
Cells may escape senescence and proliferate if telomerase is activated (18). Expression of human telomerase reverse transcriptase (hTERT) is critical for telomerase activity. In their study, Crocitto et al. (14) measured hTERT RNA expression by reverse transcription-PCR and obtained a sensitivity and specificity of 36% and 66%, respectively (Table 3
). Using different approaches, Meid et al. (18) and Vicentini et al. (19) measured telomerase activity directly with the telomeric repeat amplification protocol assay and obtained sensitivities of 58% and 90% and specificities of 100% and 87%, respectively (Table 4
). In all 3 studies, prostatic massage was performed. Furthermore, Meid et al. (18) found a significant association between Gleason score and telomerase activity.
|
The performance characteristics of survivin, an inhibitor of apoptosis (20), as a marker for prostate cancer was evaluated in 2 studies (20)(21). Whereas Wang et al. (20) measured mRNA expression (RNA-based approach; Table 3
), Smith et al. (21) used a polyclonal antibody to detect survivin (protein-based approach; Table 4
). Sensitivity was 0% (whereas it was 100% and 80% for bladder cancer) in both studies, and specificity reached 100% (20) and 91%(21), respectively.
An overview of studies evaluating protein-based quantitative markers, including the studies of telomerase (18)(19) and survivin (21) discussed above, is given in Table 4
. Prostatic inhibin-like peptide is involved in the suppression of follicle-stimulating hormone (22). In each of 2 studies, both based on collection of 24-h urines and using a slightly different cutpoint, Teni and coworkers (22)(23) estimated a sensitivity >80% and specificity of 100%; however, these impressive results, obtained in 1988 and 1989, have not subsequently been reproduced.
Stoeber et al. (24) assessed minichromosome maintenance 5 (MCM-5) protein as a potential marker for prostate cancer. Minichromosome maintenance proteins are involved in the initiation of DNA replication and thus play a critical regulatory role (24). The high estimate of sensitivity (92%) in this blinded study was based on rather small numbers of patients, but a relatively precise estimate of 82% based on more than 200 controls was obtained for specificity. Two other markers, bladder tumor fibronectin (25) and basic human arginine amidase(26), have also been evaluated, each in a single study. It was not clear in the report regarding bladder tumor fibronectin (25) whether there was a difference in urine sampling between cases and controls because voided and catheterized samples were collected. Otherwise, there seemed to be no differences in the urine-handling procedures or in age between cases and controls in the 2 studies. The sensitivity and specificity for bladder tumor fibronectin were 43% and 77%, respectively (25), and for basic human arginine amidase were 47% and 100%, respectively (26).
Two studies assessed scatter factor (27)(28), 2 other studies assessed transferrin (or transferrin/creatinine ratio) (29)(30), and 1 study assessed immunoglobulin concentrations (31) as potential biomarkers. All 5 studies reported rather poor performance characteristics as these potential markers showed either a sensitivity (27)(28)(29)(31) or a specificity (30) near 30%. Furthermore, 3 of the studies (27)(28)(30) did not mention the ages of the participants, and 1 study (31) might have used a different sampling for cases and controls. In addition, the females used as controls by Rosen et al. (28) could not be excluded from specificity calculations.
Tissue factor may be expressed by malignant tissue and aid tumor growth (32). Both Lwaleed et al.(32) in 2000 and Adamson et al. (33) in 1993 obtained quite similar results for urinary tissue factor, with sensitivities of 57% (33) and 65%(32) and specificities in both studies around 75%. Furthermore, Lwaleed et al. (32) reported a significant increase in urinary tissue factor concentrations with higher tumor grade. However, both studies (32)(33) included large proportions of rather young persons as controls. Lwaleed et al. (32) also used females as controls.
Three studies evaluated urinary PSA, which may originate from free serum PSA or be produced in the urethral duct (34), as a tumor marker in 24-h urines (34)(35) and 12-h urines (36), respectively. Tremblay et al. (35) obtained a sensitivity of 31% and a specificity of 100%, but the majority of the controls were much younger than the cases. Age differences between cases and controls were small in the studies by Irani and coworkers. (34)(36), which examined the ratio between urinary and serum PSA. The first study showed good performance, with 84% sensitivity and 89% specificity (34), but these results were not confirmed in a later multicenter study with more participants (sensitivity, 42%; specificity, 80%) (36).
Studies evaluating urine tests based on qualitative protein markers are listed in Table 5
. Each of the 4 markers was evaluated in just 1 study. The best performance characteristics of all studies included in this review were reported by Edward et al. (37) in 1982 for prostatic cancer antigen 1, but the study population was rather small and the results have not been reproduced by any subsequent published studies in the last 23 years. In their rather large study, Chopin et al. (38) collected 24-h urines but obtained a very poor sensitivity (18%) for acidic fibroblast growth factor. Much higher sensitivities but somewhat lower specificities were reported in blinded studies by Rogers et al. (39) for
-methylacyl-coenzyme A racemase and by Moses et al. (40) for matrix metalloproteinases. In the latter study, however, there were considerable differences concerning age, sex, and collection of urine samples between cases and controls.
|
| Discussion |
|---|
|
|
|---|
Particularly advantageous in terms of practicality of a screening test would be a stable marker that is not strongly influenced by temperature, so that urine samples could be mailed to laboratories for analysis. In most of the studies, the urine specimens were stored frozen, which might be difficult to achieve in mass screening programs. In nearly all of those studies, it was not clear whether and to what extent the stability of the markers would be affected by intermediate storage of samples at room temperature.
Another important practical aspect is the type of urine sample needed for analysis. For example, although in their first study Irani et al. (34) obtained promising results for urinary PSA with 24-h urine samples, in studies using midstream urine samples, results were disappointing (42)(43). Even in the later multicenter study by Irani et al. (36), in which they used 12-h urine samples, the good results from the previous study (34) were not reproduced. Whether the different procedures used for urine collection were the only cause of the discrepancy remains to be examined. It is evident, however, that a marker that could be measured in more easily collected samples (such as a midstream urine sample compared with a 24-h urine) would be a great advantage for use in mass screening.
In 9 of 13 studies evaluating DNA- or RNA-based markers, prostatic massage or palpation was performed with the idea of increasing the sensitivity. This was not done in any of the studies evaluating protein-based markers. When we compared the performance of glutathione S-transferase P1 in the 2 studies from Goessl and coworkers (10)(11) (with massage) with the performance in the study by Jeronimo et al. (12) (without massage), prostatic massage seemed to be associated with a much higher sensitivity. However, in the recent study by Crocitto et al. (14), both the sensitivity and specificity were poor despite prostatic massage. The impact of prostatic massage has not been evaluated within a single study using otherwise consistent methodology among men with and without prostatic massage; therefore, its impact remains unclear. Clarification of this issue appears to be important because of the possible lack of acceptance by patients and the additional work required by physicians when prostatic massage would need to be performed as part of a screening test. A similar problem of acceptance could occur when urine is collected by urethral milking/washing or by catheter, as was done in 6 studies. In contrast to prostatic massage, there was no indication that the test performance might be increased by these methods.
For DNA- and RNA-based markers, more extensive laboratory processing may often be required than for protein-based markers. Compared with RNA, DNA has the advantage of being mostly more stable in urine; therefore, DNA-based markers might require much less effort for preservation of urine samples. Immunologic assays such as ELISA dominate the analyses of protein markers. Because these tests can be quite inexpensive, a stable and reliable protein-based marker would appear to be particularly suited for mass screening. New mass spectrometric techniques, such as matrix-assisted laser desorption/ionization time of flight (MALDI-TOF) or surface-enhanced laser desorption/ionization-time of flight (SELDI-TOF) mass spectrometry, might open further avenues for protein-based screening for prostate cancer in the future. Rehman et al. (44) used this method to identify proteins in urine samples as markers for prostate cancer. Their results appeared to be potentially promising, but the study had to be excluded from this review because there were only 6 cases and 6 controls.
Apart from the small sample size leading to rather imprecise estimates of performance characteristics of tests in most studies, potential sources of bias also must be taken into account. If possible, evaluation of diagnostic tests should be performed in a blinded fashion. Only 7 studies explicitly reported using a blinded design (13)(15)(19)(24)(36)(39)(40). One of these studies (40) used different urine sampling methods among cases and controls, which may hinder comparability of results. In 3 studies (28)(32)(40), the specificity may have been overestimated by inclusion of female participants as controls because it was not possible to exclude them from the specificity calculations. In addition, bias caused by age differences between cases and controls could not be excluded in 18 studies and should be carefully avoided in future studies.
In 18 studies, at least some information about tumor stage, grade, or Gleason score was provided, but only for 3 studies were the performance characteristics calculated according to stage or Gleason score (10)(19)(31), and only 5 other studies provided some information on the association between marker and tumor stage or Gleason score (11)(12)(13)(18)(32). Therefore, for most markers the sensitivity for detecting early-stage prostate cancer, the main target of potential screening programs, is unknown. Likewise, the ability of tests to distinguish aggressive from slowly growing cancers is essentially unknown. Future studies should aim at differentiating estimates of sensitivity according to stage and grade.
Because of the differences in study populations, collection and handling of urine samples, and laboratory techniques, the comparability of estimates of performance characteristics between studies is limited. For the same reasons, and because few markers were evaluated by more than 1 study, we decided not to use formal metaanalysis techniques to pool results from multiple studies. The fact that some of the most favorable results, such as those reported by Edwards et al. in 1982 (37), have never been replicated also points to potential publication bias, which should be kept in mind when interpreting the summary tables provided in this review.
In summary, development of urinary biomarkers for detection of prostate cancer appears to be in an early phase. On the basis of the 5-phase model of biomarker development for early detection of cancer, proposed by Sullivan Pepe et al. (45), the studies evaluated in this review appear to be phase 1 or 2. Some of the studies reported promising results that should be replicated in larger, prospectively designed studies. Progress in molecular biology might offer new opportunities to identify urine markers with better sensitivity and specificity. Mass spectrometry might be a particularly promising approach for discovering novel markers, but it should be kept in mind that peak height, which is measured by the mass spectrometer, is not linearly related to the abundance of the specific molecule and that the peaks identified by different investigators as indicative of the same disease are different (46). Furthermore, a combination of markers might be useful to improve the performance characteristics. Ideally, future prospectively designed studies should assess a variety of markers in the same, large study population to provide comparable, precise estimates of sensitivity and specificity for each individual marker as well as for various combinations of markers. In addition, future studies should also address practical issues, such as urine sample collection procedures or the need to immediately freeze samples at subzero temperatures, which might be relevant for mass screening. Finally, the costs of marker analysis and the impact of their possible use on the cost-effectiveness of population-based mass screenings also need to be addressed.
|
|
| Acknowledgments |
|---|
| Footnotes |
|---|
| References |
|---|
|
|
|---|
methylacyl coenzyme a racemase protein. J Urol 2004;172:1501-1503.[CrossRef][ISI][Medline]
[Order article via Infotrieve]
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |