Editorials
1 VA San Diego Healthcare System and University of California, San Diego, San Diego, CA 92161
a Author for correspondence. E-mail dherold{at}ucsd.edu.
Recent developments in the field of mass spectrometry have provided the accuracy and sensitivity to evaluate very-low-abundance steroids such as testosterone in female and pediatric patients. In this issue of Clinical Chemistry, Taieb et al. (1) present the most comprehensive evaluation of automated testosterone immunoassays to date. They compared 10 commercially available immunoassays with isotope-dilution gas chromatography-mass spectrometry (ID-GC/MS) and reached the inescapable conclusion that testosterone immunoassay results for specimens from females are inaccurate. Similar data have been reported for individual testosterone immunoassays previously (2), but Taieb et al. (1) are the first to show that for every commercially available testosterone assay studied, the values are in error, by a factor of 2 on average and in some cases by a factor of almost 5. Are assays that miss target values by 200-500% meaningful? Guessing would be more accurate and additionally could provide cheaper and faster testosterone results for females, without even having to draw the patient's blood.
By limiting all guesses to a narrow range, e.g., 2.04-2.44 nmol/L, the results would rarely be off by more than a factor of 3. Using a random number generator, we generated values close to the average female concentration measured by Taieb et al. (they were kind enough to share their data with us as an aid to writing this editorial). A Bland-Altman plot for guessed values vs ID-GC/MS values had a mean difference for the 55 female samples of 0 nmol/L with a SD of the differences of 1.2 nmol/L. This SD compares favorably with those presented by Taieb et al. (1) in Table 4. Although not intended to be a statistically rigorous proof that random numbers are better than measuring female testosterone values with immunoassays, guessing appears to be nearly as good as most commercially available immunoassays and clearly superior to some!
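The comparison described above can be sketched in a few lines of Python. This is only an illustration of the Bland-Altman arithmetic (mean difference, SD of the differences, and the conventional limits of agreement at 1.96 SD), not the analysis actually performed for this editorial. The function names, the uniform distribution for the guesses, and the seeds are assumptions, and the reference values are synthetic stand-ins, not the data of Taieb et al.

```python
import random
import statistics


def bland_altman(test_values, reference_values):
    """Return (mean difference, SD of differences, lower limit, upper limit).

    Differences are computed as test minus reference; the limits of
    agreement use the conventional mean +/- 1.96 SD.
    """
    if len(test_values) != len(reference_values):
        raise ValueError("paired inputs must have equal length")
    diffs = [t - r for t, r in zip(test_values, reference_values)]
    bias = statistics.mean(diffs)
    sd = statistics.stdev(diffs)  # sample SD (n - 1 denominator)
    return bias, sd, bias - 1.96 * sd, bias + 1.96 * sd


def guess_values(n, low=2.04, high=2.44, seed=0):
    """Draw n 'guesses' uniformly from a narrow range (nmol/L).

    The uniform distribution is an assumption; the editorial states only
    that a random number generator and a narrow range were used.
    """
    rng = random.Random(seed)
    return [rng.uniform(low, high) for _ in range(n)]


# Hypothetical reference values (nmol/L) standing in for ID-GC/MS results.
# These are synthetic and are NOT the measurements of Taieb et al.
_rng = random.Random(1)
reference = [_rng.lognormvariate(0.6, 0.5) for _ in range(55)]

guesses = guess_values(len(reference))
bias, sd, lower, upper = bland_altman(guesses, reference)
print(f"mean difference: {bias:.2f} nmol/L, SD of differences: {sd:.2f} nmol/L")
print(f"limits of agreement: {lower:.2f} to {upper:.2f} nmol/L")
```

Because the guesses barely vary, the SD of the differences is driven almost entirely by the spread of the reference values themselves, which is why a narrow guess centered on the population mean can look respectable in this kind of plot.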
Because medical test decisions are not made in a vacuum, a patient's appearance and presenting complaints would give the person guessing the serum testosterone concentration important information. Women with rapidly evolving signs and symptoms of virilization will have dramatically increased testosterone [>10.4 nmol/L (300 ng/dL)] (3), whereas women with late-onset 21-hydroxylase deficiency have moderately increased testosterone [4.2 nmol/L (120 ng/dL)] (4). Using this information while making an educated guess should give dramatically improved results. This would make educated guessing the better choice with the added benefits of rapid turnaround time and very low cost.
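For readers who move between the two unit systems quoted above, the conversion can be checked with a short sketch. It assumes a molar mass for testosterone of about 288.42 g/mol, so that 1 nmol/L corresponds to roughly 28.8 ng/dL; the constant and function names are illustrative only.

```python
# Approximate molar mass of testosterone (C19H28O2) in g/mol.
# Stated here as an assumption; tabulated values differ in the second decimal.
TESTOSTERONE_G_PER_MOL = 288.42


def nmol_per_l_to_ng_per_dl(conc_nmol_per_l):
    """Convert a testosterone concentration from nmol/L to ng/dL.

    1 nmol/L = 288.42 ng/L = 28.842 ng/dL (1 L = 10 dL).
    """
    return conc_nmol_per_l * TESTOSTERONE_G_PER_MOL / 10.0


def ng_per_dl_to_nmol_per_l(conc_ng_per_dl):
    """Convert a testosterone concentration from ng/dL to nmol/L."""
    return conc_ng_per_dl * 10.0 / TESTOSTERONE_G_PER_MOL


for nmol in (10.4, 4.2):
    print(f"{nmol} nmol/L is about {nmol_per_l_to_ng_per_dl(nmol):.0f} ng/dL")
```

With this factor, 10.4 nmol/L comes out within rounding of the 300 ng/dL quoted in the text, and 4.2 nmol/L within about 1 ng/dL of the quoted 120 ng/dL.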
What are the implications of the results of the study by Taieb et al. (1) for epidemiologic research? A recent study by Dorgan et al. (5) designed to address this issue concluded "that although absolute concentrations may differ for some hormones, RIA and mass spectrometry can yield similar estimates of between subject differences in serum concentrations of most steroid sex hormones commonly measured in population studies". The testosterone assay that Dorgan et al. were comparing with MS included an extraction and column purification. Many people believe that liquid-liquid extraction combined with column purification before RIA analysis provides accurate results for testosterone in specimens from females. However, we have previously demonstrated that RIAs that include extraction and column purification steps do not agree well with ID-GC/MS (6). An important limitation in the study by Dorgan et al. (5) is that for female specimens they tested only sample pools (low, mid, and high). Determining how the assay would work on individual patient samples is not possible when pooled samples are used. This is a critical flaw, because clinicians are concerned about the concentration of testosterone in an individual; in contrast, when pooled samples are analyzed, any cross-reacting substances in an individual sample are diluted in the rest of the pool. In Fig. 1 of their report, Taieb et al. (1) show that there is a wide degree of scatter when an extraction chromatography RIA is compared with ID-GC/MS for individual specimens. Although it does appear that extraction chromatography RIA is slightly more accurate than commercially available testosterone immunoassays, until an extraction chromatography RIA has been properly validated, results from epidemiologic studies based on these methodologies are also suspect.
How can assays that are grossly inaccurate gain approval for use in diagnosis and treatment of endocrine abnormalities? Several factors warrant consideration. In the US, the Food and Drug Administration approval process for a new diagnostic assay when there is an existing, approved diagnostic assay consists of demonstrating substantial equivalence to a predicate assay in a premarket notification 510(k) process. For testosterone, one of the predicate devices that is acceptable for demonstrating substantial equivalence is the Chiron ACS-180 testosterone assay. Several years ago, we compared the ACS-180 testosterone assay with ID-MS. The ACS-180 did not provide reliable results for female specimens (2). If the predicate device is not accurate, how can the newly designed assay hope to function properly in a clinical setting? This feature of the 510(k) process is one reason that our profession has made little progress in developing clinically acceptable testosterone immunoassays. From our clinical laboratory perspective, we suggest that predicate devices need to be validated by an independent chemical technique, preferably by a reference (or definitive) method (7)(8), before they are accepted as the standard to establish substantial equivalence. With the current regulatory environment, clinical chemistry is allowed, or perhaps even legislated, to perpetuate substandard levels of performance.
Recently, attention has focused on the need for better reporting of diagnostic accuracy of laboratory tests in peer-reviewed journals (9). Clearly, diagnostic accuracy is different from analytical accuracy, but concepts from the STARD initiative can also be applied to improving the testing and reporting of analytical accuracy. As stated by Bruns, diagnostic accuracy compares the results of one or more tests "with a reference (gold) standard in a group of patients suspected of having the condition of interest" (10). We suggest that tests of analytical accuracy should also include analysis of specimens from diseased individuals. In the case of testosterone, the immunoassays do not work in healthy females and fail miserably when used in potentially diseased females (1)(2)(6).
Laboratory professionals should not be associated with a test where an educated guess would provide an equivalent or better result.