|
|
||||||||
Endocrinology and Metabolism |
1 ARUP Institute for Clinical and Experimental Pathology, Salt Lake City, UT.
2 Department of Pathology, University of Utah Health Sciences Center, Salt Lake City, UT.
aAddress correspondence to this author at: ARUP Laboratories, 500 Chipeta Way, Salt Lake City, UT 84108. Fax 801-584-5207; e-mail william.roberts{at}aruplab.com.
| Abstract |
|---|
|
|
|---|
0.02 mIU/L. The protocol for determining the functional sensitivity of TSH assays specifies analyses of serum samples with two reagent lots over a 6- to 8-week period. Methods: We determined the functional sensitivities of the Access 2, ADVIA Centaur, ARCHITECT i2000, E170, IMMULITE 2000, and Vitros ECi automated methods, using seven serum pools and two reagent lots for each method.
Results: The observed functional sensitivities were as follows: Access 2, <0.020 mIU/L; ADVIA Centaur, 0.039 mIU/L; ARCHITECT i2000, <0.005 mIU/L; Elecsys E170, 0.011 mIU/L; IMMULITE 2000, 0.014 mIU/L; Vitros ECi, 0.004 mIU/L. However, there were large differences between some method means for the seven serum pools. For the pool with the lowest TSH concentration, mean results were as follows: Access 2, 0.0203 mIU/L; ADVIA Centaur, 0.0085 mIU/L; ARCHITECT i2000, 0.0049 mIU/L; E170, 0.0098 mIU/L; IMMULITE 2000, 0.0077 mIU/L; Vitros ECi, 0.0014 mIU/L. Method-comparison studies using samples with TSH concentrations >0.2 mIU/L also showed method differences. The ARCHITECT i2000 method was the most precise at low TSH concentrations.
Conclusions: TSH methods do not provide comparable results for serum pools with TSH concentrations <0.2 mIU/L or for patient results across the analytic measurement range. Further investigation into the cause of these differences and additional harmonization efforts are required.
| Introduction |
|---|
|
|
|---|
0.02 mIU/L (1). This permits patients with nonthyroidal illnesses to be distinguished from those with primary hyperthyroidism. This is particularly important in patients hospitalized with nonthyroidal illnesses in whom TSH concentrations as low as 0.02 mIU/L may be encountered. Ill hyperthyroid patients should have profoundly low TSH concentrations (<0.02 mIU/L) (1). In addition, high-risk patients with well-differentiated thyroid carcinoma who are receiving thyroxine for suppression therapy should have TSH <0.01 mIU/L (1). It is recommended that the functional sensitivity of the TSH assay be verified independently of the manufacturers claim by use of human serum pools. At least 10 different runs spaced over 68 weeks with at least two different lots of reagents and two different instrument calibrations during the test period should be performed (1). A review of the literature failed to identify any studies conducted in the past 5 years in which the functional sensitivity of multiple TSH methods had been evaluated with these guidelines. We therefore conducted experiments according to these guidelines, using six automated TSH methods that were referred to as "third generation" in the manufacturers package inserts. We also compared the mean TSH results, using the same serum pools that were used to assess imprecision, to evaluate the agreement of these automated methods for low TSH concentrations.
| Materials and Methods |
|---|
|
|
|---|
20%, of 0.010.02 mIU/L. Two different lots of reagents were used for each method. According to the manufacturers information, all assay calibrators are traceable to the WHO Second International Reference Preparation 80/558. Assay imprecision was assessed by use of commercial quality-control materials, Anemia and Lyphocheck Levels 13 (Bio-Rad Laboratories). Each control material was analyzed by each method in duplicate per run. Daily runs were performed 2 days a week over a period of 3 weeks for each lot of reagent for a total of 24 replicates for each control. We estimated the functional sensitivity of each assay by preparing seven human serum pools from samples submitted for routine clinical testing. To prepare the pools, we assayed these samples on an E170 analyzer and combined samples with similar results. Aliquots of these pools were stored at 70 °C until use. Each pool was analyzed as outlined above for the commercial control material.
The overall CV was calculated by use of each individual replicate for each concentration of the control material and serum pool for each method. The functional sensitivity of each method, except for the Access 2 and ARCHITECT i2000 methods, was estimated by fitting a power function to the imprecision data for pooled serum samples and calculating the TSH concentration that gave a CV of 20%. The Access 2 and ARCHITECT i2000 methods did not achieve a CV >20% with any pool that was tested.
A method-comparison study was also conducted with one lot of reagent and using 104 patient samples with TSH concentrations that spanned the analytic measurement range of each assay. All studies using samples from humans were approved by the Institutional Review Board of the University of Utah. PassingBablok analysis of these data was performed with Analyze-itTM software version 1.71 (Analyze-it Software, Ltd.). The standard error of the estimate (Sy|x) and correlation coefficient (r) were obtained from linear regression analysis using Analyze-it software.
| Results |
|---|
|
|
|---|
|
The results of imprecision studies conducted with human serum pools are summarized in Table 2
. The estimated functional sensitivities for each method obtained with the mean pool concentration determined by each method are summarized in the legend to Fig. 1
. It is noteworthy that for the Access 2 and ARCHITECT i2000 assays, the pool with the lowest TSH concentration did not have CVs >20%; therefore, the true functional sensitivities of these assays could not be adequately assessed. The CV of the Access 2 method was 11% at a TSH concentration of 0.0203 mIU/L, and the ARCHITECT produced a CV of 6.9% at a TSH concentration of 0.0049 mIU/L. The ADVIA Centaur method did not meet the definition of a third-generation TSH assay because the TSH concentration at which it had a CV of 20% was 0.039 mIU/L and not <0.02 mIU/L. It is also noteworthy that for the human serum pool with the lowest TSH concentration, the Access 2 method gave a result of 0.02 mIU/L whereas the Vitros ECi gave a result of 0.001 mIU/L.
|
|
To examine the agreement of the methods at low TSH concentrations, we plotted the mean value for each pool vs the pool number (Fig. 2
). The Access 2 method gave the highest mean results for the three pools with the lowest mean TSH concentrations, whereas the Vitros ECi method gave the lowest results for the six pools with the lowest TSH concentrations. The ARCHITECT i2000 method gave the second lowest mean results for the three pools with the lowest mean concentrations, whereas the E170 method gave the second highest results for these pools. When the mean values for each method for pool 1 were compared by use of the t-test, the ADVIA Centaur value was not significantly different from either the E170 or the IMMULITE 2000 mean values (P = 0.28 and 0.70, respectively). All other combinations of methods including the E170 and IMMULITE 2000 were statistically different (P <0.01).
|
Because all methods ranked the TSH concentrations in the seven human serum pools in the same order and the TSH concentration was lowest in pool 1, we also judged imprecision at low TSH concentrations based on the CV calculated for pool 1. In this case, the ARCHITECT i2000 was the most precise, followed by the Access 2, the E170, the IMMULITE 2000, the Vitros ECi, and the ADVIA Centaur in that order.
We assessed the comparability of methods, using patient samples with a wide range of TSH concentrations (Fig. 3
). We used the IMMULITE 2000 as the comparison method because it gave "middle" values for the seven human serum pools and had a functional sensitivity <0.02 mIU/L. The Access 2, ARCHITECT i2000, and IMMULITE 2000 methods agreed well with one another and had slopes within 10% of 1.0, whereas the ADVIA Centaur, E170, and Vitros ECi yielded consistently higher results with slopes of 1.231.33. The scatter around the regression line was noticeably increased for TSH concentrations >25 mIU/L.
|
Inspection of results from the method-comparison study performed with individual patient samples revealed eight samples with TSH concentrations <0.1 mIU/L by the IMMULITE 2000 method. The sample with the lowest TSH concentration had an IMMULITE 2000 result of 0.01 mIU/L. The results obtained with the ARCHITECT i2000, E170, and Vitros ECi were 0.0173, 0.035, and 0.006 mIU/L, respectively. It is noteworthy that the ADVIA Centaur and Access 2 results for this sample were 0.133 and 0.20 mIU/L. The other seven samples in this group generally showed much closer agreement than was seen for the first sample. The method means for the seven samples were 0.091, 0.104, 0.054, 0.084, 0.053, and 0.040 mIU/L for the Access 2, ADVIA Centaur, ARCHITECT i2000, E170, IMMULITE 2000, and Vitros ECi methods, respectively.
| Discussion |
|---|
|
|
|---|
One limitation of our study design was the manner in which samples were incorporated with routine testing. The instrument operator was not blinded to the presence of test pools in the run. The technologist responsible for this project loaded all samples on all analyzers and was the primary operator of the ARCHITECT i2000 analyzer. We doubt that this factor had a significant effect on our results because current analyzers are "black boxes" on which samples are loaded and results are produced automatically. However, we cannot exclude the possibility that functional sensitivity estimates performed in the routine clinical laboratory might actually be higher than our estimates.
There was good agreement of all methods for pool 7, which had a TSH concentration of 0.2 mIU/L. Below this concentration the Vitros ECi gave lower results than the other five methods. At TSH concentrations <0.05 mIU/L, the Access 2 method gave substantially higher concentrations than the other methods. For human serum pool 1, the Vitros ECi measured a mean concentration of 0.0014 mIU/L, and the mean for the Access 2 was 0.0203 mIU/L. The other four methods gave a mean of 0.0077 mIU/L for this pool. The Access 2 result was more than twice the next highest result.
Overrecovery with the Access 2 method has not been reported previously. The lower than expected results for the Vitros ECi have been described previously (6)(7)(8). An earlier study comparing four TSH methods with samples having TSH values <0.8 mIU/L found that only two of the four showed good agreement (9). Possible explanations include differences in calibration, although all assays appeared to have been calibrated according to the same international standard, and differences in the specificities of the anti-TSH monoclonal antibodies used. These authors recommended that an international collaborative study be conducted to clarify reasons for assay differences and to standardize TSH measurements. To our knowledge, such a study has not been performed, and it is still clearly needed. An opinion on the general topic of immunoassay standardization, including the issues of analyte heterogeneity and the lack of reference methods, has been published previously in this journal (10).
When we reviewed the data from the method-comparison study for the nine results that were <0.1 mIU/L on the IMMULITE 2000, most of the results from the Access 2 were comparable to the other methods. However, one of the nine samples gave a result that was 20-fold higher than the IMMULITE 2000 result. It is possible that a few patient samples that gave substantially higher results on the Access 2 method than on the E170 and IMMULITE 2000 methods, which were used to screen samples for pool preparation, could have caused an increase in the Access 2 results compared with the other methods. It would be interesting to prepare patient pools using the Access 2 method to screen samples and repeat the functional sensitivity study with new pools. An alternative approach that might be useful to address the ability of TSH assays to measure low TSH concentrations is to estimate the minimum detectable concentration (11). This approach incorporates both linearity and imprecision considerations and has the potential to overcome some of the limitations of the concept of functional sensitivity that we have highlighted.
The intermethod discrepancies at low TSH concentrations suggest that the clinical usefulness of some TSH assays may be impaired. Diagnosis of hyperthyroidism in hospitalized patients and administration of appropriate thyroxine suppression therapy for thyroid carcinoma require accurate TSH measurements at TSH concentrations of 0.010.02 mIU/L. It is difficult to find published studies on the clinical usefulness of commercial TSH assays for these indications. In light of our results, additional information on the clinical usefulness of current commercial TSH assays is needed.
We estimated reference intervals (Table 3
) for each of the methods, using the slope and intercept from PassingBablok analysis for each method compared with the IMMULITE 2000 and a reference interval of 0.44.0 for the IMMULITE 2000, which was obtained from its package insert. This approach yielded lower reference limits ranging from 0.39 to 0.55 mIU/L and upper reference limits from 4.0 to 5.3 mIU/L. According to each manufacturer, all methods have been standardized to the same reference material. Our results with patient samples call into question how well this standardization process is working for TSH. There seem to be two groups of methods. One group, which includes the Access 2, ARCHITECT i2000, and IMMULITE 2000, agree reasonably well for patient samples containing >0.1 mIU/L TSH. The other group, which includes the ADVIA Centaur, E170, and Vitros ECi, also agree reasonably well, but the two groups differ by 1535%. TSH from the human pituitary has several different glycoforms (12). Differences in how TSH glycoforms are recognized by different immunoassays may contribute to difficulties in assay standardization and lead to discordant results with individual patient samples (13). Additional investigations into the role TSH glycoforms may be useful.
|
In conclusion, additional studies are needed to understand why there are differences among methods for samples with TSH concentrations <0.2 mIU/L. Once these differences are better understood, additional harmonization efforts can be implemented so that TSH methods show better agreement for samples with low TSH concentrations. All of the methods that we examined had a functional sensitivity <0.02 mIU/L except for the ADVIA Centaur. The ARCHITECT i2000 and Vitros ECi demonstrated the lowest functional sensitivities, but the ARCHITECT i2000 was the most precise of the methods evaluated in this study, with a CV of 6.9% at a TSH concentration of 0.0049 mIU/L.
| Acknowledgments |
|---|
| References |
|---|
|
|
|---|
The following articles in journals at HighWire Press have cited this article:
![]() |
C. A. Spencer, J. G. Hollowell, M. Kazarosyan, and L. E. Braverman National Health and Nutrition Examination Survey III Thyroid-Stimulating Hormone (TSH)-Thyroperoxidase Antibody Relationships Demonstrate That TSH Upper Reference Limits May Be Skewed by Occult Thyroid Dysfunction J. Clin. Endocrinol. Metab., November 1, 2007; 92(11): 4236 - 4240. [Abstract] [Full Text] [PDF] |
||||
![]() |
G Brabant, P Beck-Peccoz, B Jarzab, P Laurberg, J Orgiazzi, I Szabolcs, A P Weetman, and W M Wiersinga Is there a need to redefine the upper normal limit of TSH? Eur. J. Endocrinol., May 1, 2006; 154(5): 633 - 637. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Zophel, G. Wunderlich, and J. Kotzerke Should We Really Determine a Reference Population for the Definition of Thyroid-Stimulating Hormone Reference Interval? Clin. Chem., February 1, 2006; 52(2): 329 - 330. [Full Text] [PDF] |
||||
![]() |
S. Donadio, A. Pascual, J. H.H. Thijssen, and C. Ronin Feasibility Study of New Calibrators for Thyroid-Stimulating Hormone (TSH) Immunoprocedures Based on Remodeling of Recombinant TSH to Mimic Glycoforms Circulating in Patients with Thyroid Disorders Clin. Chem., February 1, 2006; 52(2): 286 - 297. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Waskiewicz, A. Burkhardt, and K. Emancipator Performance Characteristics of 6 Third-Generation Assays for Thyroid-Stimulating Hormone Clin. Chem., October 1, 2005; 51(10): 1904 - 1905. [Full Text] [PDF] |
||||
![]() |
W. L. Roberts Performance Characteristics of 6 Third-Generation Assays for Thyroid-Stimulating Hormone: Dr. Roberts responds Clin. Chem., October 1, 2005; 51(10): 1905 - 1905. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |