Clinical Chemistry Link to Randox Laboratories Web Site
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Clinical Chemistry 44: 2340-2346, 1998;
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Submit an electronic Letter to
the Editor about this paper
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (26)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Stöckl, D.
Right arrow Articles by Thienpont, L. M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Stöckl, D.
Right arrow Articles by Thienpont, L. M.
Related Collections
Right arrow Laboratory Management
(Clinical Chemistry. 1998;44:2340-2346.)
© 1998 American Association for Clinical Chemistry, Inc.


Laboratory Management

Validity of linear regression in method comparison studies: is it limited by the statistical model or the quality of the analytical input data?

Dietmar Stöckl, Katy Dewitte, and Linda M. Thienponta

a Author for correspondence. Fax 32-9-264 81 98; e-mail Linda.Thienpont{at}rug.ac.be.


   Abstract
Top
Abstract
Introduction
Materials and Methods
Results and Discussion
Summary and Conclusion
References
 
We compared the application of ordinary linear regression, Deming regression, standardized principal component analysis, and Passing–Bablok regression to real-life method comparison studies to investigate whether the statistical model of regression or the analytical input data have more influence on the validity of the regression estimates. We took measurements of serum potassium as an example for comparisons that cover a narrow data range and measurements of serum estradiol-17ß as an example for comparisons that cover a wide data range. We demonstrate that, in practice, it is not the statistical model but the quality of the analytical input data that is crucial for interpretation of method comparison studies. We show the usefulness of ordinary linear regression, in particular, because it gives a better estimate of the standard deviation of the residuals than the other procedures. The latter is important for distinguishing whether the observed spread across the regression line is caused by the analytical imprecision alone or whether sample-related effects also contribute. We further demonstrate the usefulness of linear correlation analysis as a first screening test for the validity of linear regression data. When ordinary linear regression (in combination with correlation analysis) gives poor estimates, we recommend investigating the analytical reason for the poor performance instead of assuming that other linear regression procedures add substantial value to the interpretation of the study. This investigation should address whether (a) the x and y data are linearly related; (b) the total analytical imprecision (sa,tot) is responsible for the poor correlation; (c) sample-related effects are present (standard deviation of the residuals sa,tot); (d) the samples are adequately distributed over the investigated range; and (e) the number of samples used for the comparison is adequate.


   Introduction
Top
Abstract
Introduction
Materials and Methods
Results and Discussion
Summary and Conclusion
References
 
Linear regression, when applied to method comparison data, provides useful information about proportional, constant, and random error via, respectively, the slope, intercept, and standard deviation of the residuals (Sy||x) (1). Linear regression data may be used for calibrating a new method against an established one or validating the utility of a method in relation to analytical quality specifications (1)(2). The classical linear regression model is referred to here as ordinary linear regression (OLR).1 The regression line is calculated by minimizing the squared residuals in the y direction ("least squares"). OLR assumes an error-free x variable and a constant analytical imprecision (sa) of the y variable (also called "homoscedastic" variance), both of which are seldom met in practice. To compensate for variable sa, weighted forms of OLR have been introduced; however, they have not received much attention in practice. For the case in which x is not error-free, alternatives to OLR have been proposed that consider variances in both variables. One of the most important of those alternatives is the so-called Deming regression (DR) (3). In its most simple form, DR assumes equal x and y variances and minimizes the distance of the data points orthogonal to the regression line (3). When the variances in x and y differ considerably (but their ratio is constant), the distance of the data point is minimized at an angle to the regression line that is dependent on the ratio of the two variances (4). Note that the regression procedures that consider variances in x and y are also known under the name principal component analysis (4). When a constant ratio of the variances is assumed, the technique is called standardized principal component analysis (SPCA). In addition to the parametric alternatives, nonparametric versions of linear regression were established in method comparison studies, of which the Passing–Bablok regression (PBR) probably is the most widely used variant (5)(6)(7). PBR is based on the rank principle and assumes a constant ratio of the variances. The procedure is claimed less sensitive to outliers than its parametric counterparts.

Note that, regardless of the model used, the uncertainty in the regression estimates increases when (a) the magnitude of Sy||x relative to the data range is high; (b) the data are not adequately distributed over the investigated range; (c) the number of data points is low; or (d) the relation between the data is not linear. In the last case, the model itself is wrong and should be replaced by a nonlinear one. (Nonlinear procedures are sometimes referred to as second-order linear regression.) For the above reasons, it is recommended that method comparison data always be accompanied by graphical presentations (8)(9).

Because of the availability of different linear regression procedures, the question of which is the most appropriate for method comparison studies has been investigated by several groups, e.g., by Cornbleet and Gochman (10), Wakkers et al. (11), Linnet (12)(13), Payne (14), and Lawton et al. (15). From these investigations, which focused mainly on the error in slope estimation and the consequent statistical unreliability of hypothesis testing, many prefer DR over OLR. Others make use of OLR dependent on the value calculated for the product–moment correlation coefficient (r). For example, Wakkers et al. (11), Westgard and Hunt (1), and the EP9 protocol from the NCCLS (8) restrict the use of OLR to those cases where r >=0.99 (11) or >=0.975 (1)(8). With respect to PBR, its relevance still deserves better demonstration because some put its use as an alternative to OLR into perspective by revealing inadequacies in simulation tests (12), whereas others advocate it over DR (14). In the most extreme, it is argued that linear regression should not be applied at all in connection with method comparison studies (16), but that graphical techniques (bias plots) (9)(17)(18) be used instead.

To elucidate the relevance of these recommendations in practice, we here compare the usefulness of different linear regression variants (OLR, DR, SPCA, and PBR) on the basis of data obtained from real-life studies (19)(20). Our main emphasis was to clarify whether the quality of the analytical input data or the particular regression method used has the greater influence on the validity of the regression data. We discuss, in particular, the importance of correlation analysis in connection with regression analysis and the relevance of Sy||x.

The discussion of whether bias plots or linear regression procedures are more appropriate for interpretation of method comparison studies is beyond the scope of this study. The reader is referred to reports by Stöckl (9), Lawton et al. (16), Bland and Altman (17), and Hyltoft Peterson et al. (18) for this subject. Note also that the data of the method comparison studies here serve only the purpose of evaluating the different regression procedures. The analytical and clinical relevance of the method comparison studies have been described elsewhere (19)(20).


   Materials and Methods
Top
Abstract
Introduction
Materials and Methods
Results and Discussion
Summary and Conclusion
References
 
The data sets used for our investigation came from recently published method comparison studies for serum estradiol-17ß (19) (using 22 samples) and serum potassium (20) (using 60 samples). The estradiol-17ß study used a relatively low number of samples because it investigated the possibility of recalibration of routine methods in terms of a reference method.

For the different regression procedures, OLR was performed with Microsoft EXCEL® or the EVAL KIT (21) software. DR was performed with Microsoft EXCEL [using the respective formulae used by Cornbleet and Gochman (10) and Linnet (12)] or the EP Evaluator (22) software. SPCA and PBR were performed with the EVAL KIT software. Note that the latter calculates the SPCA slope as the geometric mean of the two slopes that result from OLR, performed with y and with x as the independent variable (4). Robust OLR was performed with SYSTAT 7.0® software (23). From the several variants available, the trimming procedure was chosen, with a factor of 0.1 for discriminating outlying residuals.


   Results and Discussion
Top
Abstract
Introduction
Materials and Methods
Results and Discussion
Summary and Conclusion
References
 
overview
Tables 1 and 2 summarize the regression and correlation data for the two method comparison studies. Table 1 contains the data for the serum estradiol-17ß study (19), where measurements by 12 different immunoassays (y variables) were compared with those of a reference method based on isotope dilution gas chromatography–mass spectrometry (x variables). Table 2 contains the data for serum potassium, obtained by comparison of results from 12 laboratories using four different routine methods (y) and a reference method based on ion chromatography (x) (20). The substantial difference between these two cases lies in the investigated concentration ranges. In the case of estradiol-17ß, the concentrations ranged from 0.015 to 3.33 nmol/L (i.e., ~3 decades), whereas in the case of potassium they ranged from 3.56 to 5.42 mmol/L (i.e., 1 decade). The comparisons are ordered in both tables according to increasing values of Sy||x, as calculated from OLR.


View this table:
[in this window]
[in a new window]
 
Table 1. Regression data from method comparison for serum estradiol-17ß measurement: isotope dilution gas chromatography–mass spectrometry reference method (x) and routine immunoassays (y).


View this table:
[in this window]
[in a new window]
 
Table 2. Regression data from method comparison for serum potassium measurement: ion chromatography reference method (x) and routine methods (y).

the Sy||x PROBLEMATIC: CONNECTION BETWEEN Sy||x ANDr
Like the correlation coefficient r, Sy||x indicates the magnitude of total random error of the method comparison, including nonlinearity, drift or shift, total analytical imprecision (sa,tot), and sample-related effects. However, in our experience when several methods are compared with each other at the same time, Sy||x is a better indicator for total random error than r, particularly when the data range is wide, as illustrated in Table 1Up . There, the Sy||x values increase >10-fold, i.e., from 0.017 to 0.192 nmol/L, whereas the corresponding r values decrease only from 1 to 0.98. On the other hand, r is more useful for a general expression of the magnitude of total random error because it is independent of the units of x and y. However, r is dependent on the data range. For wide data ranges (e.g., 3 decades), much greater values of r are needed to describe the same quality of correlation than for small data ranges (e.g., 1 decade). This might explain the difference in what past studies have considered a "good" r value in method comparisons, namely r >=0.975 (8) or r >=0.99 (11).

The mathematical relationship between Sy|x and r can be delineated by the formula of Sy|x = (10). By substituting the slope by|x= , and after simplification (N - 1 {approx} N - 2), r2=1- is obtained. Thus, the greater the ratio Sy|x:data range, the lower the r value.

Note that the higher the ratio Sy||x:data range or the lower the value of r, the greater the uncertainty in the estimates of slope and intercept (this holds true for all regression procedures).

the Sy||x PROBLEMATIC: CONNECTION BETWEEN Sy||x, ANALYTICAL IMPRECISION, AND SAMPLE-RELATED EFFECTS
The importance of calculating and interpreting the value for Sy||x follows the fact that Sy||x is influenced by two effects, namely, by sa,tot (which equals ) and by sample-related effects. Note, however, that Sy||x may be inflated by nonlinearity and drift/shift during the comparison. Because of the latter, we recommend that method comparison studies should be done with particular care for internal quality control (IQC). Thus, in addition to the common interpretation of systematic differences between methods reflected by slope and intercept, we advise using the information content of the regression analysis about random error by comparing the observed value of Sy||x with the one predicted from sa,tot (18). However, only the Sy||x value obtained from OLR can be used for this purpose (8). Naturally, as already discussed, the higher the Sy||x values, the more uncertain the regression estimates for systematic differences between methods will be.

When Sy||x sa,tot, sample-related effects are present. When the values for x come from a hierarchically higher reference method, then it is obvious that the routine method (y) caused the problems. When two routine methods are involved, either or both methods might be affected by sample-related effects.

When sa,tot and Sy||x are of similar magnitude, one should still check whether the correlation coefficient r has a "reasonable" value (e.g., >=0.975 or >=0.99; see the discussion below). When r is considerably lower and sa,tot {cong} Sy||x, method precision is too poor for sufficiently reliable estimation of slope or intercept. In such a case, there is a strong need to minimize the influence of imprecision by performing replicate analyses, checking IQC data (e.g., for drifts or shifts), or measuring all samples within the same analytical run. We want to stress at this point that method comparison studies should be done with particular care for IQC, which means that many more IQC measurements should be performed when carrying out a method comparison study than when using an assay in routine operation.

the Sy||x PROBLEMATIC: DIFFERENT REGRESSION PROCEDURES CALCULATE DIFFERENT VALUES
It is important to notice that the different regression procedures give different values for Sy||x. Note that PBR does not calculate values for Sy||x at all, which we consider a major disadvantage of this regression method. The formula given by Cornbleet and Gochman (10) calculates "true" Sy||x values, in the sense of orthogonal distances of the data pairs from the regression line (those authors used the term Sy·x instead of Sy||x). Generally, they are smaller than those from OLR. However, they become identical to those of OLR in the case of a zero slope, whereas they are smaller by when the slope is 1.

Furthermore, the reader should be aware that commercial software packages also calculate different values for Sy||x. DR, as performed with the program of Rhoads (22), gave values for Sy||x that were nearly identical to those of OLR. We assume that the Rhoads program for DR calculates "usual" Sy||x values; however, they may differ slightly from those of OLR when slope and intercept do not totally agree in both procedures. SPCA, as performed by the software we used (21), calculated values of Sy||x that were smaller by than those of OLR. In this program, true Sy||x values are calculated by assuming an equal imprecision of both methods and, hence, dividing the usual Sy||x value by .

As noticed above, only the Sy||x values, as calculated by OLR, can be used to compare the predicted variance across the regression line with the observed one (8).

interpretation of the regression data for case 1 (method comparisons for serum estradiol-17ß measurement): comparison of olr and dr/spca
The values for slope and intercept according to OLR and DR/SPCA are very similar for methods 1–10 (see Table 1Up ). The biggest difference in the slope (intercept) value is 0.007 (0.006 nmol/L) for method 10. Additionally, the standard errors for slope and intercept differ <1% between OLR and DR. (The EVAL KIT program does not provide standard errors for SPCA.) Interestingly, all respective values for r are >=0.99. These findings correspond with the restrictions implied before by Wakkers et al. (11) to apply OLR to method comparison data only when r >=0.99. On the contrary, for methods 11 and 12, the values for slope and intercept by OLR differ substantially from those by DR/SPCA. For these methods, r values <0.99 are observed. In consequence, in these cases DR/SPCA would be most appropriate (10)(11)(12)(13). However, from the purely statistical point of view, one would prefer the application of DR/SPCA in all cases.

But what is the analytical relevance of these findings? Does the application of, for example, DR in place of OLR add a value to the method comparison for cases 11 and 12? From the analytical point of view, we would doubt this. Interestingly, when recalibration of the routine methods on the basis of their correlation with the reference method was proposed (19), these cases were intuitively excluded because of the poor correlation and the high values of Sy||x. In other words, even when recalibrated for individual samples, those methods would reveal differences from the reference method that were too large. Note also that the uncertainty of the slope (95% confidence level) was ~0.08 for cases 11 and 12, which would introduce a considerable calibration uncertainty when those methods would be recalibrated by use of the method comparison. Therefore, despite its statistical justification, application of DR instead of OLR makes little analytical sense for these cases.

From these first observations, we confirm that OLR is a valid regression procedure when r >=0.99 for method comparison studies that cover a wide data range. This holds true for the estimation of slope and intercept and their respective confidence intervals. Consequently, under the restriction that r be >=0.99 for data that cover a wide range, OLR can be applied for calibration purposes as well as for hypothesis testing. When r <0.99, one should investigate whether a different regression procedure really solves the analytical problem.

interpretation of the regression data for case 1 (method comparisons for serum estradiol-17ß measurement): investigation of pbr
PBR corresponds very well to the other regression procedures for low to medium values of Sy||x (see methods 1–7), with the exception of method 4. However, PBR slope and intercept estimates differ from the other regression variants when Sy||x becomes high (in particular, in methods 11 and 12). It is obvious from this observation that PBR cannot be regarded as a substitute for DR or SPCA.

As addressed before, test 4 shows the peculiarity that the slopes for OLR and DR/SPCA differ from that of PBR, despite a relatively low value of Sy||x (Fig. 1 A). When plotting the OLR residuals (Fig. 1B ), it can be seen that this discrepancy originates from the fact that the method comparison data are not linearly related. This is also evidenced from the sign sequence of the y residuals. PBR gives a sign sequence of 2 x plus, minus, 3 x plus, 2 x minus, plus, 8 x minus, and 5 x plus, whereas OLR gives 7 x plus, minus, plus, 8 x minus, plus, minus, plus, minus, and plus. These sequences reveal that the middle block of results has a negative bias compared with the low and high block of results. On the basis of this observation, a nonlinear regression procedure (e.g., a quadratic one) may be more appropriate in this case.



View larger version (16K):
[in this window]
[in a new window]
 
Figure 1. Analysis of method 4 in Table 1Up .

(A) Measurements of serum estradiol-17ß by method 4 from Table 1Up compared with a reference method by different regression procedures. (B) Plot of the OLR residuals for method 4 from Table 1Up vs the serum estradiol-17ß concentration as determined by the reference method. (——-), OLR; (- ··· - ··· -), SPCA; (– – –), PBR; (· · · · ·), line of equality. Note that the lines for OLR and SPCA overlap.

From the fact that OLR and DR/SPCA were more closely related with each other than with PBR in all comparisons, we conclude that PBR should be applied with care to method comparisons that use medium sample sizes. PBR may treat too many data points as outliers. On the other hand, when discrepancies between different linear regression procedures are observed, this should be taken as a hint for an in-depth investigation of the underlying problem.

interpretation of the regression data for case 2 (method comparison data for serum potassium)
Table 2Up shows that in the case of clustered method comparison data (by clustered we mean a small concentration range of the xvariables, here within 1 decade), the values for slope and intercept according to the four regression procedures are quite similar for low to medium values of Sy||x (see methods 1–9). Note that for these nine methods, r values between 0.996 and 0.983 were found. This observation stands in relation to the previously mentioned restriction for using OLR dependent on r >=0.975 (1)(8). Interestingly, after logarithmic transformation of the data for estradiol-17ß, the range comes to fall within 1 decade, and the "critical" r value of 0.99 decreases to ~0.975. This indicates that the "r >=0.975 rule" (8) might be generally useful as a screening rule for valid application of OLR when data ranges are <1 decade. However, as mentioned above, from the purely statistical point of view, DR would be preferable also in those cases.

For methods 10–12, the slope according to OLR differs distinctly from the slopes according to DR, SPCA, and PBR. Notice that for the latter methods r values of 0.954 (method 10), 0.871 (method 11), and even 0.652 (method 12) were found. In those cases, are PBR, SPCA, or DR really the solution to the problem? Again, we would doubt such a statement. This can be substantiated by the graphical comparison of a "good" (method 7, r = 0.993), a "borderline" (method 10, r = 0.954), and a "poor" method comparison (method 11, r = 0.871; Fig. 2 ). Compared with method 7, the worse correlation of method 10 mostly seems to be associated with several outlying results. Indeed, robust OLR applying a trimming factor of 0.1 for outlying residuals (23) gives results nearly identical to those of PBR and DR (robust OLR, slope and intercept: 0.973 and 0.115 mmol/L; PBR, slope and intercept: 0.970 and 0.129 mmol/L; DR, slope and intercept: 0.973 and 0.135 mmol/L). Alternatively, outliers could have been eliminated on the basis of the 4 · Sy||x rule (10). In consequence, one would look for reasons for the poor outcome. We know from our study (20), that it was not the method that caused the problems, but the performance of the laboratory. Clearly, in this case it is not a different regression procedure that is helpful, but an investigation of the reasons for poor analytical quality.



View larger version (15K):
[in this window]
[in a new window]
 
Figure 2. Measurements of serum potassium by three routine methods (from Table 2Up ) compared with a reference method by different linear regression procedures.

(A), Method 7 (note that OLR, SPCA, and PBR are indistinguishable); (B), method 10; (C), method 11. (—–-), OLR; (- ··· - ··· -), SPCA; (– – –), PBR; (· · · · ·), line of equality.

In addition, for what concerns the interpretation of linear regression data, investigation of the statistical reliability of the estimates is often overlooked. For example, for method 10 (r = 0.954), the 95% confidence limits for slope and intercept according to DR were 0.973 ± 0.078 mmol/L and 0.135 ± 0.336 mmol/L, respectively. It follows that from the statistical point of view, one would conclude that there is no difference between the routine and reference methods because the respective confidence intervals include a slope of 1 and an intercept of 0. Hence, recalibration of the routine method would not be necessary. (Generally, the higher the total random error of a study, the higher the chance that statistical hypothesis testing is passed.) On the contrary, from the analytical point of view, one would certainly consider recalibration of the routine method, especially because the lower limit of the slope is 0.895 (0.973 - 0.078). However, using the method comparison for recalibration, ~8% uncertainty would be added to the original calibration slope because of the uncertainty of the regression in method comparison alone. We consider such a value as too high for a potassium test. (The CLIA limit for total error is ~10% for potassium concentrations at the high end of the reference interval.) This demonstrates again that statistical considerations alone cannot give useful interpretation of method comparison studies.

general recommendations
Present the data graphically and visually inspect them for adequacy of range and for outliers.

Inspect the data for linearity:

Correlation analysis:

Linear regression:

Interpretation:

Special note:

These recommendations are meant as help for interpreting method comparison studies. They will not work in every case; the skill, knowledge, and experience of the analyst are still the most important factor in adequate interpretation of, for example, regression estimates.


   Summary and Conclusion
Top
Abstract
Introduction
Materials and Methods
Results and Discussion
Summary and Conclusion
References
 
Our real-life data showed that the analytical input data have more influence on the reliability of the linear regression data than the particular regression procedure applied. Consequently, when OLR (in combination with correlation analysis) gives poor regression estimates, we recommend investigating the analytical reason for this performance instead of assuming that other linear regression procedures will improve the interpretation of the study. This investigation should address whether (a) the data for x and y are linearly related; (b) sa,tot is responsible for the poor correlation; (c) sample-related effects are present (Sy||x sa,tot); (d) the samples are adequately distributed over the investigated range; and (e) the number of samples used for the comparison is adequate to the purpose of the application of linear regression.


   Footnotes
 
Laboratorium voor Analytische Chemie, Faculteit der Farmaceutische Wetenschappen, Universiteit Gent, Harelbekestraat 72, B-9000 Gent, Belgium.

1 Nonstandard abbreviations: OLR, ordinary linear regression; sa, analytical imprecision; DR, Deming regression; SPCA, standardized principal component analysis; PBR, Passing–Bablok regression; sa,tot, total analytical imprecision; and IQC, internal quality control.


   References
Top
Abstract
Introduction
Materials and Methods
Results and Discussion
Summary and Conclusion
References
 

  1. Westgard JO, Hunt MR. Use and interpretation of common statistical tests in method comparison studies. Clin Chem 1973;19:49-57. [Abstract]
  2. Drager NR Smith H eds. Applied regression analysis 1966:44-103 John Wiley & Sons New York. .
  3. Deming WE. Statistical adjustment of data 1943:184 Wiley New York. .
  4. Bablok W, Heackel R, Meyers W, Wosniok W. Biometrical methods. Haeckel R eds. Evaluation methods in laboratory medicine 1993:203-241 VCH Verlagsgesellschaft Weinheim. .
  5. Passing H, Bablok W. A new biometrical procedure for testing the equality of measurements from two different analytical methods. Application of linear regression procedures for method comparison studies in clinical chemistry, Part I. J Clin Chem Clin Biochem 1983;21:709-720. [ISI][Medline] [Order article via Infotrieve]
  6. Passing H, Bablok W. Comparison of several regression procedures for method comparison studies and determination of sample sizes. Application of linear regression procedures for method comparison studies in clinical chemistry, Part II. J Clin Chem Clin Biochem 1984;22:431-445. [ISI][Medline] [Order article via Infotrieve]
  7. Bablok W, Passing H, Bender R, Schneider B. A general regression procedure for method transformation. Application of linear regression procedures for method comparison studies in clinical chemistry, Part III. J Clin Chem Clin Biochem 1988;26:783-790. [ISI][Medline] [Order article via Infotrieve]
  8. National Committee for Clinical Laboratory Standards. Method comparison and bias estimation using patient samples, tentative guideline. NCCLS publication EP9-A. Villanova, PA: NCCLS, 1995:1–36..
  9. Stöckl D. Beyond the myths of difference plots [Letter]. Ann Clin Biochem 1996;33:575-577.
  10. Cornbleet PJ, Gochman N. Incorrect least-squares regression coefficients in method-comparison analysis. Clin Chem 1979;25:432-438. [Abstract/Free Full Text]
  11. Wakkers PJM, Hellendoorn HBA, Op De Weegh GJ, Heerspink W. Applications of statistics in clinical chemistry. A critical evaluation of regression lines. Clin Chim Acta 1975;64:173-184. [ISI][Medline] [Order article via Infotrieve]
  12. Linnet K. Evaluation of regression procedures for method comparison studies. Clin Chem 1993;39:424-432. [Free Full Text]
  13. Linnet K. Performance of Deming regression analysis in case of misspecified analytical error ratio in method comparison studies. Clin Chem 1998;44:1024-1031. [Abstract/Free Full Text]
  14. Payne RP. Method comparison: evaluation of least squares, Deming and Passing/Bablok regression procedures using computer simulation. Ann Clin Biochem 1997;34:319-320.
  15. Lawton WH, Sylvestre EA, Young-Ferraro BJ. Statistical comparison of multiple analytic procedures. Technometrics 1979;21:397-416.
  16. Hollis S. Analysis of method comparison studies [Editorial]. Ann Clin Biochem 1996;33:1-4.
  17. Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1986;i:307-310.
  18. . HyltoftPetersen P, Stöckl D, Blaabjerg O, Pedersen B, Birkemose E, Thienpont L, et al. Graphical interpretation of analytical data from comparison of a field method with a reference method by use of difference plots [Opinion]. Clin Chem 1997;43:2039-2046. [Abstract/Free Full Text]
  19. Thienpont LM, De Leenheer AP. Efforts by industry towards standardization of serum estradiol-17ß measurements. Clin Chem 1998;44:671-674. [Free Full Text]
  20. Thienpont LM, Van Nuwenborg JE, Stöckl D. Intrinsic and routine quality of serum total potassium measurement as investigated by split-sample measurement with an ion chromatography candidate reference method. Clin Chem 1998;44:849-857. [Abstract/Free Full Text]
  21. Goldschmidt HMJ van Dongen CAJM eds. EVAL KIT [CKCHL software]. Version 3.1. A program for analytical, clinical and managerial evaluations of reagents and instruments 1994 Department of Clinical Chemistry, Centralized Laboratory for Clinical Chemistry and Hematology, St. Elisabeth Hospital Tilburg, The Netherlands. .
  22. Rhoads DG. EP Evaluator [Computer program]. Release 3. Eleven modules for evaluating methods 1995 DG Rhoads Associates Inc. Kennett Square, PA. .
  23. Systat 7.0 for Windows®[Computer program]. Chicago, Il: Statistics SPCC Inc., 1996..



The following articles in journals at HighWire Press have cited this article:


Home page
Clin. Chem.Home page
K. Lorentz
Routine {alpha}-Amylase Assay Using Protected 4-Nitrophenyl-1,4-{alpha}-D-maltoheptaoside and a Novel {alpha}-Glucosidase
Clin. Chem., May 1, 2000; 46(5): 644 - 649.
[Abstract] [Full Text] [PDF]


Home page
Clin. Chem.Home page
J. O. Westgard
Points of Care in Using Statistics in Method Comparison Studies
Clin. Chem., November 1, 1998; 44(11): 2240 - 2242.
[Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Submit an electronic Letter to
the Editor about this paper
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (26)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Stöckl, D.
Right arrow Articles by Thienpont, L. M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Stöckl, D.
Right arrow Articles by Thienpont, L. M.
Related Collections
Right arrow Laboratory Management


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS