|
|
||||||||
Electronic Letters to:
|
|
Electronic letters published:
|
|
|||
|
Andrew KRAMAR, Biostatistician CRLC Val d'Aurelle, Montpellier, France, David Faraggi, Antoine Fortuné, Ben Reiser
Send letter to journal:
akramar{at}valdorel.fnclcc.fr Andrew KRAMAR, et al.
|
Eight computer programs for ROC analysis (1) have been compared for
several criteria: data input (10%), data output (15%), correctness (40%),
completeness (20%), software comfort (10%) and manual (5%). Since our mROC
program (2) obtained the lowest score, we feel it necessary to comment on
these weights since not enough detail is provided. First of all, all
programs obtained the maximum score for correctness, however, we can see
from the examples in table 4 that the results are not the same. Only
MedCalc and mROC give “correct” non-symmetrical confidence intervals,
although mROC is slightly more conservative. All the other programs use
symmetric normal approximations to the confidence intervals, yet non-
parametric methods are used for all the other calculations. As far as
completeness is concerned, we feel it unjustified to attribute the lowest
score to mROC, since the reason that mROC was developed in the first place
was to be able to handle correlated multivariate data appropriately by
selecting the best linear combination of markers which maximizes the area
under the curve, by first transforming the data into a gaussian
distribution. All other programs only allow two-way comparisons, which
will become very impractical and difficult to summarize once there are
more than 4 diagnostic tests to be compared. Also, it should be remembered
that there is a direct relationship between p-values obtained by a
comparison of 2 AUC’s and their respective confidence intervals, which
provide more information. As far as data input is concerned, we feel that
any program using ROC methodology should be based on valid data. If
clinicians will need to “correct” data within one of these programs, this
means that there has been a neglect in the data management process of the
study, and it is not within an analysis program that these corrections
should be performed. By doing so, suspicions will be raised as to why data
needs to be “modified” or “deleted”. Every computer program should have
it’s own specificity and may need to be used by persons sensitive to the
requirements of assessing and comparing the diagnostic validity of
correlated laboratory tests within a multivariate environment. |
|||
|
|
|||
|
Frank Schoonjans University Hospital Gent, Dept. Endocrinology, Belgium
Send letter to journal:
frank.schoonjans{at}rug.ac.be Frank Schoonjans
|
The authors of the paper comparing 8 computer programs for ROC analysis (1) seem to have missed a difference in the methods used by the different programs to calculate the confidence interval (CI) of the area under the ROC curve (AUC). In table 4 of this publication, one can distinguish 2 groups of programs: the first group containing MedCalc and mROC, who have a more narrow and non-symmetrical CI, and a second group of programs who give a symmetrical CI. The reason for this is that MedCalc calculates the CI using the Binomial distribution which is a more precise method than the method based on the Normal distribution used by the other programs (2). mROC calculates the CI of the AUC using the method of Mee (3) yielding a comparable CI as when using the Binomial distribution method. This difference in methodology may be less important for the CI of the example AUC of 0.702, but the difference becomes more important, and the Normal distribution method more inappropriate, when the AUC approaches 1. In addition, GraphROC is praised for its ability to compare ROC curves at a certain sensitivity or specificity cutoff. However, statistical testing at specific sensitivity or specificity values is known to be impossible in most cases using non-parametric methods (4), which GraphROC claims to use. We therefore suggest that the methodology used by GraphROC may be unsound. References 1. Stephan C, Wesseling S, Schink T, Jung K. Comparison of eight computer programs for receiver-operating characteristic analysis. Clin Chem, 2003;49:433-9. 2. Hanley JA, McNeil BJ. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology, 1982;143:29-36. 3. Mee RW. Confidence intervals for probabilities and tolerance regions based on a generalization of the Mann-Whitney statistic. J Amer Statist Assn, 1990;85:793-800. 4. Zweig MH, Campbell G. Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical chemistry. Clin Chem, 1993;39:561 -77. |
|||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH |