Clinical Chemistry
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH
 QUICK SEARCH:   [advanced]


     


Clinical Chemistry 0: clinchem.2007.097923v1, 2008; 10.1373/clinchem.2007.097923
This Article
Right arrow Full Text (PDF)
Right arrow Supplemental Data
Right arrow All Versions of this Article:
clinchem.2007.097923v1
54/8/1372    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Lambert, J.
Right arrow Articles by Carrat, F.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Lambert, J.
Right arrow Articles by Carrat, F.

Received on September 18, 2007
Accepted on May 6, 2008

Informatics and Statistics

How to Measure the Diagnostic Accuracy of Noninvasive Liver Fibrosis Indices: The Area Under the ROC Curve Revisited

Jerome Lambert 1, Philippe Halfon 2, Guillaume Penaranda 3, Pierre Bedossa 4, Patrice Cacoub 5, Fabrice Carrat 1*

1 Université Pierre et Marie Curie-Paris; INSERM, UMR-S 707; unité de santé publique, Assistance Publique Hôpitaux de Paris, Hôpital Saint-Antoine, Paris, France
2 Laboratoire Alphabio, Hôpital Ambroise Paré, Marseille, France
3 Département de Biostatistiques, CDL Pharma, Marseille, France
4 Service d'Anatomie Pathologique, Assistance Publique Hôpitaux de Paris, Hôpital Beaujon, Clichy, France
5 Service de Médecine Interne, Hôpital Pitié-Salpêtrière, Assistance Publique Hôpitaux de Paris, Paris, France

* To whom correspondence should be addressed. E-mail: carrat{at}u707.jussieu.fr.

BACKGROUND: The area under the receiver operating characteristics curve (AUC) is widely used to measure the diagnostic accuracy of noninvasive fibrosis indices. However, use of the AUC assumes a binary gold standard, whereas fibrosis staging is based on an ordinal scale and also depends on the distribution of fibrosis stages in the study sample. We explored other fibrosis staging accuracy measures designed for ordinal gold standards, the C-statistic and the Obuchowski measure.

METHODS: We performed a simulation study to assess the bias in estimating the accuracy measures when the distribution of fibrosis stages in the study sample do not fit the reference distribution in the population to which the indices are applied. We also estimated the type I error of the tests comparing these measures in 2 samples with different distributions of fibrosis stages. We illustrated the practical use of these measures by reanalyzing real data.

RESULTS: Compared with the AUC or the C-statistic, the Obuchowski measure showed limited bias when the distribution of fibrosis stages in the study sample differed from the reference distribution. The type I error was strongly inflated with the AUC or the C-statistic but was preserved in the Obuchowski measure. When we compared noninvasive indices on real data, AUC analysis led to discordant results depending on how the fibrosis stages were grouped together. One single conclusion was drawn from the analysis based on the Obuchowski measure.

CONCLUSIONS: We recommend using the Obuchowski measure for assessing the diagnostic accuracy of noninvasive indices of fibrosis.







HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH
Copyright © 2008 by the American Association for Clinical Chemistry.