|
|
||||||||
Informatics and Statistics |
1 Division of Metabolic Diseases, Department of General Pediatrics, University Childrens Hospital, Heidelberg, Germany.
2 Department of Pediatrics and Institute of Clinical Chemistry, Hamburg University Medical Center, Hamburg, Germany.
3 Department of Medical Informatics, University of Heidelberg Medical Center, Heidelberg, Germany.
aAddress correspondence to this author at: University Childrens Hospital, Division of Metabolic Diseases, Im Neuenheimer Feld 150, D-69120 Heidelberg, Germany. Fax 49-6221-564069; e-mail Sirikit.Ho{at}med.uni-heidelberg.de.
Background: In newborn screening with tandem mass spectrometry, multiple intermediary metabolites are quantified in a single analytical run for the diagnosis of fatty-acid oxidation disorders, organic acidurias, and aminoacidurias. Published diagnostic criteria for these disorders normally incorporate a primary metabolic marker combined with secondary markers, often analyte ratios, for which the markers have been chosen to reflect metabolic pathway deviations.
Methods: We applied a procedure to extract new markers and diagnostic criteria for newborn screening to the data of newborns with confirmed medium-chain acyl-CoA dehydrogenase deficiency (MCADD) and a control group from the newborn screening program, Heidelberg, Germany. We validated the results with external data of the screening center in Hamburg, Germany. We extracted new markers by performing a systematic search for analyte combinations (features) with high discriminatory performance for MCADD. To select feature thresholds, we applied automated procedures to separate controls and cases on the basis of the feature values. Finally, we built classifiers from these new markers to serve as diagnostic criteria in screening for MCADD.
Results: On the basis of
2 scores, we identified
800 of >628 000 new analyte combinations with superior discriminatory performance compared with the best published combinations. Classifiers built with the new features achieved diagnostic sensitivities and specificities approaching 100%.
Conclusion: Feature construction methods provide ways to disclose information hidden in the set of measured analytes. Other diagnostic tasks based on high-dimensional metabolic data might also profit from this approach.
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |