|
|
||||||||
Editorials |
Department of Microbiology and Molecular Cell Biology, Center for Biomedical Proteomics, Virginia Prostate Center, Eastern Virginia Medical School, 700 W. Olney Road, Norfolk, VA 23508, E-mail semmesoj{at}evms.edu
Great medical benefit may result from biomarker discovery, but the scarcity of useful biomarkers among myriad genes and proteins makes this task every bit as daunting as finding the needle in a haystack. At the moment we are not even certain what the needle or the haystack looks like (although many of us are certain we know one when we see one) or, more precisely, how to separate the signal from the noise. The need for better disease management tools has placed considerable demand on the scientific community to find appropriate clinical biomarkers, ushering in the "omics" erathe application of specific technologies such as proteomics, genomics, and metabolomics along with the mainstreaming of high-throughput, high-volume analytical approaches.
Clearly the development and implementation of novel technologies as well as innovative new application of "old" technologies is justified. However, this pushing of the technical envelope must be balanced with careful scientific evaluation of the performance characteristics of each new paradigm. The collision of these two imperatives has never been more apparent than in the current debate over protein expression profiling and pattern recognitionbased diagnostics. The opposing forces of excitement associated with innovation (1)(2) and caution regarding bias, chance, and overgeneralization (3)(4)(5) must be balanced by the research community.
One case study for this issue is serum protein expression profiling. Following promising seminal work, many questions were raised after closer scrutiny of published data (3)(4)(5). Causes of concern included lack of analytical reproducibility, diminished robustness of discovered biomarkers during validation, and the fear that the prevalent detected serum proteins were produced by the liver. Indeed, the majority of these concerns can be attributed to bias, chance, and our rush to generalize results, a phenomenon nicely articulated in two recent articles (6)(7). The research community is called to strengthen its vigil over possible sources of bias, which can occur at many points along the discovery pathway. Although biostatisticians and epidemiologists bear the greatest responsibility for study design and data analysis, there are avoidable sources of experimental bias that must be recognized by laboratory scientists. Evaluation of analytical reproducibility and determination of sources of variability are essential steps in the biomarker discovery process. Identifying sources of sample bias introduced during clinical or laboratory processing allows for a greater understanding of the nondisease-related events that confound biomarker discovery.
Examining the influence of known sample processing variables on the spectral output after expression profiling analysis by mass spectrometry was the focus of the study by Rosamonde Banks and colleagues (8) in this issue. The authors introduced several changes in blood sample collection and processing and measured the resulting variability by use of surface-enhanced laser desorption/ionization time-of-flight mass spectrometry (SELDI-TOF MS). Specifically, they assessed the impact of anticoagulant, types of serum collection tubes, and elapsed time between venipuncture and sample analysis. The test samples were processed by use of various affinity-activated surfaces immediately before mass spectral analysis. As has been suggested elsewhere (9), plasma types and serum separator tube choice appeared to have a profound impact on the spectra observed. This variability was further confounded by the selected affinity surface, confirming a phenomenon that has been observed before but never demonstrated under controlled conditions (10). Of particular interest is the observation that the time from blood collection to analysis is a critical period in which changes in protein profiles occur. Whereas most researchers would assume that the quicker the serum sample is processed the better, these authors observed that a sample stabilizes following a 30-min period. Thus, the recommendation for protein expression profiling of blood samples would include a provision that analysis should not be performed until after an initial 30-min period. Clearly the skeptical researcher must be aware that this study investigated only a small number of many possible variables. Thus, although analysis after a 30-min stabilizing period will avoid these reported blood changes, there is no guarantee that analysis between 30 min and 4 h will avoid artifacts in all features of the serum proteome. Indeed, the results presented by Banks and colleagues (8) serve to underscore the need for further and more extensive examination of the constituents of the "haystack".
When we attempt to understand the possible sources of sample processing variability it is useful to examine known biological phenomena that might give rise to observed events. The authors approach this question by examining the impact of the clotting process on the sample-specific spectra. Their analysis of the spectral profile showed alterations of many fairly prominent peaks corresponding with in vitro manipulation of platelets. The authors list several m/z peaks that were altered in their system, providing a useful catalog for the research community, but the strongest take-home message here is that these sorts of perturbation studies should be routinely conducted as a means of pinpointing possible confounding events. The knowledge that biological phenomena such as platelet activation can profoundly influence observed mass spectral output demands that "omics" researchers examine study protocols for population characteristics that would affect the platelet activation process.
A benefit of studies designed to discover sources of experimental bias is the direct definition of confounders that lead to incorrect associations and erroneous conclusions. Such new knowledge will facilitate study design and data interpretation. Our efforts toward uncovering the elusive protein biomarker in the proteome require a complete understanding of the proteomic haystack. The observed platelet-dominated changes seen in the unfractionated abundant proteins may or may not be relevant among the less abundant proteins, but selective removal of platelets before sample analysis might prove useful. Conceptually, one might reduce the impact of the haystack by targeting subproteomes. In this respect, biological perturbation studies may serve as nice systems for developing techniques that minimize the observation of nondisease-related events. The more we define the biological noise, the easier it will be for us to find the biologically relevant signal.
References
The following articles in journals at HighWire Press have cited this article:
![]() |
R. R. Drake, E. E. Schwegler, G. Malik, J. Diaz, T. Block, A. Mehta, and O. J. Semmes Lectin Capture Strategies Combined with Mass Spectrometry for the Discovery of Serum Glycoprotein Biomarkers Mol. Cell. Proteomics, October 1, 2006; 5(10): 1957 - 1967. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |