|
|
||||||||
Lipids and Lipoproteins |
1
Environmental Health Laboratory Sciences Division, National Center for Environmental Health, National Centers for Disease Control and Prevention, 4770 Buford Hwy. NE, F25, Atlanta, GA 30341-3724.
a Author for correspondence. Fax 770-488-4192; e-mail spc1{at}cdc.gov.
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
Using TC measurements taken 1 year apart from 14 600 men and women, Thompson and Pocock (15) studied the implications of serum TC measurement variability on screening and monitoring lipid risk classification. They observed a within-subject total CV of 7.4% and a within-subject biological CV of 6.5%. They concluded that a single measurement of TC could be used to reliably distinguish (i.e., with >95% probability) between true values above and below the high 75th percentile of 6.9 mmol/L (265 mg/dL) only when the true TC value was >7.8 mmol/L (300 mg/dL) or <6.1 mmol/L (235 mg/dL). Gillman et al. (16) found that among 24 subjects 6.18.8 years of age, one TC measurement allowed reliable assignment to the acceptable category (<4.4 mmol/L, or 170 mg/dL) only if the measured value was <4.01 mmol/L (154.9 mg/dL) and to the high category (>5.17 mmol/L, or 200 mg/dL) only if the measured value was >5.56 mmol/L (215.1 mg/dL). With one TC measurement, no value allowed assignment to the borderline-high (4.405.17 mmol/L, or 170199 mg/dL) category, and one reading <4.78 mmol/L (184.9 mg/dL) allowed reliable classification below the high (5.17 mmol/L, or 200 mg/dL) cut point.
These concerns about reliable lipid risk classification have led us to
examine the effect of systematic bias and random error, QC, and
intraperson biological variation (CVb) on the NCEP clinical
classifications for reported values of lipid measurements. For the
purposes of our analyses, misclassification is considered to occur if a
true lipid homeostatic set point is within the range for desirable risk
but the reported lipid value is in the range for high risk, or if a
true lipid homeostatic set point is within the range for high risk but
the reported lipid value is in the range for desirable risk. These
criteria for misclassification represent a medically useful approach
because they address the practical clinical situation faced by
physicians who do not want to unnecessarily treat a patient whose lipid
concentration is in a desirable risk category or fail to treat a
patient whose lipid concentration is in a high-risk category and who
want to avoid the unrealistic and practically impossible situation of
trying to distinguish between desirable and borderline risk categories
or between borderline and high-risk categories when lipid values are
near a cut point. Misclassification as defined here is of greatest
concern because of its potential to create a financial or psychological
burden on the patient. Our approach examines the joint probability of
the following two events that must occur simultaneously for incorrect
patient classification: (a) the laboratory obtains a
measured value within the range for high risk when the true homeostatic
set point is in the range for desirable risk, or the laboratory obtains
a measured value within the range for desirable risk when the true
homeostatic set point is in the range for high risk; and (b)
the QC sample(s) measured during the analytical run in which the
patient specimen was analyzed are within acceptable limits
(17). The conclusions we reach by this approach indicate
that a laboratory satisfying the NCEP recommendations and performing
adequate QC procedures can attain correct classifications for TC, TGs,
HDLC, and LDLC with probability
0.97, except for LDLC (with
probability >0.90) when the systematic bias is between 0.5
SDAnalytic and 2.0 SDAnalytic.
| Materials and Methods |
|---|
|
|
|---|
Each OC curve corresponds to the estimated probability of correct
classification and is determined by subtracting the joint probability
estimated above from the number one. The probability associated with
whether a laboratory will obtain a result beyond a specified decision
limit is based on gaussian distribution theory, where the mean of the
distribution is assumed to be determined by the true value of the
sample plus the specified laboratory bias, and the variance is assumed
to be determined by the specified CVb and the specified laboratory
precision. The probability associated with whether a patient result
will be reported is based on a computer simulation of a QC procedure
using a multirule Shewhart chart (18) with multirules
12S, 13S, 22S, R4S,
41S, and 10
applied to two QC samples
per analytical run from either one QC pool (for a total of only two QC
samples) or two QC pools (for a total of four QC samples). When two QC
pools are used, we apply the multirules to each pool separately and
assume that patient results will only be reported if analytical runs
for both QC pools corresponding to the patient specimen measurement are
acceptable. Each probability estimated from a QC simulation is based on
5000 analytical runs and is computed by dividing the number of
analytical runs with at least one reject signal by the total number of
analytical runs. A reject signal from a multirule that spans more than
one analytical run is considered to occur only when the multirule
condition has been satisfied for the designated number of analytical
runs required to produce a reject signal.
The joint probability of correct classification is applicable to an
individual patient with specified CVb whose health status is being
determined on the basis of a single randomly collected specimen and
analyzed in a laboratory with a specified accuracy and precision. The
determination of health status is assumed to be based on whether the
measured patient lipid result is within the "desirable" or
"undesirable" range. These ranges are presented in Table 1
and correspond to the NCEP Laboratory Panel's recommended
analytical performance guidelines (1). The specified CVbs
are also presented in Table 1
and correspond to those estimated by
Smith et al. (19). The specified accuracy and precision,
also presented in Table 1
, correspond to the maximum allowable values
recommended in the NCEP guidelines.
|
The OC curves provide an overall indication of the likelihood of correct patient classification by mapping performance characteristics as functions of inherent and systematic bias and inherent and increased random analytic error. Also provided with the OC curves for correct classification probabilities are OC curves for the individual probabilities associated with the two independent events on which the joint probabilities were computed.
| Results |
|---|
|
|
|---|
|
|
The vertical axis of each plot represents the probability associated
with a labeled event. The ordinate value of each point labeled with an
"L" represents the probability that a laboratory result on a
specimen from a patient whose true homeostatic lipid mean is at the
limit of the desirable range will exceed the limit of the undesirable
range under the condition specified by the abscissa, assuming that the
patient has the specified CVb and that the laboratory has the NCEP
maximum allowable CVa and bias. Thus, in Fig. 1A
, the probability that
a laboratory result on a specimen from a patient with a true TC mean
concentration of 5.17 mmol/L (200 mg/dL) and a CVb of 6.1%
will exceed 6.21 mmol/L (240 mg/dL) is 0.10 in a laboratory with an
inherent CVa of 3%, an inherent bias of 3%, and a systematic bias 2.5
times the inherent analytic SD.
The ordinate value of each point labeled with an "R" represents the
probability that a patient result will be reported based on the
single-pool (solid line) or two-pool (dashed line) multirule Shewhart
QC procedure described in Materials and Methods under the
condition specified by the abscissa, assuming that the laboratory has
the NCEP maximum allowable CVa and the NCEP maximum allowable bias.
Thus, in Fig. 1A
, the probability that a patient result will be
reported is 0.31 for single-pool QC or 0.10 for two-pool QC in a
laboratory with an inherent CVa of 3%, an inherent bias of 3%, and a
systematic bias 2.5 times the inherent analytic SD. The ordinate value
of each point labeled with a "P" represents the joint probability
that a patient result will be correctly classified as being in the
desirable range if single-pool QC (solid line) or two-pool QC (dashed
line) is used under the condition specified by the abscissa, assuming
that the patient has the specified CVb and that the laboratory has the
NCEP maximum allowable CVa and the NCEP maximum allowable bias. Thus,
in Fig. 1A
, the probability that a laboratory result (on a specimen
from a patient with a true TC mean concentration of 5.17 mmol/L (200
mg/dL) and CVb of 6.1%) will be correctly classified as having a TC
value
5.17 mmol/L (200 mg/dL) is 0.97 for single-pool QC and 0.99 for
two-pool QC in a laboratory with an inherent CVa of 3%, an inherent
bias of 3%, and a systematic bias 2.5 times the inherent analytic SD.
We obtained results similar to those presented in Figs. 1
and 2
using
Shewhart mean and range charts rather than multirule QC procedures. The
OC curves in Figs. 1
and 2
(as well as similar curves for TG and HDLC)
for correct classification probability based on single- or two-pool QC
procedures never drop below 0.96 when the inherent bias and CVa are at
the extremes of the NCEP recommendations and there is no systematic
bias or increased random analytic error. In addition, the curves for
two-pool QC never drop below 0.97 when there are increases in
systematic bias or random analytic error, except for LDLC when the
systematic bias is between 0.5 SDAnalytic and 2.0
SDAnalytic. In this range, the probability that the patient
LDLC result will be reported has not declined enough to mitigate the
influence of systematic bias on the probability that the laboratory
result will exceed 4.14 mmol/L (160 mg/dL). The lowest probability of
correct LDLC classification based on two-pool QC over the length of
this interval is 0.91.
To elucidate some specific cases included in the OC charts, we demonstrate the joint probability calculations associated with correct classification of a patient, based on TC measured in a laboratory under "worst case" conditions (i.e., the true patient mean is at a decision limit and during characterization the laboratory is operating just within the NCEP guidelines: inherent bias is 3%, and inherent CVa is 3%). We assume that the laboratory measures a specimen from a patient who on average (throughout the course of a year) has a TC value of 5.17 mmol/L (200 mg/dL) and that the CVb for this patient is 6.1%. We then determine the probability that this patient will be misclassified (i.e., declared to have a TC value of 6.21 mmol/L [240 mg/dL] or greater) on the basis of a single patient specimen obtained at a random time period during the year. We will also assume that the laboratory that measures the patient specimen routinely measures two QC samples per analytical run from each of two QC pools (for a total of four QC samples). There are essentially four groups of scenarios under which a misclassification could occur during the analytical run in which the patient specimen is analyzed: (a) the inherent bias is 3%, there is no systematic bias, and the CVa does not exceed 3%; (b) the inherent bias is 3%, there is an additional systematic bias, and the CVa does not exceed 3%; (c) the inherent bias is 3%, there is no systematic bias, and the CVa exceeds 3%; and (d) the inherent bias is 3%, there is an additional systematic bias, and the analytic CVa exceeds 3%. The probability of correct patient classification can be computed for any specific scenario within these four groups. No probability, however, can be attached to whether one particular scenario or another will occur, because we do not know the probability that a particular systematic bias or increase in CVa will or will not occur. The joint probability calculations for the worst case scenario from each of these four groups, which are presented in detail below, produce joint probabilities of correct classification of 0.993, 0.998, 0.995, and 1.00, respectively.
(a) The measurement system is just within the NCEP accuracy and
precision guidelines (i.e., as a worst case, inherent CVa is 3% and
inherent bias is 3%) during the analytical run on which the patient
specimen is analyzed.
The joint probability of correct
classification in this case is 0.993 (= 1.00 - 0.0076 x
0.98). That is, it is the complement of the product of the
probabilities of two independent events: (a) the measured
result (on average, M = 1.03 x 5.1720 mmol/L [1.03 x
200 mg/dL]) will be >6.2064 mmol/L (240 mg/dL), which is equivalent
to a standard normal variate being >2.43 (= [6.2064 -
M]/[(.03 x M) (.061 x
M)]1/2; this event occurs with
probability = 0.0076); and (b) the QC samples measured
during the analytical run in which the patient specimen was analyzed
will be within acceptable limits (this event occurs with a probability
approximately equal to 0.99 when there is no systematic
bias over and above the inherent 3% bias and two QC samples per
analytical run from two QC pools are used).
(b) The measurement system exceeds the NCEP accuracy guidelines
during the analytical run in which the patient specimen is
analyzed.
For example, suppose that, in addition to its allowable
inherent bias of 3%, the laboratory has the minimum systematic shift
that is detectable with probability = 0.90 using the single-pool,
or 0.99 using the two-pool multirule Shewhart QC procedure described in
Materials and Methods. (Note: The results for other
systematic shift scenarios are indicated in Fig. 1A
.) For two-sample
single-pool QC, the minimum detectable shift is 9.3%. (Note: 9.3%
represents 3.1 SD and was determined from simulations similar to those
performed by Westgard et al. (20).) Thus, in a worst case
scenario (i.e., inherent CVa is 3% and inherent bias is 3%), a
laboratory might have an operating bias of 12.3% (= 9.3% 3.0%).
The joint probability of correct classification in this case is 0.998
(=1.00 - 0.1566 x 0.01). That is, it is the complement of
the product of the probabilities of two independent events:
(a) the measured result (on average M = 1.123 x
5.1720 mmol/L [1.123 x 200 mg/dL]) will be >6.2064 mmol/L (240
mg/dL), which is equivalent to a standard normal variate being >1.01
(=[6.2064 - M]/[(.03 x M) (.061 x
M)]1/2; this event occurs with
probability = 0.1566); and (b) the QC samples measured
during the analytical run in which the patient sample was analyzed will
be within acceptable limits (this event occurs with probability
approximately equal to 0.10 when two-pool QC is used and
there is a systematic bias of 9.3% over and above the inherent 3%
bias).
(c) The measurement system exceeds the NCEP precision guidelines
during the analytical run on which the patient specimen is
analyzed.
For example, suppose that in addition to its allowable
inherent CVa of 3%, the laboratory has the minimum increase in CVa
that is detectable with probability = 0.80 using the single-pool,
or 0.96 using the two-pool multirule Shewhart QC procedure described in
Materials and Methods. (Note: Results for other increases in
CVa are included in Fig. 1B
.) For two-sample single-pool QC, the
minimum detectable increase in CVa is 3.5 times the characterization
CVa, which corresponds to an operating CVa of 13.5% rather than 3.0%.
(Note: The minimum detectable increase in CVa of 3.5 was determined
from simulations similar to those performed by Westgard et al.
(20).) As before, we need to compute the probability that
the patient will have a measured result (on average, M = 1.03
x 5.1720 mmol/L [1.03 x 200 mg/dL]) >6.2064 mmol/L (240
mg/dL). If the total CVa deteriorates from 3% to 13.5% and if the CVb
is 6.1%, then in a worst case scenario (i.e., inherent CVa is 3% and
inherent bias is 3%) the laboratory would have a 3% positive bias in
addition to its 13.5% CVa. The joint probability of correct
classification in this case is 0.995 (= 1.00 - 0.1326 x
0.04). That is, it is the complement of the product of the
probabilities of two independent events: (a) the measured
result (on average, M = 1.03 x 5.1720 mmol/L [1.03 x
200 mg/dL]) will be >6.2064 mmol/L (240 mg/dL), which is equivalent
to a standard normal variate being >1.115 (= [6.2064 -
M]/[(.135 x M) (.061 x
M)]1/2); this event occurs with
probability = 0.1326); and (b) the QC samples measured
during the analytical run in which the patient sample was analyzed will
be within acceptable limits (this event occurs with probability
approximately equal to 0.20 when two-pool QC is used and
there is an increase in CVa that is 3.5 times the inherent CVa of 3%).
(d) The measurement system is not meeting NCEP accuracy or
precision guidelines during the analytical run on which the patient
specimen is analyzed.
For example, suppose that, in addition to
its allowable inherent bias of 3% and its allowable total CVa of 3%,
the laboratory has the minimum detectable systematic shift with
power = 0.90 and the minimum detectable increase in total CVa with
power = 0.80, using the single-pool multirule Shewhart QC
procedure described in Materials and Methods and
corresponding values of 0.99 and 0.96, using the two-pool procedure.
Thus, in a worst case scenario (i.e., inherent CVa is 3% and inherent
bias is 3%), a laboratory might have an operating bias of 12.3% (=
9.3% 3.0%) and a total CVa that deteriorates from 3% to 13.5%.
Then, if the CVb is 6.1%, the total CV associated with a given patient
result will be 14.8%. The joint probability of correct classification
in this case is ~1.00 (= 1.00 - 0.3217 x
1.0E-12). That is, it is the complement of the product of
the probabilities of two independent events: (a) the
measured result (on average, M = 1.123 x 5.1720 mmol/L
[1.123 x 200 mg/dL]) will be >6.2064 mmol/L (240 mg/dL), which
is equivalent to a standard normal variate being >0.465 (=
[6.2064 - M]/[(.135 x M) (.061 x
M)]1/2); this event occurs with
probability = 0.3217); and (b) the QC samples measured
during the analytical run in which the patient sample was analyzed will
be within acceptable limits (this event occurs with probability =
0.000001 when two-pool QC is used, there is a 3.5-fold
increase in CVa, and a systematic bias of 9.3% over and above the
inherent 3% bias). Thus, although the patient result is quite likely
(probability = 0.3217) to be >6.2064 mmol/L (240 mg/dL), the
probability that the patient result will be reported based on the
outcome of the two QC pools is extremely unlikely (probability =
0.000001).
In Table 2
, we present the lowest expected probability of correct patient
classification over the range of systematic biases or increases in
total CVa considered. In each case, we assume that a single patient
specimen is measured, that the CVb is as given in Table 1
, that the
laboratory verified that it was just meeting the NCEP guidelines for
accuracy and precision during characterization of the method, and that
the single- or two-pool multirule Shewhart QC procedure described in
Materials and Methods is used. Column 5 of Table 2
displays
the probability that a measured patient result will exceed the decision
limit specified in column 3. The highest probability in column 5 is
0.1728 and corresponds to the probability that a person with an average
LDLC of 4.14 mmol/L (160 mg/dL) would have a measured value <3.36
mmol/L (130 mg/dL) if the laboratory has (in addition to its inherent
bias of 4%) a systematic bias equal to 1.5 analytic SD. Based on a
single-pool multirule Shewhart QC procedure using two QC samples per
analytical run (for a total of only two QC samples), the probability
that this result would be reported is 0.74 (column 6), so that the
final probability of correct classification is 0.87 (column 7). This
final probability could be increased to 0.91 (column 7 in parentheses)
by using a two-pool multirule Shewhart QC procedure with two QC samples
per analytical run (for a total of four QC samples). These results, as
well as those in Figs. 1
and 2
, clearly demonstrate the important role
QC plays in ensuring correct patient classification.
|
As mentioned in Materials and Methods, the CVbs used to
generate the OC curves in Figs. 1
and 2
and Table 2
are average CVb
values estimated by Smith et al. (19). If a particular
individual has a larger CVb than that given in Table 1
for a particular
lipid, the probability of correct classification will decrease
slightly. For example, in scenario 2 above, if the CVb = 10.0%
instead of 6.1%, the probability of correct classification would be
0.997 instead of 0.998. Thus, individuals with larger than average CVbs
are not as likely to be correctly classified, but the decrease in
likelihood is minimal.
| Discussion |
|---|
|
|
|---|
We have demonstrated for TC, TGs, HDLC, and LDLC that the NCEP accuracy
and precision recommendations are adequate to ensure a high likelihood
(>90%) of correct patient classifications. Actually, we found that a
laboratory satisfying the NCEP recommendations and performing adequate
QC procedures can attain correct classifications for TC, TGs, HDLC, and
LDLC with probability
0.97, except for LDLC (with probability >0.90)
when the systematic bias is between 0.5 SDAnalytic and
2.0 SDAnalytic. These analyses assume that laboratories are
meeting the NCEP guidelines for inherent method bias and analytic
precision and are using standard QC procedures (e.g., Shewhart mean and
range chart QC or multirule Shewhart QC) that incorporate at least
two QC samples from each of two QC pools (for a total of four QC
samples). We suggest, therefore, that at least two concentrations of QC
material be included in the QC scheme to ensure that the measurement
system is operating within desired specifications across the entire
range of desirable and high-risk lipid concentrations and to ensure
with high probability that patients are correctly classified.
It is important, however, to note that inherent method bias, unlike systematic bias, cannot be determined by QC procedures. Therefore, a method characterization experiment must be conducted that includes external reference materials with externally assigned target values. The estimated mean bias from such an experiment should fall within the NCEP accuracy guidelines. QC characterization analyses can be used to compute the inherent CVa, which should fall within the NCEP precision guidelines. Once a laboratory has established that the NCEP recommendations are satisfied, routine QC procedures such as described in the previous paragraph should provide adequate protection against reporting patient results when the method is no longer meeting previously verified performance standards.
Although not directly addressed in this paper, we assumed that the QC analysis length (the number of patient samples analyzed between each QC sample) has been optimized such that QC outcomes truly relate to patient samples (22). We also suggest that CVb can be reduced by obtaining two serial patient specimens at least 1 week apart (23). The relative range of the two results can be used to determine if additional patient specimens are required because of unusually high CVb. Of course, even if perfect patient lipid classification could be achieved, it would not guarantee perfect clinical diagnosis, which depends on the accuracy of the lipid screening strategy used to identify an individual at increased risk (24).
| Acknowledgments |
|---|
| Footnotes |
|---|
| References |
|---|
|
|
|---|
The following articles in journals at HighWire Press have cited this article:
![]() |
C. Gillespie, C. Ballew, B. A Bowman, R. Donehoo, and M. K Serdula Intraindividual variation in serum retinol concentrations among participants in the third National Health and Nutrition Examination Survey, 1988-1994 Am. J. Clinical Nutrition, April 1, 2004; 79(4): 625 - 632. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. M. Blanck, B. A. Bowman, G. R. Cooper, G. L. Myers, and D. T. Miller Laboratory Issues: Use of Nutritional Biomarkers J. Nutr., March 1, 2003; 133(3): 888S - 894. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. M. Gotto and G. R. Cooper Citation Classics in Lipid Measurement and Applications Clin. Chem., November 1, 1998; 44(11): 2234 - 2237. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |