|
|
||||||||
Articles |
1
Division of Laboratory Systems, Public Health Practice Program Office, Centers for Disease Control and Prevention, Atlanta, GA 30341.
2
Statistics and Public Health Research Division,
Analytical Sciences, Inc., Durham, NC 27713.
a Address correspondence to this author at: Laboratory Practice Assessment Branch, Division of Laboratory Systems, Public Health Practice Program Office, Centers for Disease Control and Prevention, 4770 Buford Hwy NE, Mailstop G-23, Atlanta, GA 30341-3724. Fax 770-488-8275; e-mail sns9{at}cdc.gov.
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
15% of the time
(3)(4)(5)(6)(7)(8)(9)(10). The rate of clinically significant errors affecting
patient outcome was even smaller,
0.1% (9)(10)(11). These
rates, however, are based on self-reports by medical clinics, clinical
laboratories, and other sites involved in the total testing process
(TTP).2
The true problem rate is greater, and is a function of the
observed problem rate, the effectiveness of existing monitoring and
reporting systems, and how a problem is defined
(4)(5). In 1990, CDC began conducting the five studies listed in Section 4(a) of the CLIA, including, "... study of the effect on laboratory test accuracy of errors in each of the components of the clinical testing process ... ". In late 1990, CDC solicited a study proposal to address the five study areas outlined in CLIA. Insufficient resources, however, prevented implementation of the proposed design (12)(13). Therefore, CDC began implementing focused, independent projects to address issues that would have been covered by the more comprehensive design. In 1994, CDC began developing and evaluating a prototype process that used a split-specimen (SS) design to determine the frequency and type of problems that occurred in certain portions of the TTP for specific laboratory tests performed in both hospital laboratories (HLs) and physician office laboratories (POLs). A 3-month feasibility study was conducted to evaluate the use of an SS process to determine the frequency and type of problems that occurred in a portion of the TTP (14). On the basis of these findings (14), the present full-scale evaluation study was initiated to evaluate the SS design and its logistics, to assess the usefulness of the SS methodology for measuring test result discrepancies, and to assess the feasibility and effectiveness of the SS process as a quality-assessment system for laboratory testing.
| Materials and Methods |
|---|
|
|
|---|
|
participating facilities
All 11 facilities (8 POLs and 3 HLs) were located within 240
kilometers (150 miles) of Durham, NC. Selection criteria for
POLs, in addition to geographic location, were an annual volume of
>5000 tests and performing each of the two (TC and K) tests. Four of
the POLs served 19 (mean, 6.0) clinicians in different multispecialty
practices, two POLs served 1 and 8 clinicians in internal medicine
practices, and two POLs served 3 and 48 clinicians in family practice
settings. Laboratories in three hospitals ranging in size from
100
to
1000 beds each served a referral multispecialty practice: 13
clinicians in an internal medicine practice, and 2 clinicians in a
family practice setting.
referral laboratory
A commercial RL central to the study participants was selected.
The RL analyzed specimens within 18 h of blood collection, with
results then electronically transmitted and imported into the study
database.
referee laboratory
An academic referee laboratory was selected to adjudicate testing
discrepancies and to conduct audit-sample analysis. This laboratory was
certified through the Lipid Standardization Program of the CDC to
perform TC testing using standardized testing procedures referenced to
the Abell-Kendall reference method (15) and participated
satisfactorily in an accredited College of American Pathologists
proficiency testing (PT) program for TC and K.
holding facility
Sera obtained from S3 specimens were sent to Analytical Sciences,
Inc., the contractor for this study, where the integrity of each sample
was examined visually and noted if compromised, recentrifuged if
necessary, and stored at -20 °C for possible future retrieval.
Randomly selected samples (up to 30 per clinic) were retrieved and
divided into three audit samples. Samples were also divided when the
percentage of the difference between S1 and
S2 results exceeded set values (see the
discussion of SS design earlier in this section).
patient selection
Any patient from 18 to 80 years of age for whom the laboratory
test was ordered for a specific clinical reason and not as part of a
laboratory test profile was eligible for inclusion. Excluded patients
were those who were unable or unwilling to provide informed consent,
those who had given
100 mL of blood, or those whose blood was
collected by fingerstick. The six facilities involved in TC testing
recruited 30191 patients (mean, 108), whereas the five facilities
involved in K testing recruited 75199 patients (mean, 146) into the
study. The number of patients recruited by each facility was positively
related, although not linearly proportional, to its typical test
volume.
laboratory tests used
Serum TC and K were selected because of their frequent use
for both ambulatory and hospitalized patients, their clinical
relevance, specimen stability, and the availability of a standard
reference method. There were 646 patients with TC specimens (four POLs
and two HLs) and 732 patients with K specimens (four POLs and one HL).
Laboratory measurements were made using cholesterol oxidase methods
(for TC) and ion-selective electrodes (for K).
ss result differences
Two different types of discrepancy criteria were used in this
study. One was based on predefined standards (standard-based), whereas
the other was based on the actual data collected from each facility,
taking into account each center's estimated measurement variability
and its systematic result differences compared with the RL
(data-driven). For assessing SS result discrepancies, two methods used
predefined standards to determine whether SS result differences
exceeded set thresholds. Standard-based methodologies are comprehensive
because they may identify result discrepancies that are associated with
any causes and are more widely accepted than their data-driven
counterpart because they are based on preexisting and validated
standards, allowing identification of discrepancies as data are
received. The data-driven discrepancy method, on the other hand, tends
to detect result discrepancies that are "out-of-context" on the
basis of each facility's results. The data-driven discrepancy method
is, therefore, designed to identify isolated problems within the TTP by
adjusting for systematic result differences between laboratories.
Therefore, result discrepancy identification using data-driven criteria
requires prior analysis of laboratory results from each facility.
For standard-based discrepancy criteria, denoting maximum allowable
variance by s2:
![]() |
so that
![]() |
The critical difference for the standard-based discrepancy
criteria was obtained as follows:
![]() |
where we used a multiplier of 3 for s in conformity with a commonly accepted outlier criterion of 3 standard deviations. The maximum allowable standard deviation (s) was obtained as either the mid-range of the abscissa of the Westgard operation process specification charts quality-control lines using CLIA PT standards at bias = 0 [Westgard CLIA PT-based discrepancy method (16)], or was set to the maximum allowable imprecision on the basis of published biological variation data for each analyte, 1/2(intraindividual CV) [biologically-based discrepancy method (17)(18)]. The critical limits to define the SS result differences as discrepant were ± 9.6% (s = 2.25%) and ± 0.48 mmol/L (s = 0.114 mmol/L) for TC and K, respectively, when the CLIA-based discrepancy criteria were used, whereas they were ± 12.9% [s = 1/2(6.1%)] and ± 10.4% [s = 1/2(4.9%)] for TC and K, respectively, when the biologically-based discrepancy criteria were used.
Data-driven result discrepancies were identified by reexamining the
S1 and S2 differences after
accounting for systematic result differences exhibited by the
participating facility and the RL and for measurement variability
associated with the testing process. Generally, if the systematic
result difference between S1 and
S2 (S1 -
S2) is denoted by d and the standard
deviation of the S1 - S2
difference is denoted by sd, then the criterion
for declaring a data-driven result discrepancy is
|[(S1 - d) -
S2]|
3 sd. In
practice, the systematic difference between S1
and S2 was characterized by a linear relationship
rather than a constant d. Measurement error modeling
procedures, with exclusion of outliers, were used in achieving this
characterization (19). In the scatter plots showing the
relationship between S1 and
S2 using data-driven discrepancy criteria,
S1 was adjusted to eliminate any systematic
difference between S1 and
S2 results.
For the assessment of testing bias of the six laboratories whose TC testing was examined, an adjustment for S2 results was made as follows: First, the referee laboratory's A3 results were adjusted to reflect the bias exhibited by the referee laboratory relative to its peer group in the CDC Lipid Standardization Program by linear regression, examining nearly 2 years of the referee laboratory's Lipid Standardization Program TC performance data. Next, the S2 result from the RL was adjusted by linear regression to account for the bias exhibited by the RL relative to the referee laboratory. Finally, to assess the participating laboratories' bias, the linear relationship between A1 and the adjusted A3 was examined using measurement error modeling procedures while excluding outliers as described above. Bias for results from laboratories performing K testing was not computed because a laboratory standardization program for this analyte did not exist.
decision tree algorithm to assess effectiveness of the ss design
We were interested to note what proportion of the time an
S1 result was consistent with its corresponding
S2 and the three audit sample results when the
S1 and S2 results were
called not discrepant by either standard-based discrepancy criterion.
This helps in assessing the effectiveness of not calling an SS
discrepancy in predicting a consistent (and probably "correct")
S1 result. Alternatively, we wanted to know what
proportion of the time an S1 result was
inconsistent with its corresponding S2 and the
three audit sample results when the S1 and
S2 results were called discrepant by either
standard-based discrepancy criterion. This would help in assessing the
effectiveness of calling an SS discrepancy in predicting an
inconsistent (and probably "incorrect") S1
result. A decision tree algorithm was used when audit sample results
were available to assess if the S1 and
S2 results that were discrepant or not discrepant
by the standard-based discrepancy criteria were discrepant or not
discrepant when the same discrepancy criteria were used and the
S1 - A3,
S1 - A2, and
S1 - A1 differences were
examined (Fig. 2
). When the S1 - S2
difference exceeded the critical limits set by each discrepancy
criterion for each analyte, using either one of the two standard-based
discrepancy criteria, the S1 result was called
"in error" by the SS design. The following parameters were used to
assess the effectiveness of the SS design in calling the
S1 result in error (discrepant) or not in error
(not discrepant):
|
adjusted standard-based discrepancy rates
Adjusted discrepancy rates were determined for each standard-based
discrepancy criterion by calling an S1 result
discrepant when it was called in error by the decision tree algorithm.
Because audit sample results were not available for most patients, we
used positive and negative predictive values obtained from the
(standard-based) discrepancy criterion for each analyte to estimate an
"adjusted" discrepancy rate comparable to the rate we would have
obtained if audit sample results for all patients had been available
and we had used the decision tree algorithm to call each
S1 result in error or not in error.
statistical methods
Analytical data suggested that measurement variability tended to
be higher for higher TC concentrations, whereas it tended to remain
constant throughout the range of K concentrations. Because of the
variance heterogeneity in TC concentration, we chose to express
intralaboratory variability as the coefficient of variation (CV, %)
rather than as standard deviation. We excluded outliers from
several analyses, including the regression modeling of
S1 vs S2 and the estimation
of intralaboratory result variability based on A1
- A3 and A2 -
A3 differences, by determining the studentized
residual r and calling results as outliers if |
r | >3.0.
Regression lines were fitted using measurement error regression procedures, rather than simple least squares (19). This was necessary because the value represented by the independent variable was measured with some error, and simple least-squares analysis would have underestimated the slope of linear regression lines.
Audit sample results were used to characterize measurement variabilities associated with the participating laboratories and with the RL. Differences in the audit sample results between the participating and referee laboratories and between the RL and referee laboratory were used to estimate these variabilities because we accounted for the measurement variability of the referee laboratory by using data obtained from the specimen pool (TC) of the Lipid Standardization Program and from six measurements on each of nine selected specimens (K). The variability between the measurements by the participating laboratories and the RL was estimated using these estimates of the measurement variability at the referee laboratory together with the estimated variability associated with the A1 - A3 and A2 - A3 differences for randomly selected audit samples. Outliers were excluded before computing these estimates.
Because of considerable variation in result discrepancy rates across laboratories, standard statistical approaches were not appropriate for some statistical analyses. Specialized analysis techniques were used to account for the variation in result discrepancy and problem rates across facilities. In particular, variance estimation for the odds ratio analysis was based on first-order Taylor series approximations (20) to accommodate the complex sampling design; these were analyzed using the SUDAAN1 software (Research Triangle Institute) package (21).
characterization of standard-based result discrepancies
Result discrepancies identified using either of the two
standard-based result discrepancy criteria (CLIA PT-based and
biologically-based) were attributed to three possible causes: isolated
(classified as discrepancies by the data-driven criterion), measurement
variability, or systematic result differences (biases) between
laboratories. The following algorithm was used in classifying
standard-based result discrepancies:
(a) standard-based discrepancies that were also identified as data-driven discrepancies were classified as attributable to isolated problems in the TTP;
(b) of the remaining result discrepancies, those that fell outside of either of the two standard-based discrepancy bounds, after S1 was adjusted for any bias against S2, were classified as attributable to measurement variability; and
(c) any remaining discrepancies were ascribed to systematic result differences between participating and referral laboratories.
documented problems
Clinic and laboratory records corresponding to individual patients
and their corresponding specimens were reviewed. All problems were
grouped into either a major stage of the TTP (preanalytical,
analytical, and postanalytical) or as study-induced when the facility
failed to follow the study protocol.
data management
A customized computer application was used for double-blind, 100%
rekey data entry. Data entry inconsistencies were identified and
remedied by a senior data manager.
| Results |
|---|
|
|
|---|
|
|
|
Result discrepancy rates derived using the two standard-based (i.e.,
CLIA PT-based and biologically-based) discrepancy criteria for TC were
also determined after accounting for the bias of the
S2 results from the RL relative to the
A3 results from the referee laboratory after the
latter was adjusted for its bias relative to the group mean of the
Lipid Standardization Program (see Materials and
Methods). This caused the TC discrepancy rates to be reduced from
4.6% (Table 1
) to 2.6% (not shown in Table 1
) when the CLIA PT-based
discrepancy criterion was used and from 1.4% (Table 1
) to 0.9% (not
shown in Table 1
) when the biologically-based discrepancy criterion was
used. Such an accounting for bias does not affect data-driven
discrepancy rates because this discrepancy criterion is designed to
account for systematic result differences between laboratories.
evaluation of the ss design for assessment of result discrepancies
When the decision tree algorithm (Fig. 2
) was used, the efficiency
of the SS design to call a result discrepant was 9398% for TC and
7984% for K. Table 2
lists the efficiency, predictive values, sensitivity, and
specificity of the SS design for calling results discrepant for TC and
K tests, using both CLIA PT-based and biologically-based discrepancy
criteria.
|
characterization of standard-based result discrepancies
Result discrepancies, as determined by either the CLIA PT-based or
biologically-based discrepancy criterion, were classified as resulting
from isolated problems (data-driven discrepancy), from excessive
measurement variability, or from systematic result differences between
participating and referral laboratory results (see Materials and
Methods). The causes of the two standard-based result
discrepancies for both analytes are listed in Table 3
. For TC, no discrepancies were attributed to measurement
variability. However, for K, more discrepancies were attributed to
measurement variability (4%) than to either systematic result
difference (bias) between laboratories (2.7%) or isolated problems
implied by the data-driven discrepancy method (1.9%). When either of
the two standard-based criteria for K was used, ~50% of
discrepancies could be ascribed to excessive measurement variability,
whereas ~30% of discrepancies could be attributed to systematic
result differences between the participating laboratory and the RL. For
both analytes, 2070% of standard-based discrepancies were a result
of systematic result differences.
|
participants' problem-monitoring systems
Except for the usual quality-assurance procedures cited by the
participating hospital laboratories, we found no evidence of existing
monitoring systems specifically designed to detect problems throughout
the TTP. All participating laboratories had facility-specific
preanalytical procedures for specimen collection and processing for
testing, analytical procedures for conducting requested tests on the
provided specimens, and postanalytical procedures for processing and
posting test results. All participating laboratories performed daily
quality-control testing and participated in interlaboratory PT programs
as required by CLIA for laboratory accreditation and licensure. We
observed only two instances (0.1%) where actions had been taken
because of problems identified by existing quality-assurance systems.
In the first instance, the laboratory recommended that a specimen be
re-collected because of hemolysis. In the second, quality-control
testing identified instrument calibration problems that resulted in the
re-analysis of the study specimen.
documented problems
For the 1378 patients included in this study, 40 problems were
identified by review of medical and laboratory records; all of these
occurred in 5 of the 11 participating facilities. Fourteen (35%) of
the problems detected during review of records were study-induced (the
study protocol was not adhered to or the study itself caused a facility
to commit an error they would not have committed routinely). This
included eight patients, all from the same clinic, who were
inappropriately recruited because thyroid tests were ordered instead of
TC tests and four patients from another clinic for whom TC was assayed
twice, once to report the result to Analytical Sciences, Inc. (the
study contractor), and a second time the next day to incorporate the
result into each patient's lipid panel. Two study-induced problems
occurred in a third clinic. One problem occurred when the clinician had
not specifically ordered a K test on the S1 specimen although the
patient had been recruited in the study. The other study-induced
problem occurred when the clinic inadvertently processed an S2 specimen
and sent it for testing by a laboratory that was not participating in
this study. Of the remaining 26 problems, two (8%) were preanalytical.
In one case, the actual specimen collection date and the collection
date reported on the test result differed by 1 day. In the other case,
involving a different facility, when a worker indicated that the S1
specimen for K testing was hemolyzed, another specimen was requested by
the clinician. The patient was recalled on another date, and the
S1 result from a second specimen was reported for
comparison with the S2 result from the original
specimen. Twenty-two of the 26 (85%) problems were postanalytical (2
in facility A, 1 in facility F, 1 in facility G, and 18 in
facility J). Fourteen of these problems related to the untimely posting
of the patient's result on the medical record (delays exceeding 2
weeks). Eleven of these 14 problems were associated with facility J.
The remaining eight postanalytical problems were all related to
transcription of laboratory results, seven of which were also
associated with facility J. This was the only facility in the study
that did not use a computer for managing clinical or laboratory data.
All results were transcribed by hand from the analyzer printout to an
accession log and then onto the medical record. Because of
understaffing and the high workload in facility J, results were not
always timely and transcribed accurately. Because of three
discrepancies with potential clinical impact involving three patients
(TC S1 = 4.55 mmol/L vs S2
= 6.34 mmol/L, TC S1 = 4.78 mmol/L vs
S2 = 8.87 mmol/L, and K S1
= 5.3 mmol/L vs S2 = 4.8 mmol/L), we reviewed all
available medical and laboratory records for these patients
extensively. Although we suspected that the preanalytical specimen for
one of these three patients was switched with another study patient, we
could not document it. In this case, which involved TC testing, two
patients from the same clinic whose blood was collected only 5 min
apart provided what we believe is strong, albeit circumstantial,
evidence for preanalytical specimen switches. For TC testing, one-half
of the documented problems were study-induced, and problem rates were
2040% of the rates for K (Table 4
). For K tests, 90% of problems were related to the routine
testing process, and only 10% were study-induced; the 24% problem
rate refers to facility J.
|
Of the 24 problems identified during document review that were not study-induced and for which complete results were available, 5 result discrepancies were identified. Of the remaining 1354 cases with complete data (among which no document review problems were found), 100 were associated with result discrepancies. Thus, we were 3.3 times more likely to observe result discrepancies when problems could be documented and vice versa. The odds ratio of 3.30 was significantly (P = 0.0125) different from 1.
clinical impact of result discrepancies
Laboratory data may be used to assess the nature and extent of a
potential impact resulting from result discrepancies. Our only attempt
to assess clinical impact was TC result classification by the National
Cholesterol Education Program screening guidelines of
5.17
mmol/L (
200 mg/dL) for moderate or high risk and
6.21 mmol/L (
240
mg/dL) for high risk of developing cardiovascular disease
(22). Of the 646 patients with TC results, only 2 patients
(0.3%) had discrepant results: one result was <5.17 mmol/L, whereas
the other was
6.21 mmol/L (Table 5
). For one of these two patients, the medical record noted that
the TC concentration had markedly improved since the patient's
previous visit only 3 months earlier. The reported and observed
S1 results were both 4.55 mmol/L, but the result
for the previous visit had been 5.28 mmol/L. SS testing yielded
S2, A1,
A2, and A3 results of
6.166.49 mmol/L (Table 5
). Subsequent review of the
S1 results for other study patients in the same
clinic identified an S1 value of 6.10 mmol/L from
another patient, whereas the corresponding S2 and
audit sample results were all between 4.22 and 4.63 mmol/L. In these
and a third case in which such a clinical impact criterion was met,
only the S1 result was discrepant when compared
with the S2, A1,
A2, and A3 results.
|
| Discussion |
|---|
|
|
|---|
The reasons for using an SS methodology as a laboratory
quality-assurance system include its detection of problems not observed
by other TTP quality-monitoring systems, which makes it a complementary
means of quality assessment, and its objectivity once the criterion for
result discrepancy has been defined. The SS methodology, however,
should be considered in view of its limitations, which include
(a) possible bias in both SS results from the laboratory
whose performance is being assessed, or inaccurate results from the RL
in cases in which, as in this study, the results from that laboratory
are used for comparison; (b) insensitivity to TTP
problems that may occur before specimens are collected and after
results are reported; and (c) insensitivity to problems not
impacting laboratory results, such as a switch between two specimens
exhibiting similar analyte concentrations or composition. This study
revealed that result discrepancy rates higher than
1.0%, as seen in
published reports of laboratory error rates, were observed for both
analytes. This comparison, however, should be made in light of
the fact that our study is based on result discrepancy rates involving
the referral and participating laboratories and that these rates cannot
be equated to errors in laboratory test results. Our study design is
different from virtually all other published reports in that
(a) objective result discrepancy criteria are used on all
split specimens obtained, whereas the other studies assessing
laboratory test errors are highly dependent on the effectiveness of
existing quality-assurance systems and detect testing problems in only
a portion of the study population; (b) the definitions of a
laboratory testing problem are probably different; (c)
different parts of the TTP are probably monitored; and (d)
our use of a different laboratory (RL), in contrast with most SS
studies, avoids the possible existence of the same measurement bias in
each SS analysis done by the same facility, which contributes to
increased but valid discrepancy rates. We did address this issue
successfully with the TC test because the bias in measurements from the
RL could be evaluated by use of a referee laboratory participating in
the CDC Lipid Standardization Program.
The first cited limitation of our SS design was evaluated by assessing
the efficiency, predictive values, sensitivity, and specificity of the
methodology, using a decision tree algorithm that was based on
S1, S2,
A1, A2, and
A3 values. Although the absence of an SS result
discrepancy was highly predictive of the S1
result not being in error (negative predictive value of 93100%), the
presence of an SS result discrepancy was much less predictive of the
S1 result being in error (positive predictive
value of 4367%), which led to an efficiency of 9398% for TC and
7984% for K (Table 2
). Adjusting the discrepancy rates for the
predictive values of the SS design had little or no effect on two
standard-based discrepancy rates, it decreased the CLIA PT-based
discrepancy rate for TC (by 24%) from 4.6% to 3.5%, and increased
the biologically-based discrepancy rate for K (by 22%) from 8.7% to
10.6% (Table 1
).
It is critical that the operational logistics associated with the SS methodology be such that the actual testing process (from collection of specimens to reporting of results) is monitored and is not a variation of this process induced by the methodology itself. In our study, 14 of the 40 problems (35%) that could be documented by retrospective review of medical and laboratory records were study-induced. We surmise that most of these problems emanated from the lack of familiarity of the participating facilities with the SS process. Although the SS design, as it was implemented here, had different laboratories analyzing the various specimens, facilities using the SS methodology for monitoring of the testing process may use this procedure differently in that all specimens are probably analyzed in the same laboratory. The major drawback of such a system, as stated earlier, is that SS result discrepancies are likely to be underestimated if the same laboratory is involved in testing because processes that lead to a biased (quantitative) or incorrect (qualitative) result may be operating during testing processes involving both specimens. It is unlikely, however, especially during the current period of increasing fiscal restraint, that medical facilities will use other laboratories for analyzing the S2 specimens or audit samples for quality assurance of their testing processes.
The operational logistics associated with an SS design should be such that the procedure would not induce variation in the routine testing process. This was not quite the case in this study in that of the 40 problems documented in medical and laboratory records, 14 (35%) were not caused by routine processes within the TTP, but were study-induced problems. SS designs should, therefore, be constructed to minimize study-induced problems so that detected result discrepancies reflect actual problems within the testing process itself. Of the 26 TTP problems, only two (8%) could not be related to a stage of the TTP. Of the other 24 problems, none was of an analytical nature, two (8%) were related to the preanalytical stage of the TTP, and the remainder (92%) were postanalytical problems. This is in agreement with the observation by others that most problems are related to the nonanalytical stage of the TTP (5)(9)(10)(11). Such classification of problems is valuable in that knowledge of both the type of mistakes and the TTP stage at which they occur may assist in maximizing problem detection so that measures may be taken to minimize their occurrence. However, what is missing from all published reports, including this one, is an evaluation of the impact of each problem type on test result accuracy and precision, turnaround time, medical decision making, disease and health management, and eventually, health outcome. Furthermore, studies that investigate problems in each step of the TTP should additionally assess the medical impact of these problems so that quality improvement efforts may focus on the more critical stages of the testing process.
In this study, problems in the TTP were identified directly only through retrospective medical and laboratory record review. However, the identification of result discrepancies is obviously motivated by the hypothesis that these discrepancies are indicative of TTP problems as well. Therefore, a strong association between result dis- crepancies and document review problems would lend support to this hypothesis; and, in fact, this study did reveal a significant association as evidenced by the odds ratio of 3.30, P = 0.0125. However, little redundancy was found between result discrepancies, as identified by at least one of the three discrepancy criteria, and problems identified by document review. Of the 105 result discrepancies, only 5 (5%) were also associated with document review problems. Of the 24 documented TTP problems for which complete results were available, only 5 (21%) were also associated with a result discrepancy. Therefore, used alone, either of these two quality-assessment measures (retrospective document review and result discrepancy analysis) is not likely to constitute as effective a means of identifying TTP problems as when they are used in combination.
The major emphasis of this study was to implement and evaluate an SS design, in conjunction with retrospective medical and laboratory record reviews, and to identify the nature and extent of problems within the TTP. As such, this report was not meant to describe how an SS design should be implemented but rather what can be learned by the implementation and evaluation of such an experimental design. Our goal was not to actually determine the frequency and type of problems within the TTP with any generalizable certainty. These results should be considered in light of laboratory selection bias attributable to the small sample size (11 facilities) and the oversampling of certain types of facilities in a limited geographic region. We found widely varying result discrepancy and documented problem rates among facilities, which also contributed to the uncertainty associated with any reported overall problem rate. Logistically, we encountered numerous difficulties in recruiting medical clinics and hospital and office laboratories to participate in this study. We faced a general resistance by clinicians as well as some laboratorians, which stemmed from complaints that they were too busy, overburdened with too much paperwork, understaffed, or that there was insufficient monetary incentive to participate. Once facilities agreed to participate, we received excellent cooperation, which allowed us to attain 92% of our collection goal of 1500 sets of split specimens.
In summary, our findings indicate that the SS methodology and its logistics could be implemented and evaluated, that the SS design used could provide a measure of result discrepancies with an overall efficiency (as defined in Materials and Methods) of 7998% for the two analytes studied, and that in combination with retrospective review of medical and laboratory records (and perhaps other effective TTP quality-assurance systems), this methodology can serve as a monitor for a portion of the TTP (from collection of specimens to reporting of laboratory results).
| Acknowledgments |
|---|
| Footnotes |
|---|
2 Nonstandard abbreviations: TTP, total testing process; SS, split specimen; HL, hospital laboratory; POL, physician office laboratory; RL, referral laboratory; TC, total cholesterol; and PT, proficiency testing. ![]()
1 4 Use of trade names and commercial sources is for identification only and does not imply endorsement by the Public Health Service or by the US Department of Health and Human Services. ![]()
| References |
|---|
|
|
|---|
The following articles in journals at HighWire Press have cited this article:
![]() |
P. Bonini, M. Plebani, F. Ceriotti, and F. Rubboli Errors in Laboratory Medicine Clin. Chem., May 1, 2002; 48(5): 691 - 698. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |