Clinical Chemistry AACC Online Job Center
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Clinical Chemistry 48: 1802-1804, 2002;
This Article
Right arrow Extract Freely available
Right arrow Full Text (PDF)
Right arrow Supplemental Data
Right arrow Submit an electronic Letter to
the Editor about this paper
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via ISI Web of Science (4)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Horn, P. S.
Right arrow Articles by Pesce, A. J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Horn, P. S.
Right arrow Articles by Pesce, A. J.
Related Collections
Right arrow Laboratory Management
Right arrow General Clinical Chemistry
(Clinical Chemistry. 2002;48:1802-1804.)
© 2002 American Association for Clinical Chemistry, Inc.


Technical Briefs

Effect of Ethnicity on Reference Intervals

Paul S. Horn1a and Amadeo J. Pesce2

1 Department of Mathematical Sciences, University of Cincinnati, Cincinnati, OH 45221-0025,
2 Department of Pathology and Laboratory Medicine, University of Cincinnati, Cincinnati, OH 45267-0714

aauthor for correspondence: fax 513-556-3417, e-mail paul.horn{at}uc.edu

The clinical chemist desiring to set up appropriate reference intervals must consider race or ethnicity as a possible factor on which to subdivide an interval. The effect of ethnicity on various analyte values has not been systematically studied (1)(2). The advent of National Health and Nutrition Examination Survey (NHANES) data (3) has made it possible to determine the effect of ethnicity on reference interval estimation and whether separate reference intervals should be derived for each ethnic group.

The Third National Health and Nutrition Examination Survey (NHANES III) contains data for 33 994 persons ages 2 months and older who participated in the survey. Clinical chemistry measurements were made on several analytes, and health status was determined by a physician. In this report, we will use only individuals with the "Excellent"rating of health status as recommended by the IFCC (4) and NCCLS (5).

The issue of whether to use separate reference intervals has been addressed by Harris and Boyd (6). Their recommendation, which is used by the NCCLS (5), is that two groups should be combined into a single group unless their means and/or SDs exceed appropriate predetermined thresholds. By treating the known healthy individuals as a "gold standard", the effect of the addition of ethnicity on these estimators will be examined. We used a previously described outlier detection scheme (7) to reduce the effect of atypical observations on these estimators.

The NHANES population was obtained from NHANES 3x Ver. 1.21 (3). The data are the result of a complex survey design involving stratification and clustering that yielded individual weights for each observation. However, for our purposes in this report, we will treat the individuals as coming from a random sample, i.e., individual weights will be ignored. Analyses involving the individual weights will be explored in future work.

We used the variable DMARETHN (Race-ethnicity) to analyze the following three groups: non-Hispanic white (NHW), non-Hispanic black (NHB), and Mexican-American (MA). The variable HSSEX was used to define sex and HSAGEIR to define age at interview. The health status was derived from the variable DMAPEP 13A (physician’s impression). Any individual with a missing value in any of these variables was dropped from the analyses. Our analyses were based on individuals 20 years of age and older. The sample sizes of the groups were as follows: for males, 1032, 734, and 1000 for NHW, NHB, and MA, respectively; for females, 1279, 842, and 909, for NHW, NHB, and MA, respectively. The following 33 analytes were examined: albumin, alanine aminotransferase (ALT), alkaline phosphatase (AP), aspartate aminotransferase (AST), blood urea nitrogen (BUN), calcium, chloride, creatinine (CR), urinary creatinine (CR_URIN), {gamma}-glutamyltransferase (GAMMAGT), glucose, granulocytes (GRAN), hematocrit (HEMOCRIT), hemoglobin (HEMOGLOB), potassium (K), lactate dehydrogenase (LDH), lymphocytes (LYMPH), mean corpuscular hemoglobin (MCH), mean corpuscular hemoglobin concentration (MCHCON), mean corpuscular volume (MCV), monocytes (MONO), mean platelet volume (MPV), serum osmolality (OSMO), platelet count (PLATELET), phosphate (PO4), red blood cell count (RBC), red blood cell distribution width (RBCDW), sodium, total bilirubin (TBILIRUB), total serum carbon dioxide (TCO2), total serum protein (TPROTEIN), uric acid (URIC), and white blood cell count (WBC).

Outliers in the NHANES III data set were removed using the method described by Horn et al. (7). Briefly, this method first transforms the data by use of the Box–Cox power transformation, (x{lambda} - 1)/{lambda}, where x is an original observation. The outlier cutoffs are then computed for the transformed data by use of Tukey inner fences. If the transformed data come from a gaussian population, then this technique will identify ~0.7% of the data as outliers (7). The parameter {lambda} was also examined because its value indicates the nature of the underlying distribution of the analyte. For example, if {lambda} = 0, then the natural logarithm was used as the transformation, and therefore the original data were log-normally distributed. Clearly, if {lambda} = 1, then the data are left untransformed. Outlier detection in conjunction with either a traditional nonparametric or robust estimator of the reference interval was computed using a described previously algorithm (7)(8)(9).

The Harris and Boyd (6) method was used to determine whether to combine samples in the computation of a reference interval. This method uses the following test statistic (5):

where 1 and 2 are the sample means of the two subgroups, s12 and s22 are the sample variances, and n1 and n2 are the sample sizes in each subclass, respectively. If there are at least 60 individuals in each group, the z-test above is essentially a nonparametric test (because of the Central Limit Theorem) and may be applied to the original data regardless of whether the values conform to a gaussian distribution. The groups are not combined if the test statistic, z, is too large, or equivalently, if z* =z/[(n1 +n2)/240]1/2 is greater than k. Suggested values for k are 3 or 5 (10). [For gaussian populations, it is also suggested that groups not be combined if the maximum (s1, s2)/|s2 - s1| is <3, i.e., if the SDs are significantly different (5).]

Shown in Table 1 of the data supplement (available with the online version of this Technical Brief at www.clinchem.org/content/vol48/issue10/) are the 95% reference intervals for males and females for the 33 analytes separately for each of the three ethnic groups as well as the combined group. Note that the robust and nonparametric reference intervals are, in most cases, very similar. When there are differences, the robust upper endpoint is smaller than that of the nonparametric. The values of {lambda} for the Box–Cox–Tukey outlier detection, as described by Horn et al. (7), and c for the case {lambda} = 0 are also given. The different values of {lambda} are noteworthy and reflect the varied distributions of analytical values. That is, one cannot simply look at the data as normal ({lambda} = 1) or log-normal ({lambda} = 0) in general. Although these underlying distributions appear nongaussian, we use the Harris and Boyd test for separation as an approximate guide.


View this table:
[in this window]
[in a new window]
 
Table 1. z* values and results for Harris and Boyd separation test between pairs of ethnic groups.

The z* values by analyte for the Harris and Boyd separation test for pairs of ethnic groups are listed in Table 1Up . One problem of testing pairs of groups is that inconsistencies may arise as the result of intransitivity: the test may suggest that NHW and NHB be partitioned but that MA be combined with both NHW and NHB. Table 1Up shows that for males and k = 3, the three groups could be combined for 15 analytes, two of the three combined for 9 analytes, none combined for 1 analyte (creatinine), and that intransitivity occurred for 8 analytes. For k = 5, the numbers of analytes for these four cases were 29, 2, 0, and 2, respectively. For females, the patterns were similar although the analytes were different, especially for k = 3.

We have used our previously described methods of determining reference intervals that use outlier detection with either nonparametric or robust estimators of the endpoints. The final values presented in this study are medically conservative in that they will tend to reduce false negatives. Tables 1Up and 2 in the data supplement give reasonable estimates of the reference intervals for the different genders and ethnic groups. From these data, the Harris and Boyd separation criteria (10) indicate that some degree of separation of the ethnic groups is necessary (Table 1Up in the text).

This study showed that the Box–Cox {lambda} parameter could vary among ethnic groups for the same analyte. However, in the vast majority of cases values were similar, implying that pooling of the data is possible. Values of {lambda} from -0.1 to 0.1 were set equal to 0, indicating the appropriateness of a log transformation. This was found to be the case in very few of the analytes. Similarly, there were few cases, <25%, where no transformation ({lambda} = 1) was indicated.

In conclusion, taking advantage of the large numbers of patients evaluated in the NHANES III survey and our previously described analytical procedures, we have shown that use of the Boyd and Harris criteria (10) indicates that separate reference intervals are warranted among the three ethnic groups considered. One possible guideline is to separate or combine the groups based on whether z* is >5 and, if intransitivity exists, base the decisions on whether z* is >3.


References

  1. Sasse EA. Reference intervals and clinical decision limits. Kaplan LA Pesce AJ eds. Clinical chemistry: theory, analysis, and correlation 1996:365-381 CV Mosby St. Louis, MO. .
  2. Jacobs DS, Kasten BL, Demott WR, Tilzer L. Laboratory test handbook 1996:1142 American Association of Clinical Chemistry Press Washington, DC. .
  3. The Third National Health and Nutrition Examination Survey (NHANES III), 1988–1994 [CD-ROM], 3x Ver. 1.21 National Center for Health Statistic Data Dissemination Branch, Centers for Disease Control and Prevention Hyattsville, MD. .
  4. . International Federation of Clinical Chemistry. Approved recommendation (1987) on the theory of reference values. Part 5. Statistical treatment of collected reference values. Determination of reference limits. J Clin Chem Clin Biochem 1987;25:645-656.[ISI]
  5. Sasse EA, Aziz KJ, Harris EK, Krishnamurthy S, Lee HT, Jr, Rutland A, Seamonds B. How to define and determine reference intervals in the clinical laboratory; approved guideline. NCCLS Document C28-A, Vol. 15, No. 4 1995 NCCLS Wayne, PA. .
  6. Harris EK, Boyd JC. On dividing reference data into subgroups to produce separate reference ranges. Clin Chem 1990;36:265-270.[Abstract/Free Full Text]
  7. Horn PS, Feng L, Li Y, Pesce AJ. Effect of outliers and nonhealthy individuals on reference interval estimation. Clin Chem 2001;47:2137-2145.[Abstract/Free Full Text]
  8. Horn PS, Pesce AJ, Copeland BE. A robust approach to reference interval estimation and evaluation. Clin Chem 1998;44:622-631.[Abstract/Free Full Text]
  9. Horn PS, Pesce AJ, Copeland BE. Reference interval computation using robust vs parametric and nonparametric analyses. Clin Chem 1999;45:2284-2285.[Free Full Text]
  10. Harris EK, Boyd JC. Statistical bases of reference values in laboratory medicine 1995:86 Marcel Dekker, Inc. New York. .




This Article
Right arrow Extract Freely available
Right arrow Full Text (PDF)
Right arrow Supplemental Data
Right arrow Submit an electronic Letter to
the Editor about this paper
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via ISI Web of Science (4)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Horn, P. S.
Right arrow Articles by Pesce, A. J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Horn, P. S.
Right arrow Articles by Pesce, A. J.
Related Collections
Right arrow Laboratory Management
Right arrow General Clinical Chemistry


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS