Clinical Chemistry AACC Online Job Center
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Clinical Chemistry 48: 1987-1999, 2002;
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Supplemental Data
Right arrow Submit an electronic Letter to
the Editor about this paper
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (10)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Lahti, A.
Right arrow Articles by Boyd, J. C.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Lahti, A.
Right arrow Articles by Boyd, J. C.
Related Collections
Right arrow Laboratory Management
Right arrow General Clinical Chemistry
Right arrow Evidence Based Laboratory Medicine and Test Utilization
(Clinical Chemistry. 2002;48:1987-1999.)
© 2002 American Association for Clinical Chemistry, Inc.

Impact of Subgroup Prevalences on Partitioning of Gaussian-distributed Reference Values

Ari Lahti1a, Per Hyltoft Petersen2,3 and James C. Boyd4

1 Department of Clinical Chemistry, Rikshospitalet University Hospital of Oslo, N-0027 Oslo, Norway.

2 Department of Clinical Biochemistry, Odense University Hospital, 5000 Odense C, Denmark.

3 NOKLUS, Norwegian Centre for External Quality Assurance of Primary Care Laboratories, Division for General Practice, University of Bergen, N-5020 Bergen, Norway.

4 Department of Pathology, University of Virginia Health System, Charlottesville, VA 22908.

aAddress correspondence to this author at: Department of Clinical Chemistry, Rikshospitalet University Hospital of Oslo, N-0027 Oslo, Norway. Fax 47-23071080; e-mail ari.lahti{at}rikshospitalet.no.


   Abstract
Top
Abstract
Introduction
Theory
Application Example
Discussion
References
 
Background: The aims of this report were to examine how unequal subgroup prevalences in the source population may affect reference interval partitioning decisions and to develop generally applicable guidelines for partitioning gaussian-distributed data.

Methods: We recently proposed a new model for partitioning reference intervals when the underlying data distribution is gaussian. This model is based on controlling the proportions of the subgroup distributions that fall outside each of the common reference limits, using the distances between the reference limits of the subgroup distributions as functions to these proportions. We examine the significance of the unequal prevalence effect for the partitioning problem and quantify it for distance partitioning criteria by deriving analytical expressions to express these criteria as a function of the ratio of prevalences. An application example, illustrating various aspects of the importance of the prevalence effect, is also presented.

Results: Dramatic shrinkage of the critical distances between reference limits of the subgroups needed for partitioning was observed as the ratio of prevalences, the larger one divided by the smaller one, was increased from unity. Because of this shrinkage, the same critical distances are not valid for all ratios of prevalences, but specific critical distances should be used for each particular value of this ratio. Although proportion criteria used in determining the need for reference interval partitioning are not dependent on the prevalence effect, this effect should be accounted for when these criteria are being applied by adjusting the sample sizes of the subgroups to make them correspond to the ratio of prevalences.

Conclusions: The prevalences of subgroups in the reference population should be known and observed in the calculations for every reference interval study, irrespective of whether distance or proportion criteria are being used to determine the need for reference interval partitioning. We present detailed methods to account for the prevalences when applying each of these types of criteria. Analytical expressions for the distance criteria, to be used when high precision is needed, and approximate distances, to be used in practical work, are derived. General guidelines for partitioning gaussian distributed data are presented. Following these guidelines and using the new model, we suggest that partitioning can be performed more reliably than with any of the earlier models because the new model not only offers an improved correspondence between the critical distances and the critical proportions, but also accounts for the prevalence effect.


   Introduction
Top
Abstract
Introduction
Theory
Application Example
Discussion
References
 
Partitioning reference values by population subgroups is an essential part of reference interval studies. Except for demographic descriptors, such as age, gender, or race, many lifestyle factors may affect blood concentrations of clinical biochemical quantities, and these factors should accordingly be considered as potential partitioning categories. As long as population-based reference intervals remain the predominant way to use clinical biochemical laboratory measurements in medical diagnosis and screening, partitioning is a valuable tool to improve the specificity of these intervals.

The usefulness of partitioning has recently become more evident with the advent of large-scale reference interval studies involving up to thousands of reference individuals (1)(2)(3), because large reference samples make partitioning into smaller and smaller subgroups feasible. However, existing partitioning models, as reviewed in the monograph of Harris and Boyd on calculation methods of reference values (4), do not seem to give reliable estimates of the proportions of the subgroup distributions falling outside the reference limits of the combined distribution. Because these proportions are considered today as the most relevant criteria for partitioning, we have undertaken to construct a new model that hopefully gives better control over them. Our model, introduced recently (5), is based on correlating these proportions to distances between reference limits of the subgroup distributions, one pair of reference limits being considered at a time.

However, when applying the new model to various partitioning problems, we discovered a complication caused by the subgroup prevalences. Unequal prevalences seem to make the critical distances used as partitioning criteria shorter compared with equal prevalences. Moreover, it appears that subgroup prevalences should also be considered when using proportion criteria or, in practice, whenever reference data are being partitioned. As far as we are aware, the importance of subgroup prevalences on the partitioning problem has not been recognized previously.

Our previous report (5) was intended to be an exhaustive study on partitioning gaussian-distributed data. However, after discovery of the prevalence effect, it appears that the methodology described therein applies only to the development of partitioning criteria for the very special case of equal subgroup prevalences. The present report is meant to be a generalization of our previous work, covering the situation of unequal subgroup prevalences.

Because it is easy to confuse prevalences and sample sizes, we will spend some time at the beginning of this study to make the distinctions between these two concepts clear. Thereafter, we will derive analytical expressions for critical distances as a function of the ratio of prevalences and calculate approximate distances to be used in practical work. An application example, contrived for this particular purpose, is meant to illustrate the consequences on partitioning if the prevalence effect is not accounted for. We will also present general guidelines for partitioning gaussian-distributed data. These guidelines, which are intended to replace those presented in our previous report (5), are an important generalization of the earlier ones.


   Theory
Top
Abstract
Introduction
Theory
Application Example
Discussion
References
 
calculation methods
We used Microsoft® Excel 97 to calculate probit values and proportions of gaussian distributions, and Mathematica® (Ver. 4) to solve some specific problems of this study.

background
Proportions of two gaussian distributions, having equal prevalences, outside the reference limits of the combined distribution.
By definition, 2.5% of the reference values of a reference distribution lie outside each of its two reference limits because the reference interval is conventionally defined as the central 95% of the population (6). Fig. 1A shows two gaussian distributions, denoted a and b, that are identical except for different means. The lower reference limit, or the 2.5 percentile, is indicated for both distributions, as well as the lower reference limit of the combined distribution (the thick vertical bar).



View larger version (26K):
[in this window]
[in a new window]
 
Figure 1. Two gaussian subgroup distributions.

Panel A illustrates the proportions (pa and pb) of two gaussian subgroup distributions, a and b, which are identical except for different means, outside the lower reference limit of the combined distribution. This common limit is indicated by a thick vertical bar, but for clarity, the nongaussian combined distribution itself is not plotted. D is the distance between the lower reference limits of the subgroup distributions, composed of da and db, which are the distances between the lower reference limit of the combined distribution and those of the respective subgroup distributions. Panel B shows the same subgroups as panel A, but distribution b has been multiplied by 2 to visualize that the prevalence of subgroup b is twice as large as the prevalence of subgroup a. Observe that the unit on the ordinate is not probability density, in contrast to panel A, because, with an area equal to 2.0, distribution b is not a probability distribution but a frequency distribution.

The 2.5% of the reference values that lie outside the lower reference limit of the combined distribution in Fig. 1AUp are composed of proportions pa and pb of the distributions a and b, respectively, outside this common reference limit:

(1)
where wi is a weight factor describing distribution i. If pa and pb were fixed, the lower common reference limit would also be fixed, as can be seen in Fig. 1AUp . How to interpret the weight factors wa and wb in Eq. 1Up is an important point for consideration. One might suggest that these factors be interpreted as the numbers of reference values, or na and nb, in the two subgroups corresponding to the distributions a and b, or subgroups a and b. However, these numbers may be a more or less accidental result of performing the reference interval study, or the designer of the study may have chosen them at will.

It can easily be shown that if the ratio of na/nb changes from one reference interval study to the next, the determined common reference limit will also change. Considering Eq. 1Up , if wa and wb were interpreted as na and nb, no fixed values of pa and pb could be valid for both experimental designs simultaneously (except for the trivial case with pa = pb = 2.5%). Because pa and pb define the localization of the lower common reference limit, one would thereby obtain different lower reference limits for different ratios of na and nb. But reference limits intended for general use should obviously not be dependent on such contingencies as the numbers of reference values in a particular reference interval study. Hence, na and nb are not an appropriate interpretation of the weight factors in Eq. 1Up . These factors should, in our opinion, be interpreted as prevalences of the subgroup populations, or as fa and fb.

If we specifically assume equal prevalences (fa = fb), Eq. 1Up gives the following simple relationship between pa and pb:

(2)
The proportions pa and pb lie in the intervals [2.5%, 5.0%) and (0.0%, 2.5%], respectively, and as Eq. 2Up indicates, their sum is always exactly 5.0%. The respective asymptotes of 0.0% and 5.0% cannot be reached because the probability density of a gaussian distribution is positive for any real argument.

The case of equal prevalences was examined in our previous report on partitioning (5). Starting from the currently widely accepted analytical bias quality specifications (7)(8)(9), we derived critical values for the proportions of the subgroup distributions outside the common reference limits as partitioning criteria. Our suggestions were 4.1% as the critical proportion for partitioning, or pCritP, and 3.2% as the critical proportion for nonpartitioning, or pCritNP. Because the sum of pa and pb equals 5.0% (Eq. 2Up ), these suggestions simultaneously implied that the complementary critical proportions, also abbreviated pCritP and pCritNP, outside the other one of the two subgroup distributions (distribution b in Fig. 1AUp ) should be 0.9% and 1.8%, respectively. We recommended partitioning if any of the proportions outside the common reference limits was >=4.1% or <=0.9% and nonpartitioning if all of these proportions lay between 1.8% and 3.2%.

We were also able to convert these critical proportions to critical distances between the reference limits of the subgroup distributions. We suggested 0.75 s as the approximate partitioning criterion and 0.25 s as the approximate nonpartitioning criterion, where s is the standard deviation of the narrower one of the two subgroup distributions. These approximate critical distances were considered valid whenever the ratio between the standard deviations of the subgroup distributions (R) is <=1.5.5.1 [Note: According to Harris and Boyd (10), larger ratios of R imply partitioning because one of the proportions outside the common reference limits then exceeds 4.0%.]

As stated above, these results were derived under the implicit assumption of equal prevalences, and their applicability is consequently limited to this special case. What follows next is a generalization of our previous study, intended to cover unequal prevalences.

Proportions of two gaussian distributions, having unequal prevalences, outside the reference limits of the combined distribution.
In the section above, we suggested that the weight factors wa and wb of Eq. 1Up should be interpreted as prevalences instead of as numbers of reference values. In contrast to these numbers, prevalences cannot be manipulated by the designer of the reference interval study because they are fixed attributes of the subgroup populations.

Consider an example with prevalence fa = 0.33 and prevalence fb = 0.67, i.e., with the ratio of prevalences, or Fr, equal to 2.0. Insertion of these prevalences as wa and wb into Eq. 1Up produces the following equation:

Fig. 1BUp illustrates this example with Fr = 2.0. In Fig. 1BUp , distribution a is the same gaussian distribution as in Fig. 1AUp , but distribution b has been multiplied by two, which makes the area of distribution b twice as large as that of distribution a. However, the means, the standard deviations, and the lower reference limits of both distributions are as in Fig. 1AUp . The enlarged area of distribution b simply indicates the larger prevalence of the subgroup population b, or the weight of distribution b with respect to the weight of distribution a.

Because D (Fig. 1AUp ) is identical at both values of Fr, Fr = 1.0 and Fr = 2.0, it can be used to determine the required adjustments of pa and pb. First, calculate D for Fr = 1.0 [The quantiles (Q) or the probit values of gaussian distributions were obtained using the NORMINV function of the Microsoft Excel 97 program. Its arguments are proportion, mean, and standard deviation of the distribution, respectively]:

(3)
We next have to find out the values of pa and pb corresponding to D = 0.40 (unit of D is standard deviation) and Fr = 2.0. Imagine that the lower reference limits of the two distributions in Fig. 1BUp are made to coincide, or that both pa and pb are set equal to 0.025. As distribution b is moved to the right, i.e., as D is increased from its initial value of zero, there has to be an increase of two units of pa for each decreased unit of pb to keep the proportion of the combined distribution outside the common reference limit equal to 2.5% all the time, because distribution b is twice as "heavy" as distribution a. Hence, as pb is decreased from 0.025 to, e.g., 0.024, pa is increased from 0.025 not to 0.026 but to 0.027. Observing this ratio of changes, the pa and pb for D = 0.40 and Fr = 2.0 can be calculated by solving the following equation:

(4)
This equation must be solved numerically, using, e.g., the FindRoot routine of the Mathematica computer program. The solution is x = 0.0084. Consequently, for D = 0.40 and Fr = 2.0, pa = 0.025 + 2(0.0084) {approx} 0.042, and pb = 0.025 - 0.0084 {approx} 0.017. Observe that these new proportions fulfill Eq. 1Up :

Comparing these new proportions with those obtained for Fr = 1.0 at the same value of D = 0.40 (pa = 0.036 and pb = 0.014), we notice first that the sum of the new proportions, 0.059, exceeds 0.05, or 5.0%. In general, as opposed to the proportions in the case of equal prevalences, Eq. 2Up is not valid for the proportions in the case of unequal prevalences.

The second thing that we notice is that, whereas the original pa was below the critical proportion for partitioning, pCritP (4.1% or 0.041), the new pa exceeds this criterion. In other words, if D = 0.40 s, partitioning is recommended at Fr = 2.0 but not at Fr = 1.0. This means, of course, that the critical distance corresponding to the critical partitioning proportion must be shorter for Fr = 2.0 than for Fr = 1.0. This is the case, because if distribution b is "heavier" than distribution a, the lower common reference limit follows closer to distribution b, which makes pa grow faster and reach the critical proportion at a shorter distance. Therefore, if a reference interval study is being carried out without consideration of the prevalences, it will be easy to make false decisions on partitioning. It is obvious that the relationship between the critical distances and the ratio of prevalences needs to be examined systematically.

distance criteria for partitioning with prevalences taken into consideration
To examine the behavior of the critical distances as correlated to the ratio of prevalences, we consider two cases or "topographies of prevalences" separately. In one of these topographies, the distribution with the more extreme value of the reference limit (distribution a in Fig. 1Up ) has the smaller prevalence (topography of prevalences A, abbreviated TPrevA), and in the other topography this distribution has the larger prevalence (topography of prevalences B, or TPrevB).

However, consideration of these topographies of prevalences is not sufficient because the calculations are complicated by one or more sources of variability. The subgroup distributions may also have different standard deviations, and we need, therefore, to consider both ends of these distributions not only from the point of view of prevalences but also from that of standard deviations. Analogous to the topographies of prevalences, we must distinguish between a case in which the distribution with the more extreme value of the reference limit (distribution a in Fig. 1Up ) has the smaller standard deviation (topography of standard deviations A, or TStDevA) and the other case, in which this distribution has the larger standard deviation (topography of standard deviations B, or TStDevB).

Taken together, the topographies of prevalences and standard deviations involve four different topographic combinations, indexed AA, AB, BA, BB, where the first letter refers to the topography of prevalences and the second letter to the topography of standard deviations. These four combinations of topographies are depicted schematically in Fig. 2 .



View larger version (16K):
[in this window]
[in a new window]
 
Figure 2. Illustration of the four topographic combinations of the subgroup distributions that need to be covered when developing generally applicable critical distances.

The topographies are considered as seen from the left-hand sides of the panels, with the lower reference limit of distribution a imagined as the origin and distribution b as having been shifted to the right from this origin. The proportion of distribution a outside the common reference limit will thus always be larger than that of distribution b. Although we are defining the topographies by considering the lower end of the distributions, the same rules apply at the upper end with only self-evident adjustments needed. (A), topographic combination TPrevA, TStDevA. Distribution a, which has a smaller value of the lower reference limit, has a smaller prevalence and a smaller standard deviation than distribution b. (B), topographic combination TPrevA, TStDevB. Distribution a, which has a smaller value of the lower reference limit, has a smaller prevalence but a larger standard deviation than distribution b. (C), topographic combination TPrevB, TStDevA. Distribution a, which has a smaller value of the lower reference limit, has a larger prevalence but a smaller standard deviation than distribution b. (D), topographic combination TPrevB, TStDevB. Distribution a, which has a smaller value of the lower reference limit, has a larger prevalence and a larger standard deviation than distribution b. Observe that distribution a has a larger mean than distribution b in panel D. The order of the subgroup distributions, used to define the topographies, is based on the order of the reference limits, not that of the means.

In Appendix 2, published in a data supplement that accompanies this article at Clinical Chemistry Online (www.clinchem.org/content/vol48/issue11/), we present in detail the rather tedious theoretical work needed to derive analytical expressions relating distances between reference limits and the ratio of prevalences for each of the four topographic combinations depicted in Fig. 2Up . In parallel with derivation of these expressions, Appendix 2 describes how the critical distances can be approximated over ratios and topographies of standard deviations as a function of the ratio of prevalences for each of the two topographies of prevalences, TPrevA and TPrevB. The results of this approximation process are summarized graphically in Fig. 3 , covering the range 1.0 <= Fr <= 10, and as a list of selected values in Table 1 , covering a larger range of Fr.



View larger version (35K):
[in this window]
[in a new window]
 
Figure 3. Nomogram showing the approximate critical partitioning (DCritPAave and DCritPBave) and nonpartitioning distances (DCritNPAave and DCritNPBave) as functions of the ratio of prevalences (Fr) for the topographies of prevalences A and B, respectively, plotted on a logarithmic Fr scale.

This graphical presentation is restricted to the interval 1.0 <= Fr <= 10.0 to maximize the resolution, because the critical distances change very little beyond Fr = 10.0. As discussed in Appendix 2, the following approximated values apply for Fr > 10.0: DCritNPAave = 0.13 s, DCritNPBave = 0.16 s, DCritPAave = 0.26 s, and DCritPBave = 0.46 s. Observe that DCritPAave, or the critical partitioning distance for TPrevA, is particularly sensitive to the shrinking effect of increasing Fr, and this is true especially at small values of Fr (remember the logarithmic scale). Because the plots of DCritPAave and DCritPBave differ considerably from each other, the averaging process described in Appendix 2 and illustrated in Figs. 2 and 3 of Appendix 2, carried out successfully over ratios and topographies of standard deviations to develop generally applicable approximate distance criteria, cannot be extended to the topographies of prevalences. One is therefore obliged to consider the topographies of prevalences even when applying this nomogram. The distances are measured in s, or the standard deviation of the narrower one of the two subgroup distributions.


View this table:
[in this window]
[in a new window]
 
Table 1. Critical distances corresponding to selected values of Fr for TPrevA (A) and TPrevB (B).1

proportion criteria for partitioning with prevalences taken into consideration
The proportion criteria to determine the need for reference interval partitioning are not themselves affected by the prevalences, i.e., the same critical proportions are valid for all Fr. However, the positions of the common reference limits obviously depend on the numbers of reference values included in the calculations from each subgroup. If more reference values are included in either of the subgroups, the common reference limits will tend to be shifted toward the reference limits of the distribution corresponding to this subgroup.

To determine correct positions for the common reference limits, one should therefore not automatically include all of the reference values in the calculations. This would lead to 2.5% of these particular values lying outside the calculated reference limits, whereas we are in fact interested in obtaining such limits that 2.5% of the true source population would fall outside each of them. Hence, the ratio of the numbers of reference values included in the calculations should actually reflect the ratio of prevalences.

To make the ratio of numbers of reference values equal to the ratio of prevalences, one should either exclude an appropriate proportion of reference values from one of the subgroups by use of some randomization procedure or add new reference values to the other subgroup by use of stochastically selected replicates of the original values. It is perhaps better to create new values using replicates than to eliminate a part of the original values, but either approach involves statistical manipulation of the original material. In contrast to the proportion criteria, use of the distance criteria leaves the original material intact and is statistically perhaps more sound. However, if the conclusion is nonpartitioning, the common reference limits will be required in any case, and the problems inherent in determining these limits will be present even when the distance criteria are used.

Another way to determine the common reference limits is to use the numerical calculation method described below in the Application Example section. This method is based on first calculating the proportions outside each common reference limit and then identifying the common reference limits using these proportions.

guidelines for partitioning gaussian-distributed data
These guidelines are a generalization of the guidelines presented earlier (5), intended to cover all ratios of prevalences and standard deviations between the two subgroups, evaluated for partitioning.

Partitioning criteria as expressed in proportions of the subgroup distributions outside the common reference limits.

Partitioning criteria as expressed in approximate distances between the reference limits of the subgroup distributions (easy-to-use distance criteria read from the nomogram).

Partitioning criteria as expressed in accurate distances between the reference limits of the subgroup distributions.
If accurate critical distances are desired, these can be calculated from Eqs. 3, 6, 12, and 13 in Appendix 2 for any particular values of R and Fr. To apply these equations, the topographies of both standard deviations and prevalences of the subgroup distributions need to be considered, as illustrated in Fig. 2Up .

If critical proportions other than those recommended by ourselves are to be used, such new critical proportions need to be substituted for pa and pb in Eqs, 3, 6, 12, and 13 of Appendix 2. In addition, the cutoff value for R needs to be recalculated, using the procedure described in Appendix 1.

These two criterion categories, proportions or distances, can be applied independently, using the type of data that is better available. Application of the distance criteria presupposes that both subgroup distributions are gaussian.

As far as partitioning into more than two subgroups is concerned, we wrote earlier (5) that such cases could be solved by applying the guidelines for partitioning to each pair of these subgroups at a time. If the prevalences are equal, this approach is valid in most instances because the approximate DCritP (0.75 s) calculated by us (5) was three times as large as the approximate DCritNP (0.25 s). The case of more than two subgroups having unequal prevalences may be more problematic because inequality of the prevalences may decrease the ratio of DCritP to DCritNP from the value of 3 to <2 at certain values of Fr (Fig. 3Up ). We will therefore probably return to the case of several subgroups in more detail in a later study.


   Application Example
Top
Abstract
Introduction
Theory
Application Example
Discussion
References
 
To illustrate the theory, we will consider the following application example, contrived for this particular purpose. Let there be two gaussian-distributed subgroups, called 1 and 2, that involve 1300 and 1000 reference values, respectively (Table 2 ).


View this table:
[in this window]
[in a new window]
 
Table 2. Specifications of the subgroups of the application example.

Let the mean and the standard deviation of the subgroup distributions be 100 and 10 for subgroup 1, and 103 and 11 for subgroup 2, and let the prevalences of the subgroup populations in the source population be 0.25 and 0.75, respectively. The distributions are illustrated in Fig. 4 both as probability distributions (Fig. 4A ) and as frequency distributions using the numbers of reference values (Fig. 4B ) and the prevalences (Fig. 4C ) as frequencies. In what follows, these three cases will be called A, B, and C for convenience.



View larger version (27K):
[in this window]
[in a new window]
 
Figure 4. Distributions of the application example.

(A), the distributions characterized in Table 2Up of the application example are depicted as probability distributions. The common reference limits (thick vertical bars) and the proportions outside these limits at both ends of the distributions are shown, as listed in Table 3Up . (B), as in panel A, but the distributions are depicted as frequency distributions, with the numbers of reference values in the subgroups as frequencies. Observe the shift of the common reference limits toward the reference limits of distribution 1 and the corresponding changes of the proportions, compared with panel A. (C), as in panel A, but the distributions are depicted as frequency distributions with the prevalences of the subgroups as frequencies. The common reference limits have been shifted toward the reference limits of distribution 2, and the proportions have changed accordingly, compared with panel A.

We will first examine partitioning of subgroups 1 and 2 using the approximate distance criteria as read from Table 1Up , and thereafter using the accurate distance criteria, as calculated from the equations of the theory. Finally, we will calculate the proportions of the subgroup distributions outside the common reference limits as well as these limits by solving the pertinent equations numerically.

case a
In Fig. 4AUp , the distributions are depicted as probability distributions, determined by their means and standard deviations, i.e., we are ignoring both the numbers of reference values and the prevalences. The reference limits of the subgroup distributions will be the same for the cases A–C because these limits are calculated separately for each subgroup distribution, using the same reference values in all cases. In contrast, the reference limits of the combined distribution depend on the ratio of prevalences. However, in this example we will misuse the ratio of the numbers of reference values in case B and account for the prevalences only in case C. We will therefore obtain different common reference limits for each case. In case A, the common reference limits are primarily of theoretical interest, because in practice the numbers of reference values will most often be considered—either correctly, accounting for the prevalences, or incorrectly, using them as they are—to construct the combined distribution and to calculate the common reference limits. For comparison, we will also identify the common reference limits in case A, using the mathematical properties of gaussian distributions.

Approximate distance criteria.
Following the guidelines for partitioning, presented above, we calculate that R = 11/10 = 1.1 < 1.36 and observe that this ratio is in itself not large enough to support partitioning. Because we are ignoring the prevalences in case A, Fr = 1.0. The reference interval of the subgroup distribution 1 is 100 ± 1.96 x 10 = 80.4–119.6, and that of the subgroup distribution 2 is 103 ± 1.96 x 11 = 81.4–124.6. Hence, we obtain DL = (81.4 - 80.4)/10 s = 0.10 s, and DU = (124.6 - 119.6)/10 s = 0.50 s (observe that the unit is the smaller standard deviation).

Because Fr = 1.0, we do not have to be concerned about the topographies of prevalences in case A, and because we are intending to use the approximate critical distances, we do not have to be concerned about the topographies of standard deviation either. The approximate critical distances can be read from either Table 1AUp or Table 1BUp because the first rows of these tables, corresponding to Fr = 1.0, are identical. Hence, we obtain the following critical distances for case A: DCritPAave = DCritPBave = 0.68 s and DCritNPAave = DCritNPBave = 0.27 s. Because DL = 0.10 s lies below DCritNPAave (or below DCritNPBave), the conclusion from the lower end is nonpartitioning, and because DU = 0.50 s lies between DCritNPAave and DCritPAave (or between DCritNPBave and DCritPBave), the conclusion from the upper end is equivocal. Following our guidelines, the final conclusion suggested by this approximate method is thereby equivocal, i.e., the decision on partitioning should be made using nonstatistical considerations, such as clinical judgment or literature data.

Accurate distance criteria.
The lower end of the distributions has topography of standard deviations TStDevA, because distribution 1, having the smaller value of the lower reference limit, also has a smaller standard deviation. Hence, the accurate critical distances for this end can be calculated analytically by setting Fr = 1.0 in Eqs. 3 or 12 of Appendix 2 (these are identical for Fr = 1.0) as follows (we will use Eq. 3 of Appendix 2):


Similarly, the accurate critical distances for the upper end, having a topography of standard deviations TStDevB, can be obtained by use of either Eq. 6 or Eq. 13 in Appendix 2, which are identical for Fr = 1.0 (we will use Eq. 13 of Appendix 2):


These equations account for the particular value of R and for the topographies of standard deviations, whereas the approximate method presented above averaged both over R = 1.0 and R = 1.36 and over the two topographies of standard deviations. The critical distances obtained by the two methods deviate from each other by 0.03 s (DCritPBB) at most (Table 3 ), i.e., they are nearly identical. Observe that all of the accurate critical distances are slightly smaller than the respective approximate ones. This is partly attributable to averaging over R = 1.0 and R = 1.36, because the critical distances tend to increase with increasing R (consider Fig. 2 in Appendix 2), and R = 1.1 lies closer to the lower end of the R scale. Although the conclusions at both ends obtained by applying the approximate critical distances are the same in this case as those obtained for the accurate critical distances, it is possible to obtain different outcomes in borderline cases, as illustrated by case C below.


View this table:
[in this window]
[in a new window]
 
Table 3. Critical distances, both approximate and accurate; proportions outside the common reference limits; and these limits at both ends of the distributions in cases A, B, and C of the application example.

Proportion criteria.
We will next apply the proportion criteria to case A. Instead of first localizing the common reference limits and then calculating the proportions of the subgroup distributions outside these limits, as Harris and Boyd (4) did using computer simulations, we will first calculate the proportions and determine the common reference limits thereafter using these proportions. As an example, we will detail the calculations needed to determine the proportions and the common reference limit at the lower end of case A. If we equate the distance between reference limits as expressed in Eq. 3 of Appendix 2 with DL = 0.10 (the unit is s on both sides, so we will omit it), we will obtain:

The solution of this equation, obtained numerically, is pa = 0.028, or 2.8%. From Eq. 2Up , we further calculate that pb = 2.2%. These proportions outside the lower common reference limit are shown in Fig. 4AUp . The lower common reference limit can now be calculated as Q(0.028;100;10) = Q(0.022;103;11) {approx} 80.9. Using Eq. 13 in Appendix 2 we find in the same way that the upper common reference limit of case A lies at 122.5 and that this limit will cut off 1.2% from distribution 1 and 3.8% from distribution 2. The data for case A are summarized in the first column of Table 3Up .

We now are ready to apply the proportion criteria, as presented in the guidelines. At the lower end both proportions are smaller than the critical proportion for nonpartitioning (pCritNP = 3.2%), and both of them exceed the complementary pCritNP = 1.8%. Hence, the conclusion from this end is nonpartitioning. At the upper end, one of the proportions lies between pCritNP = 3.2% and pCritP = 4.1%, and the other one lies between the complementary pCritP = 0.9% and the complementary pCritNP = 1.8%. The conclusion from this end is consequently equivocal, and this is also the final conclusion for case A, using the proportion criteria. Hence, the conclusions from both ends, as well as the final conclusion, agree with the conclusions obtained above using the distance criteria for this case.

case b
Case B illustrates the method used most often to solve the partitioning problem today, i.e., by considering the numbers of reference values but ignoring the prevalences. The common reference limits are usually calculated by involving all the reference values that one happens to have in the subgroups. These limits and the proportions outside them thus reflect the ratio of the numbers of reference values instead of that of the prevalences. We will first solve case B by the Harris Boyd model, which is at present the most widely applied partitioning method. It does not account for the prevalences, because originally, when developing their model, Harris and Boyd assumed equal sample sizes and thereby, implicitly, equal sample prevalences (10).

The Harris-Boyd model.
If the Harris-Boyd model is applied to our example by inserting the means, the standard deviations, and the numbers of reference values from Table 2Up in the expressions for the test parameter and the critical values of this model (4, 5, 10), we will obtain a z-value of 6.7 and a critical range zCrit3zCrit5 of 9.3–15.5. Because the z-value lies below the critical range, the conclusion suggested by the Harris-Boyd model is nonpartitioning.

The present model.
In this example, the numbers of reference values in the subgroups are 1300 and 1000, respectively (Table 2Up ). To account for these numbers in the calculations, we will treat them as weight factors of the distributions, even if this is incorrect, as was discussed in the Background section of this article. Hence, we will have to treat the ratio of these numbers (1300/1000 = 1.3) as if it were a ratio of prevalences and observe the topographies accordingly. Under this assumption, the topographies in case B are obviously TPrevB, TStDevA at the lower end and TPrevA, TStDevB at the upper end. Using Table 1Up , B and A, and Eqs. 12 and 6 of Appendix 2, respectively, we can easily obtain the values for the critical distances in case B, reported in Table 3Up and, applying the numerical method described above, also determine the proportions and the common reference limits in this case, shown in Table 3Up and Fig. 4BUp .

Considering the results listed in Table 3Up , we notice that the conclusions from both ends are identical to those obtained for case A, whichever criteria we apply. Observe in particular that both the distance criteria, the approximate ones as well as the accurate ones, and the proportion criteria lead to the same conclusion even at the upper end, although this end is clearly a near-to-borderline case with respect to partitioning. We selected the ratio of the numbers of reference values deliberately to make the larger proportion close to the critical value of 4.1% to test the precision of the distance criteria as correlates of the proportion criteria. The accurate critical distance (0.52 s) lies very near to DU = 0.50 s and so does the approximate critical distance (0.54 s). Had DU been only 0.02 s or 0.04 s wider, respectively, or the proportion only 0.1% larger, the conclusion would have been partitioning. Hence, correspondence between proportions and distances seems quite good. Notice also that the approximate and the accurate distances differ from each other by only 0.02 s at most (Table 3Up ).

Real-life common reference limits as calculated with use of the reference values of the combined distribution would probably differ slightly from those calculated with our numerical method. However, both methods should in principle produce the same common reference limits because both of them weigh distribution 1 in the proportion of 1.3 to 1 with respect to distribution 2. Observe how this weighing has shifted the common reference limits slightly toward the reference limits of distribution 1 from the situation in case A. The changes of the proportions between cases A and B reflect these transitions of the common reference limits in a logical way. All of these tendencies would be more marked if the number of reference values in subgroup 1 were increased further with respect to that of subgroup 2.

In spite of the agreement obtained by use of proportions and distances in both this and the previous case, these results are nonsense; the numbers of reference values should neither be used to define common reference limits nor to make conclusions on partitioning, and the prevalences have not yet been considered.

case c
Case C is intended to be an illustration of how the partitioning problem should, in our opinion, be solved, i.e., by accounting for the prevalences. The ratio of prevalences in this example is equal to 3.0 (Fr = 0.75/0.25 = 3.0), and the topographies in case C are TPrevA, TStDevA at the lower end and TPrevB, TStDevB at the upper end (observing Fig. 4CUp , one might suggest that the topography at the lower end should rather be TPrevB, TStDevB, but the distribution having the smaller value of the lower reference limit is distribution 1, as shown in Table 2Up ). Hence, the results of this case, listed in Table 3Up , can be obtained using Table 1Up , A and B, and Eqs. 3 and 13 in Appendix 2, respectively.

The decision at the lower end is once again nonpartitioning, as can be concluded by use of either the distance or the proportion criteria. However, the situation at the upper end is precarious because we have intentionally chosen the ratio of prevalences to obtain the critical partitioning value of 0.9% for the smaller proportion. As we see, the accurate DCritP = 0.50 s = DU, as expected, i.e., using the accurate critical distance, we would achieve a perfect correspondence with the proportion criterion. In contrast, the approximate DCritP (0.53 s) exceeds DU slightly, and use of it would accordingly lead to an equivocal conclusion. In spite of this discrepancy in conclusions, the agreement between the approximate and the accurate distances is still good, 0.03 s being the largest difference between these distances (Table 3Up ).

It is obvious that in borderline cases the approximate distance will most often deviate in either direction from the accurate distance and easily produce a false conclusion. The only way to assure oneself of making a correct decision in such cases is to use the accurate distances.

Observe that in case C the common reference limits have been shifted toward the reference limits of distribution 2, in accordance with this distribution having the larger prevalence, and the changes in proportions are opposite to those seen in case B (Table 3Up and Fig. 4CUp ).

the significance of the numbers of reference values
Apart from case B, which was an example of how the partitioning problem should not be solved, we have not yet had any use of the numbers of reference values in the subgroups. However, the reference limits of each of the subgroup distributions are calculated separately by use of the respective reference values. Moreover, the numbers of reference values determine the statistical quality of reference limits and, thus, the quality of distances between these limits. The 90% confidence intervals (CIs) about the 2.5 and 97.5 percentiles of a gaussian distribution can be calculated as follows (5)(6):

(5)
In our example, we have 1300 reference values in subgroup 1 and 1000 reference values in subgroup 2. Inserting these values and the respective standard deviations of 10 and 11 in Eq. 5Up , we obtain the following 90% CIs for the reference limits of distributions 1 and 2, respectively: ± 0.78 and ± 0.97. The CIs for the distances between reference limit pairs can be calculated as ± (0.782 + 0.972)1/2 = ± 1.24, or ± 0.12 s, where s is the standard deviation of distribution 1.

Conclusion
Although this example was a contrived one and the parameters were manipulated to obtain borderline situations, the characteristics of the distributions (Table 2Up ) should be rather typical for two subgroups subjected to comparison in a reference interval study. Because the demographic descriptors of these groups are often similar, their means and standard deviations tend to lie close to each other. The need for partitioning in many cases thus is not self-evident but must be examined carefully.

The ratio of prevalences was set to 3.0, by no means an extreme value for this ratio and large enough to make partitioning recommended, in this example, whereas our first guess, when considering Fig. 4AUp , may well have been nonpartitioning. Partitioning was indeed the final conclusion based on both the distance and the proportion criteria, when the prevalence effect was accounted for. When this effect was ignored, we obtained other conclusions with both the present model, which suggested an equivocal conclusion, and the Harris-Boyd model, which suggested nonpartitioning. That consideration of the prevalences tends to favor partitioning was an expected result because the prevalence effect makes the critical distances shorter and partitioning as the conclusion more probable.

Proper consideration of the prevalences is obviously of vital importance whenever decisions on partitioning are to be made. Such decisions should, in our opinion, not be made at all if the prevalences are not known. Very clear-cut cases where the distances between the reference limits exceed the critical distances for Fr = 1.0 are an exception from this rule, of course.

As shown by this application example, the accuracy of the approximate distances should be high enough for most practical purposes. The method is relatively easy to use even for hand calculations: one has only to observe the prevalences, consider the topographies of prevalences, calculate the differences between the reference limits, and compare these differences with the critical distances read from Table 1Up or Fig. 3Up . The analytical calculations are not hard to apply either, and we recommend them when high precision is desired, particularly at borderline situations, as demonstrated in case C. Because imprecision is unavoidable whenever statistically calculated parameters are being used, the accuracy of such analytical calculations is not absolute, however. It is important to keep this fact in mind when performing analysis of reference data in practice.


   Discussion
Top
Abstract
Introduction
Theory
Application Example
Discussion
References
 
To our knowledge, prevalences have not been considered as a component of the partitioning problem before. Regardless of whether awareness of the importance of the prevalences has ever existed, attempts to account for unequal prevalences seem not to have been made in earlier partitioning models. Notably, the Harris-Boyd model, which probably is the most widely applied partitioning model today, does not account for the prevalences. If the prevalence effect were to be incorporated in the Harris-Boyd model afterward, one should adjust the proportions outside the common reference limits, obtained with computer simulations by Harris and Boyd (4)(10), to various ratios of prevalences (Fr). For each series of adjusted proportion vs z-value curves, corresponding to selected values of R and to one particular value of Fr, one should determine the critical z-value. But, as was discussed in our previous report on partitioning (5), determination of critical distances with precision is not easy in the Harris-Boyd model. Hence, a nomogram presenting critical distances as functions of Fr could not be very accurate, either, in that model.

Application of the present model to the partitioning problem is undeniably more laborious than that of any of the earlier models, but as the only one to account for the prevalence effect, the present model hopefully enables more reliable decision-making on partitioning. In contrast to earlier models, it can also guarantee a good correspondence between the proportions of the subgroup distributions outside the common reference limits and the distances between the reference limits of these distributions, as demonstrated in the application example above.

Because the prevalence effect not only concerns the distance criteria but also application of the proportion criteria, the prevalences should be considered whenever the need for partitioning is being evaluated. The easiest way to do so is to use the approximate distance criteria, plotted in Fig. 3Up and listed in Table 1Up . Both the accurate distance criteria and the proportion criteria require slightly more engagement in the calculations, whereas the accuracy of the approximate distance criteria probably suffices for most practical purposes.

As Fig. 3Up illustrates, the critical partitioning distances shrink quite rapidly, particularly at the very beginning of the Fr scale: for TPrevA, DCritPave is 0.49 s at Fr = 1.5, whereas it is 0.68 s at Fr = 1.0. It is likely that most often when a poorly documented assumption or a mere guess on the equality of the prevalences is made, these are in reality not quite equal, but their ratio takes values somewhere near the origin of the Fr scale, i.e., precisely where the critical distances change most unpredictably. Hence, the critical distances may be quite different from what they are supposed to be, and the risk of false decisions on partitioning is considerable even at ratios of prevalences close to unity. How should earlier studies on partitioning, which did not account for the prevalence effect and which may have included an implicit false assumption of equal prevalences, actually be judged? We feel inclined to recommend careful reconsideration of all such studies where the decision on partitioning has not been clear-cut. Obviously, if the distances between the reference limits have been either larger than the largest of our DCritP (0.68 s) or smaller than the smallest of our DCritNP (0.11 s), one can be confident on the adequacy of partitioning and nonpartitioning, respectively, whatever the topographies of the subgroup distributions have been.

As the application example demonstrates, decisions on partitioning may depend on rather small changes in the proportions and the distances in absolute terms. However, these changes are not necessarily small compared with the ranges of the respective parameters. As an example, in the case of equal prevalences, the larger proportion outside the common reference limits has a range that extends from 2.5% to 5.0%. We (5) divided this range of 2.5% into nonpartitioning, equivocal, and partitioning areas, which have widths of 0.7% [2.5–3.2%), 0.9% [3.2–4.1%), and 0.9% [4.1–5.0%), respectively. Divisions other than this one, based on the analytical bias quality specifications (5)(7)(8)(9), may well turn out to be better for partitioning purposes in the future, but whatever these new divisions are, the room for play is in fact not too large. One operates with tenths of a percent at any case. Hence, tenths of a percent may also be significant when decisions on partitioning are being made. How these tenths of a percent convert themselves to changes of distances depends on Fr and on the topographies of the distributions (as an example, in case B of the application example two-hundredths of an s unit corresponded to one-tenth of a percent). Obviously, whichever criteria are being used, even changes that are very small in absolute terms may be significant.

These changes should be compared with the statistical imprecision of the respective test parameters. The CIs of reference limits (Eq. 5Up ) are probably most often so large that they could change the conclusion on partitioning if the distances between reference limits were extended accordingly. But imprecision of the test parameters is inherent to any statistical model, and as was discussed in our previous report (5), it is in fact usually smaller in the present model than in the Harris-Boyd model. Because of this unavoidable imprecision, common sense should always guide interpretation of the results suggested by these models.

If prevalences have the influence on partitioning that we believe they have, one should carefully observe them in every reference interval study. The prevalences are sometimes evident, for example, the prevalences of both genders are 0.5 in most populations, and sometimes they have been established in epidemiologic studies, but probably most often they are neither evident nor known from the literature. In such cases, they should be determined or estimated by the organizers of the reference interval study. Ideally, one should know even the prevalences of smokers and nonsmokers, birth control pill users and nonusers, or joggers and nonjoggers in the source population when partitioning of these groups is being examined. Because the prevalences of these lifestyle groups vary over time, estimates trying to assess long-term average values for these prevalences might be the only feasible way to go. In contrast, when partitioning of, e.g., defined genetic subgroups is studied, the prevalences should be determined rigorously before partitioning is undertaken.

In conclusion, we have shown that the prevalences of the subgroup populations in the source population of a reference interval study have a significant effect on the partitioning criteria used to determine whether these subgroups need separate reference intervals. The critical distances can be shortened substantially because of this prevalence effect, and negligence of the prevalences may therefore easily lead to false decisions concerning partitioning. The proportion criteria themselves do not depend on the prevalences, but when these criteria are applied, the prevalences should be accounted for by adjusting the sample sizes to make them correspond to the ratio of prevalences. We have developed a nomogram on approximate critical distances to enable an effective and, for most practical purposes, accurate enough way to account for the prevalence effect in the case of gaussian subgroup distributions. We also offer analytical expressions to be used whenever accurate critical distances, corresponding to any desired critical proportions, are needed.


   Acknowledgments
 
We gratefully acknowledge financial support for this study from NorFA (Nordisk Forskerutdanningsakademi), the Department of Clinical Chemistry at Rikshospitalet University Hospital of Oslo, and the Nordic Reference Interval Project.


   Footnotes
 
1 We accounted in fact for ratios of SD up to 1.5 in our previous report on partitioning (5), because according to Harris and Boyd (4)(10), one of the proportions outside the common reference limits was guaranteed to exceed 4.0% when R >1.5. This cutoff value, considered as a separate criterion for partitioning by Harris and Boyd (4)(10), was originally determined using computer simulations. However, it is also possible to calculate this value. Such calculations, detailed in Appendix 1, which is published in a data supplement accompanying this article at Clinical Chemistry Online (www.clinchem.org/content/vol48/issue11/), suggest that instead of R = 1.5, it suffices to choose R = 1.36 as an upper limit for R. As explained in Appendix 1, this value applies to any Fr >=1.0, which means that partitioning is needed whenever R >=1.36. Because a lower value for the upper limit of R leads to better quality of the approximate critical distances, we will use the more precise value of 1.36 for it throughout this report.


   References
Top
Abstract
Introduction
Theory
Application Example
Discussion
References
 

  1. Assessing quality in measurements of plasma proteins. The Nordic Protein Project and related projects. Upsala J Med Sci 1994;99:195-389.[Medline] [Order article via Infotrieve]
  2. Ferré-Masferrer M, Fuentes-Arderiu X, Alvarez-Funes V, Güell-Miro, Castiñeiras-Lacambra MJ. Multicentric reference values: shared reference limits. Eur J Clin Chem Clin Biochem 1997;35:715-718.[ISI][Medline] [Order article via Infotrieve]
  3. Nordic Reference Interval Project. http://home.online.no/~rustadp/ref/refprosj.html (Accessed May 15, 2002)..
  4. Harris EK, Boyd JC. Statistical bases of reference values in laboratory medicine 1995:77-91 Marcel Dekker New York. .
  5. Lahti A, Hyltoft Petersen P, Boyd JC, Fraser CG, Jørgensen N. Objective criteria for partitioning gaussian-distributed reference values into subgroups. Clin Chem 2002;48:338-352.[Abstract/Free Full Text]
  6. Solberg HE. Approved recommendation on the theory of reference values (1987). Part 5. Statistical treatment of collected reference values. Determination of reference limits. J Clin Chem Clin Biochem 1987;25:645-656.[ISI]
  7. Fraser CG, Hyltoft Petersen P, Libeer J-C, Ricós C. Proposals for setting generally applicable quality goals solely based on biology. Ann Clin Biochem 1997;34:8-12.
  8. Fraser CG, Hyltoft Petersen P, Ricós C, Haeckel R. Proposed quality specifications for the imprecision and inaccuracy of analytical systems for clinical chemistry. Eur Clin Chem Clin Biochem 1992;30:311-317.
  9. Stöckl D, Baadenhuijsen H, Fraser CG, Libeer J-C, Hyltoft Petersen P, Ricós C. Desirable routine analytical goals for quantities assayed in serum. Discussion paper from the members of the external quality assessment (EQA) working group A on analytical goals in laboratory medicine. Eur J Clin Chem Clin Biochem 1995;33:157-169.[ISI][Medline] [Order article via Infotrieve]
  10. Harris EK, Boyd JC. On dividing reference data into subgroups to produce separate reference ranges. Clin Chem 1990;36:265-270.[Abstract/Free Full Text]



The following articles in journals at HighWire Press have cited this article:


Home page
Clin. Chem.Home page
A. Lahti, P. Hyltoft Petersen, J. C. Boyd, P. Rustad, P. Laake, and H. E. Solberg
Partitioning of Nongaussian-Distributed Biochemical Reference Data into Subgroups
Clin. Chem., May 1, 2004; 50(5): 891 - 900.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available