|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Molecular Diagnostics and Genetics |
1 Rosetta Inpharmatics, Merck & Co., Inc., Seattle, WA; 2 Merck Research Laboratories, Upper Gwynedd, PA.
aAddress correspondence to this author at: Rosetta Inpharmatics, Merck & Co., Inc., 401 Terry Ave. N., Seattle, WA 98109. Fax 206-802-6501; e-mail thomas_fare{at}merck.com.
| Abstract |
|---|
|
|
|---|
- and β-globin transcripts. We describe a means to quantify the interference of globin transcripts on profiling and the effectiveness of globin transcript mitigation by (a) defining and characterizing globin interference, (b) reproducing globin interference with synthetic transcripts, and (c) using ROC curves to measure sensitivity and specificity for a protocol for removing
- and β-globin transcripts. Methods: We collected blood at 2 sites and extracted total RNA in PreAnalytiX PAXgene tubes. As a reference for characterizing interference, we supplemented aliquots of total RNA with synthesized globin transcripts and total RNA from human brain. Selected aliquots were processed with Ambion GLOBINclear to remove globin transcripts. All aliquots were labeled and hybridized to Agilent DNA microarrays by means of pooling schemes designed to quantify the mitigation of globin interference and to titrate gene expression signatures. Quantitative reverse transcription–PCR data were generated for comparison with microarray results.
Results: Our supplementation and pooling strategy for comparing the microarray data among samples demonstrated that mitigation could reduce an interference signature of >1000 genes to approximately 200. Analysis of samples of endogenous globin transcripts supplemented with brain RNA indicated that results obtained with the GLOBINclear treatment approach those of peripheral blood mononuclear cell preparations.
Conclusion: We confirmed that both the absolute concentrations of globin transcripts and differences in transcript concentrations within a sample set are factors that cause globin interference (Genes Immun 2005;6:588–95). The methods and transcripts we have developed may be useful for quantitatively characterizing globin mRNA interference and its mitigation.
| Introduction |
|---|
|
|
|---|
To reduce the occurrences of such signatures, investigators have introduced technologies that decrease the time between blood draw and RNA stabilization [e.g., PAXgene (PreAnalytiX), Tempus (Applied Biosystems)]. Although these systems may stabilize RNA at the point of collection, they introduce interfering signatures because of the abundance and variability in the amounts of
- and β-globin mRNA, which obscure signatures of biological interest (1)(6)(7). The degree of globin message representation in mRNA in whole blood can vary widely, with globin mRNA species constituting up to 70% of whole-blood mRNA in patients with high reticulocyte counts (8). We describe methods for quantifying and characterizing the globin message in samples, how to recognize a globin message–induced signature in profiling data, and the efficacies of measures for mitigating interference by globin messages.
| Materials and Methods |
|---|
|
|
|---|
blood collection and extraction of total rna
We drew blood samples from volunteers after they had provided informed consent and the protocols had been approved. At site A (BSC Labs), 1 unit of whole blood was drawn from each of 10 healthy donors and preserved with EDTA. We filled 20 PAXgene tubes (VWR catalog no. 77776–026) for subsequent extraction of total RNA (sample sets A and D). The EDTA-preserved blood (75 mL) was added to 3 Accuspin tubes (Sigma-Aldrich, catalog no. A7054) within 2 h of collection for PBMC isolation and extraction of total RNA (sample set E) with an in-house semiautomated procedure that follows the protocol (9) accompanying the PAXgene 96 Blood RNA Kit (Qiagen, catalog no. 762331). We conducted no additional purification steps before treating samples with GLOBINclear (Ambion). At site B, we drew blood directly into 10 PAXgene tubes for each of 10 healthy donors and extracted total RNA (sample sets B and C). We created intradonor pools for each donor/extraction protocol set (see Pooling and Supplementation Strategy and Fig. 1
).
|
supplementation with globin transcripts
We added
- and β-globin transcripts to human whole-blood samples containing total RNA and used these supplemented samples to quantify the reduction in the globin message and the mitigation of the interference. We used multiple sequences of globin transcripts in the NCBI Reference Sequence collection (RefSeq) to construct consensus sequences for
- and β-globin mRNA (see Figs. 1 and 2 in the Data Supplement that accompanies the online version of this article at http://www.clinchem.org/content/vol54/issue2 ) and added 30 nucleotides of poly(A) 3' to the sequence. We then submitted these sequences to Blue Sky Biotech for subcloning and in vitro transcription. They isolated full-length clones, subcloned them, and sequenced the clones for verification. After we verified the transcript sequences with our RefSeq consensus, a sequence analysis identified a point mutation in the 3' untranslated region of the β-globin clone that coincided with the array probe for the β-globin transcript; regardless, the array probe was saturated because of the abundance of the β-globin message.
supplementation with human brain total rna
We obtained human brain total RNA from BioChain (catalog no. R1234035–50, lot no. A703158) and added it to aliquots of peripheral blood total RNA at the following concentrations: 0 mg/g, 0.5 mg/g, and 5 mg/g. We then used the resulting exogenous brain RNA signature and ROC curve analysis to evaluate the sensitivity and specificity of each mitigation protocol (10).
globin transcript mitigation
Ambions GLOBINclear product, a globin transcript–mitigation technology for removing
- and β-globin transcripts, is used to pretreat total RNA before amplifying a target. We precipitated a 2-µg aliquot of each sample in ethanol, resuspended the sample in 14 µL water, and processed the sample with the GLOBINclear kit according to the manufacturers protocol (11). In brief, we mixed custom biotinylated oligonucleotides complementary to globin RNA sequences with RNA prepared from blood samples and annealed the oligonucleotides to
- and β-globin transcripts. We then added streptavidin-coated paramagnetic beads to bind the biotinylated duplexes and remove the captured globin transcripts from the preparations of total RNA, adjusted the GLOBINclear-treated samples of total RNA to 150 µL, and used aliquots for quantitative reverse transcription–PCR (qRT-PCR) and microarray experiments.
pooling and supplementation strategy
Characterization of globin transcript interference in microarray data (sample sets A and B).
We used several pooling strategies to generate reference channels for the 2-color array format used in this study. In one strategy (Fig. 1A
), we pooled total RNA extracted from blood that had been aliquoted into PAXgene tubes from a given donor, combined copy RNA (cRNA) generated from each donor pool of total RNA to form a multidonor pool [also referred to as a mass-balanced, self-referenced pool (10)] to make interdonor comparisons, and compared each donor with the pool. We used the resulting hybridization data to characterize globin transcript interference. We used this pooling strategy with sample sets A and B to quantify such interference, with 2 separate interdonor comparisons as examples.
Creating and mitigating interference by synthetic globin transcripts (sample set C).
In another strategy (Fig. 1B
), we combined total RNA from multiple donors to create a multidonor pool for supplementation experiments. We split aliquots of pooled total RNA into separate containers to which we had added a titration of synthetic globin transcripts. We generated cRNA from each supplemented sample and formed a mass-balanced, self-referenced pool from all cRNA preparations for fluor-reverse pairing in hybridization experiments. We used sample set C to quantify the effects of globin transcript mitigation on microarray data via comparison with untreated samples.
Mitigation of an endogenous globin interference (sample sets D and E).
In the third set of experiments, we quantified the effect of endogenous globin interference on the ability to recover an exogenous brain signature. Sample sets D and E were formed according to the pooling scheme summarized in Fig. 1C
. We split a donor sample into aliquots and added different known amounts of brain RNA into the individual aliquots. cRNA was made from each aliquot, and a self-referenced pool of all donors was made from cRNA synthesized from aliquots with no added brain RNA. We hybridized cRNA from the supplemented samples against cRNA synthesized from samples with no supplemented brain RNA and measured the ability to identify brain-specific sequences for sample sets D and E, with and without mitigation of globin mRNA.
qRT-PCR analysis with taqman and sybr® green
We selected primer sets targeted to the second intron/exon junctions of both
- and β-globin mRNA. We submitted sequences for 200-nucleotide stretches spanning intron/exon junction 2 of
- and β-globins to Applied Biosystems Assay-by-Design service for TaqMan primer/probe sets, which produced the following forward, reverse, and reporter
-globin sequences, respectively: 5'-GCACGCGCACAAGCT-3', 5'-GGGTCACCAGCAGGCA-3', and 5'-ACTTCAAGCTCCTAAGCCAC-3'. The forward, reverse, and reporter β-globin sequences were 5'-AAGCTGCACGTGGATCCT-3', 5'-GATGGGCCAGCACACAGA-3', and 5'-CCCAGGAGCCTGAAGTT-3', respectively. We also used these primers for SYBR Green qRT-PCR analysis. For total RNA and cRNA, we used Applied Biosystems High Capacity Archive Kit (catalog no. 4322171) with 10 ng of each sample to generate T7- dT–primed cDNA. In short, total RNA was dried down and resuspended in 20 µL master mix containing 1x deoxynucleoside triphosphates and 1x RT buffer, 50 U MultiscribeTM, 40 U RNase OUT (Invitrogen, catalog no. 10777–019) and 5 pmol oligo-dT (Ambion, catalog no. AM5730G). Reactions were carried out on a DNA Engine Tetrad (MJ Research) at 70 °C for 5 min, 37 °C for 120 min, and 95 °C for 5 min).
We conducted quadruplicate TaqMan or SYBR Green assays with 100-pg cDNA aliquots and the primer or probe, respectively. We carried out all reactions simultaneously in an Applied Biosystems 7900HT Fast Real-Time PCR System and calculated results by means of a relative-abundance method (12)(13).
sample amplification and microarray analysis
We profiled and analyzed samples of total RNA extracted from human whole blood (14)(15). In brief, we amplified total RNA from blood by means of a modified 2-round reverse transcription reaction mediated by Moloney murine leukemia virus reverse transcriptase, followed by in vitro transcription and labeling with a Cy dye. Samples were hybridized to custom 25 000–probe oligonucleotide arrays (Agilent Technologies) in fluor-reverse pairs. Scanned microarray images were processed with a feature extractor developed in house with MATLAB (The MathWorks). The feature extractor automatically locates the arrayed features on a scanned image, calculates the mean pixel intensity for each feature, and flags features that either show artifacts or have intensities at the scanners background or saturation level. Feature intensities normalized by the mean intensity of the nonflagged features for the Cy3 and Cy5 channels are used to form ratios of the 2 channels for each reporter. An in-house error model based on a null hypothesis of no differential regulation was used to assess the significance of an observed ratio (16). Hybridization-ratio data were analyzed with MATLAB.
| Results |
|---|
|
|
|---|
|
We have arranged the experiments summarized in Fig. 2
, A and B, on the vertical axis in descending order of qRT-PCR–derived globin transcript content. This ordering shows the interference as a globin content–dependent pattern of gene regulation. One can divide the pattern into 2 regions: cross-hybridization and normalization (Fig. 2A
). We call cross-hybridization a clustering of differentially expressed genes that can be associated with AT-poor/GC-rich probes when experiments are arranged in order by globin mRNA content in heat maps. Within the constraints of the 3'-biased probe selection, we obtained approximately 1250 cross-hybridization genes (P <0.01) and have observed gene sets as large as 2500 in more severe cases (data not shown). In ratio-based microarray experiments, we used the mean intensity of all biological features to normalize feature intensities in each channel over the entire chip (16). In this case, the intensity due to cross-hybridization is sufficient to skew channel normalization, producing an "inverse" effect. In other words, for a set of genes found to be up-regulated because globin transcript cross-hybridization, there is another set of genes calculated to be down- regulated, compensating for the skewed signal levels. In this sense, the normalization effect is a consequence of globin transcript cross-hybridization interference. Specifically, sample set B has fewer than 250 cross-hybridization genes, compared with 1250 in sample set A. Consequently, the normalization interference is not as pronounced in sample set B, and the globin-dependent pattern is not as robust.
To investigate further the difference between the sample sets, we analyzed the qRT-PCR data and their relationship to the microarray data. Fig. 2C
is a bar chart of the qRT-PCR data for sample sets A and B and is arranged by globin transcript content. We note that the overall content of
- and β-globin transcripts as a percentage of total RNA is higher in sample set A. Furthermore, sample set A shows an absolute difference of approximately 3% in the proportion of globin RNA between the samples with the highest and lowest proportions (approximately 4.5% and approximately 1.5%, respectively; Fig. 2C
), which represents a 3-fold change in the relative amount of globin transcripts. We see with sample set B a corresponding absolute change of 0.7% or a relative change of just over 2-fold (approximately 1.3% and 0.6%, respectively). This analysis reveals that the total abundance of and variation in globin RNA content for sample set A is greater than for sample set B.
In Fig. 2D
, we show how globin mRNA concentration expressed as a percentage of total RNA (as measured by qRT-PCR) correlates with the mean log ratio (MLR) for
-globin, β-globin, and
-globin (an embryonic form of globin with 50% homology to β-globin). The microarray MLR of a gene is defined as the log10 ratio of the mean of the 2 normalized treatment-channel intensities to the mean of the 2 normalized control-channel intensities. Intensities are normalized by dividing the specific spot intensity by the global biological intensity for the entire chip. The qRT-PCR MLR is defined as the log10 of the specific abundance divided by the mean of all the abundance data for the experiment. Although the microarray and qRT-PCR data show good correlation for all 3 globins (R2
= 0.82; R2β = 0.75; R2
= 0.81), the higher slope, m, of the trend line for
globin (m
= 6.2; mβ = 10.1; m
= 15.5) indicates a larger dynamic range for the MLR. The shallower slopes for
-globin and β-globin were due to saturation of the respective probes. From the data in Fig. 2
, we conclude that qRT-PCR of HBE12
(hemoglobin, epsilon 1) MLR data can be used to assess the likelihood of globin transcript interference in a sample set.
creating and mitigating the interference by synthetic globin rna
To characterize the interference further and assess potential mitigation technologies, we developed a means to reliably recreate globin RNA interference by titrating synthesized globin gene transcripts into a background of total RNA extracted from pooled samples of whole blood. We measured the ratio of
-globin mRNA to β-globin mRNA for donor sets A and B via qRT-PCR analysis of total RNA. For each donor, we measured a consistent
-globin/β-globin mRNA ratio of 3:1 [for example, the mean
-globin/β-globin mRNA ratio for the data in Fig. 2C
was 74.6:25.4 (n = 17), CV
= 3%, and CVβ = 9%]. We used this ratio for our globin mRNA–supplementation experiments. To the same parent pool of total RNA, we added synthesized globin RNA to concentrations of 0, 10, 25, 40, and 70 mg/g total RNA (sample set C). For simplicity, we refer to the experimental points by the supplementation amounts. Our intent was to create substantial interference because the highest proportion of endogenous globin RNA that we had previously encountered was 65 mg/g of the total RNA (data not shown). We also wanted to ensure large variation in the globin RNA content within a sample set.
For sample set C, we divided each sample into 2 aliquots, one of which we treated with GLOBINclear and the other we left untreated. BioAnalyzer traces with the RNA 6000 Pico LabChip (Agilent Technologies) showed that GLOBINclear effectively eliminates the globin RNA peak at all supplementation levels (see Fig. 3 in the online Data Supplement). Furthermore, a qRT-PCR analysis of total RNA before and after treatment demonstrates a reduction of representation in all samples (Table 1
). In particular, 97% (
= 0.23%) of the globin in the sample supplemented with globin mRNA at 70 mg/g was removed from total RNA. Absolute representation was reduced for all globin RNA additions by at least 94% of the original content. Not only are the magnitudes of globin mRNA representation lowered overall, but the difference between the samples with highest and lowest globin mRNA content (
-globin plus β-globin) was also greatly reduced after GLOBINclear treatment.
|
We then amplified treated and untreated samples for 2 independent self-referenced pool hybridization plans (Fig. 1B
). We quantified the globin transcript content of the cRNA produced, including the self-referenced pools, with the same globin transcript assays used with total RNA. Table 1
shows that the amplification product increases as the amount of added globin RNA increases. In addition, the self-referenced pool has the approximate mean globin transcript content (approximately 25 mg/g) expected from mixing the total set of supplemented samples together. We conclude from these data that GLOBINclear-treated samples have substantially reduced globin message, both in the starting total RNA and in the subsequent cRNA product. We hybridized samples according to a self-referenced pool scheme. We found in array hybridization data (Fig. 3
) that a 1-D agglomerative cluster for the untreated controls (left heat map) demonstrated globin message interference; that is, a large signature (>1500 genes, P <0.01) correlated with A-poor/GC-rich probes and a corresponding normalization effect. The magnitudes and directions of the interference signatures correlate with the supplement quantity. A comparison with the GLOBINclear-treated sample (right heat map) demonstrates that the interference is essentially eliminated. Whereas the mean number of signatures per supplement for the untreated sample panel was 1126, the GLOBINclear-treated sample set had a mean of 210 signatures, a difference that is statistically significant (P <0.01). The remnant GLOBINclear signatures do not correlate with the A-poor/GC-rich probes and represent <1% of the total features differentially detected.
|
mitigation of interference by endogenous globin transcripts
To quantify mitigation strategies for a traceable endogenous signature, we created sample sets D and E. We supplemented donor total RNA with brain total RNA (0, 0.5, and 5.0 mg/g) and then treated this sample set with GLOBINclear or not. In addition, we created sample set E (see Materials and Methods) as a reference for globin transcript mitigation. We processed each sample set as depicted in Fig. 1C
, with the modification that the reference pool was not supplemented with brain RNA. This strategy induced a signature in total RNA from blood.
Fig. 4
is a combined 1-D agglomerative cluster with experiments organized first into 3 groups by treatment (no mitigation, GLOBINclear, PBMCs) and then by HBE1 MLR, according to the no-mitigation protocol. The cross-hybridization and normalization effects are marked in the figure. Without mitigation, there is substantial interference when the data are ordered by HBE1 expression. GLOBINclear treatment significantly reduces both globin message and normalization interferences to background levels (from approximately 1200 signatures to 0; P = 0.01).
|
Next, we identified a brain signature (18)(19) (Fig. 4
inset). We used ROC curve analysis to measure the impact of GLOBINclear treatment on sensitivity and specificity, relative to the brain signature (Fig. 5
). We specifically compared nontreated samples with GLOBINclear-treated samples and PBMC samples with respect to the 0.5-mg/g brain signature. The sensitivity and specificity of GLOBINclear approach those obtained with PBMCs and show a nominal improvement over no treatment. Each curve in the figure represents the mean of 36 arrays (9 samples, 4 arrays) for a given sample condition.
|
| Discussion |
|---|
|
|
|---|
| Acknowledgments |
|---|
Financial Disclosures: None declared.
Acknowledgments: We acknowledge Robert Rosler, Jennifer Garnett, Jaime Forbes, Lori Roadcap, and Bin Li for technical support, Mark Parrish for critical reviews of the manuscript, and the Gene Expression Laboratory for sample processing and data generation.
| Footnotes |
|---|
2 Human genes: HBE1, hemoglobin, epsilon 1. ![]()
| References |
|---|
|
|
|---|

CT method. Methods 2001;25:402-408.[CrossRef][Web of Science][Medline]
[Order article via Infotrieve]The following articles in journals at HighWire Press have cited this article:
![]() |
H. P.Y. Fan, C. Di Liao, B. Y. Fu, L. C.W. Lam, and N. L.S. Tang Interindividual and Interethnic Variation in Genomewide Gene Expression: Insights into the Biological Variation of Gene Expression and Clinical Implications Clin. Chem., April 1, 2009; 55(4): 774 - 785. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Kohlmann, E. Haschke-Becher, B. Wimmer, A. Huber-Wechselberger, S. Meyer-Monard, H. Huxol, U. Siegler, M. Rossier, T. Matthes, M. Rebsamen, et al. Intraplatform Reproducibility and Technical Precision of Gene Expression Profiling in 4 Laboratories Investigating 160 Leukemia Samples: The DACH Study Clin. Chem., October 1, 2008; 54(10): 1705 - 1715. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |