Clinical Chemistry
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Clinical Chemistry 54: 249-255, 2008. First published December 10, 2007; 10.1373/clinchem.2007.099366
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
clinchem.2007.099366v1
54/2/249    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Web of Science (3)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Ridker, P. M
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Ridker, P. M
(Clinical Chemistry. 2008;54:249-255.)
© 2008 American Association for Clinical Chemistry, Inc.


Special Reports

Rationale, Design, and Methodology of the Women’s Genome Health Study: A Genome-Wide Association Study of More Than 25 000 Initially Healthy American Women

Paul M Ridker1,2,a, Daniel I. Chasman1,2, Robert Y.L. Zee1,2, Alex Parker3, Lynda Rose1, Nancy R. Cook1,2, Julie E Buring1,2 for the Women’s Genome Health Study Working Group

1 Center for Cardiovascular Disease Prevention, Brigham and Women’s Hospital, Boston; 2 Donald W Reynolds Center for Cardiovascular Research, Harvard Medical School, Boston; 3 Amgen, Inc, Cambridge, MA.

aAddress correspondence to this author at: Center for Cardiovascular Disease Prevention, Brigham and Women’s Hospital, 900 Commonwealth Ave East, Boston, MA 02215. Fax 617 734-1508; e-mail pridker{at}partners.org.


   Abstract
Top
Abstract
Introduction
cohort assembly and prospective...
endpoint validation
baseline blood collection,...
dna extraction and genotyping...
participant confidentiality
statistical considerations and...
relationship of the wghs...
References
 
The primary aim of the Women’s Genome Health Study (WGHS) is to create a comprehensive, fully searchable genome-wide database of >360 000 single nucleotide polymorphisms among at least 25 000 initially healthy American women participating in the ongoing NIH-funded Women’s Health Study (WHS). These women have already been followed over a 12-year period for major incident health events including but not limited to myocardial infarction, stroke, cancer, diabetes, osteoporosis, venous-thromboembolism, cognitive decline, and common visual disorders such as age- related macular degeneration and cataracts. Investigations within the WGHS will seek to identify relevant patterns of genetic polymorphism that predict future disease states in otherwise healthy American women, and to evaluate patterns of genetic polymorphism that relate to multiple intermediate phenotypes including blood-based determinants of disease that were measured at baseline for each study participant. By linking genome-wide data to the existing epidemiologic databank of the parent WHS, which includes comprehensive dietary, behavioral, and traditional exposure data on each participant since cohort inception in 1992, the WGHS will also allow exploration of gene-environment and gene-gene interactions as they relate to incident disease states. Thus, with continued follow-up of the WHS, the WGHS provides a unique scientific resource—a full-cohort, prospective, genome-wide association study among initially healthy American women.


   Introduction
Top
Abstract
Introduction
cohort assembly and prospective...
endpoint validation
baseline blood collection,...
dna extraction and genotyping...
participant confidentiality
statistical considerations and...
relationship of the wghs...
References
 
With advances in our understanding of the genetic basis of human disease, it has become apparent that the underlying causes of many chronic disorders are multifactorial and involve a complex interplay between acquired and inherited risk factors. With the advent of genome-wide scanning technologies, it is now possible to ascertain information on most of the common genetic variations in individual patients. Obtaining this genome-wide information in well-characterized patient groups with or at risk for disease is a critical step in moving toward a genome-based practice of medicine that not only provides insight into the root causes of disease, but also forms the basis for discovery of new and specific targets for drug therapy. Genome-based medicine may also fulfill the promise of personalized medicine and provide the means to implement patient-specific preventive programs years in advance of clinical symptoms (1)(2)(3).

A favored analytic approach for such discovery is the genome-wide association study (GWAS)1 in which genetic variation across the human genome is compared between patients with different disease states or different risk-factor profiles. Success in GWAS requires a comprehensive knowledge of genome-wide variation and linkage disequilibrium patterns, the availability of dense genotyping chip sets containing several hundred thousand single-nucleotide polymorphisms (SNPs), and the availability of large, well-phenotyped patient populations (4)(5)(6)(7)(8)(9)(10). Appropriate patient populations can take the form of retrospective case-control studies (in which patients with and without existing disease are compared for genetic variation), prospective nested case-control studies (in which incident cases and matched controls who remain free of disease are selected from within an ongoing prospective cohort), or cross-sectional family-based studies (in which affected and unaffected parents and children are evaluated across generations).

A potentially more powerful approach to GWAS is the large-scale prospective cohort study, in which initially healthy individuals are followed over long time periods and assessed for the development and all members of the cohort undergo comprehensive genotyping. Such full-ascertainment prospective cohort studies have the advantage of avoiding bias in the selection of case and control subjects and enable simultaneous evaluation of a large number of environmental exposures and potential disease states in an epidemiologically efficient manner. Large-scale prospective cohort studies are also an optimal setting in which to evaluate gene-gene and gene-environment exposures likely to be of interest for complex disorders such as cardiovascular disease, stroke, diabetes, and cancer for which substantive environmental determinants are known. Unlike retrospective case-control or prospective nested case-control study designs, the full-cohort approach also allows the investigation of different diseases simultaneously, can easily include future cases in analyses without concern for ascertainment bias, reduces laboratory variability because the full cohort is genotyped at the same time and in random order, and markedly improves the ability to evaluate gene-environment interactions when environmental exposure is rare. The disadvantages of this approach are the greater expense of cohort assembly and baseline genotyping, as well as the need for comprehensive and ongoing long-term follow-up and endpoint ascertainment. Decade or longer follow-up periods are typically required in prospective cohort settings to allow for sufficient accrual of incident disease states.

The Women’s Genome Health Study (WGHS) is an ongoing prospective cohort GWAS that derives from the NIH-funded Women’s Health Study (WHS) and includes more than 25 000 initially healthy women who have already been followed for more than 12 years for the development of common disorders such as myocardial infarction, stroke, cancer, venous thromboembolism, diabetes, osteoporosis, cognitive decline, and common visual disorders such as age-related macular degeneration and cataracts. Because each WGHS participant is also a WHS participant, full epidemiologic data on a broad range of behavioral, dietary, and environmental risk exposures are available for the study population. In addition, each WGHS participant was included in the parent WHS and provided a baseline blood sample that was already evaluated for multiple disease biomarkers including total, HDL, and LDL cholesterol, triglycerides, apolipoprotein A-I, apolipoprotein B100, lipoprotein, homocysteine, high-sensitivity C-reactive protein (hsCRP), soluble intercellular adhesion molecule type-1 (sICAM-1), fibrinogen, creatinine, and hemoglobin A1c. Each baseline blood sample also had genomic DNA extracted and is now undergoing genotyping for more than 360 000 single-nucleotide polymorphisms (SNPs) using the HapMap-based Human-Hap300 Duo-plus BeadChip platform.

In this report we describe the WGHS and its parent WHS from the perspectives of cohort assembly, follow-up, endpoint validation, baseline plasma phenotyping, DNA extraction, genotyping, participant confidentiality, power and sample size and discuss the WGHS in context with other ongoing GWAS being performed in related areas.


   cohort assembly and prospective follow-up
Top
Abstract
Introduction
cohort assembly and prospective...
endpoint validation
baseline blood collection,...
dna extraction and genotyping...
participant confidentiality
statistical considerations and...
relationship of the wghs...
References
 
All members of the WGHS cohort were participants in the WHS who provided an adequate baseline blood sample for plasma and DNA analysis and who gave consent for blood-based analyses and long-term follow-up.

The WHS was initiated in 1992 to evaluate the balance of benefits and risks of low-dose aspirin and vitamin E in the primary prevention of cardiovascular disease and cancer in women (1)(2)(3)(4)(5)(6)(7)(8)(9)(10)(11)(12)(13)(14). Since its inception, the study has been continuously funded by the National Heart, Lung, and Blood Institute and the National Cancer Institute, with study agents provided by Bayer and the Natural Source Vitamin E Association.

Between September 1992 and May 1995, letters of invitation to participate in the WHS were sent to more than 1.7 million US female health professionals; 453 787 women completed the questionnaires, with 65 169 initially willing and eligible to enroll. Women were eligible if they were 45 years old or older; had no history of coronary heart disease, cerebrovascular disease, cancer (except nonmelanoma skin cancer), or other major chronic illness; had no history of side effects to study medications; were not taking aspirin or nonsteroidal antiinflammatory medications more than once a week; were not taking anticoagulants or corticosteroids; and were not taking individual supplements of vitamin A, E, or β-carotene more than once a week. Eligible women were enrolled in a 3-month run-in period of placebo administration to identify a group likely to be compliant with long-term treatment and follow-up. A total of 39 876 women were willing, eligible, and compliant during the run-in phase and were randomized in a 2 x 2 factorial design to 1 of 4 treatment groups: active aspirin (100 mg orally every other day) and vitamin E placebo, aspirin placebo and active vitamin E (600 IU orally every other day), both active agents, or both placebos.

Annually during the trial period, WHS participants were sent a 1-year supply of monthly calendar packs containing active agents or placebo, as well as questionnaires seeking information about compliance, side effects, the occurrence of relevant clinical endpoints, risk factors, and a comprehensive food frequency questionnaire. Study medications and endpoint ascertainment were continued in a blinded fashion through the scheduled end of the trial; randomized follow-up was completed in February 2005. At that time, rates of follow-up with respect to morbidity and mortality were 97.2% complete and 99.4% complete, respectively. The primary findings of the WHS with regard to the effects of aspirin and vitamin E on the primary trial endpoints of cardiovascular disease and cancer were presented in 2005 (12)(13)(14). Follow-up of the WHS cohort has continued without interruption since that time and is ongoing with 98% participation rates.

The WGHS cohort described here comprises 28 345 (70.6%) of the 39 876 WHS participants who provided a baseline blood sample adequate for plasma and DNA analysis before randomization and consented to ongoing analyses linking blood-derived observations with baseline risk factor profiles and incident disease events. The WHS trial and cohort follow-up, as well as the WGHS, were approved by the institutional review board of Brigham and Women’s Hospital, Boston, MA, and monitored by an external data and safety monitoring board.


   endpoint validation
Top
Abstract
Introduction
cohort assembly and prospective...
endpoint validation
baseline blood collection,...
dna extraction and genotyping...
participant confidentiality
statistical considerations and...
relationship of the wghs...
References
 
Since study enrollment, all WHS participants have been followed prospectively for the occurrence of common clinical outcomes. For the primary trial endpoints of cardiovascular disease and cancer, full medical records are obtained for reported endpoints and reviewed by an endpoints committee of physicians unaware of randomized treatment assignment.

WHO criteria are used to confirm the occurrence of myocardial infarction on the basis of symptoms and associated abnormal concentrations of cardiac enzymes or diagnostic electrocardiograms. Stroke is confirmed if the participant has a new neurologic deficit of sudden onset that persists for >24 h. Clinical information as well as computed tomographic scans or MRI are used to distinguish hemorrhagic from ischemic events. Cardiovascular deaths are confirmed by autopsy reports, death certificates, medical records, and information obtained from family members. Reports of coronary revascularization procedures (bypass surgery or percutaneous coronary angioplasty) are confirmed by record review. Transient ischemic attacks are confirmed if the neurologic deficit of sudden onset lasted for <24 h. The diagnosis of deep-vein thrombosis is confirmed by a positive venous ultrasonography or venography report, whereas the diagnosis of pulmonary embolism is confirmed by a positive angiogram or computed tomography scan of the chest, or a ventilation-perfusion scan with 2 or more mismatched defects. Deaths due to pulmonary embolism are confirmed when autopsy reports, symptoms, circumstances, and medical history are consistent with this diagnosis.

Cancers are confirmed on the basis of pathologic or cytology reports (96.8%) or, rarely, based on strong clinical and radiological or laboratory marker evidence (e.g., increased CA-125) when a pathology or cytology review was not conducted. All cancers are coded for site, type, and when available, metastatic spread.

Additional endpoints ascertained in the WHS include the occurrence of diabetes, incident hypertension, bone fracture, osteoporosis, cognitive decline, peripheral arterial disease, colonic polyps, and common visual disorders such as age-related macular degeneration and cataracts. The methods used for validation of these endpoints are described elsewhere (15)(16)(17)(18).


   baseline blood collection, processing, storage, and plasma phenotyping
Top
Abstract
Introduction
cohort assembly and prospective...
endpoint validation
baseline blood collection,...
dna extraction and genotyping...
participant confidentiality
statistical considerations and...
relationship of the wghs...
References
 
All participants in the WGHS provided baseline blood samples collected in EDTA and citrate that were shipped overnight in cooled packaging to a central storage facility where they were separated into plasma and buffy-coat fractions, divided into aliquots, and stored in liquid nitrogen.

Funding from the Donald W. Reynolds Foundation (Las Vegas, NV) (19) has enabled biomarker analysis of each plasma sample in a core laboratory certified by the National Heart, Lung, and Blood Institute/CDC Lipid Standardization program. Concentrations of total cholesterol (TC) and HDL-C were measured enzymatically on a Hitachi 911 autoanalyzer (Roche Diagnostics) with day-to-day reproducibility of 1.36% and 1.07% for TC concentrations of 129.8 and 277.2 mg/dL, respectively, (throughout this report, concentrations and units given are those reported in the referenced sources) and of 1.98% and 2.68% for HDL-C concentrations of 35 and 55 mg/dL, respectively. LDL-C was determined directly (Genzyme) with reproducibility of 2.16% and 1.98% for concentrations of 76.2 and 148.7 mg/dL, respectively. Apolipoprotein-B100 and apolipoprotein-A-I were measured by an immunoturbidimetric technique, also on the Hitachi 911 analyzer. These assays employed the WHO/IFCC standards, and a validation study with those used at the Northwest Lipid Research Laboratory revealed a correlation coefficient of 0.98, intercept of 0.26 mg/dL, and slope of 0.97 for apoliporotein-B100, and correlation coefficient of 0.99, intercept of 0.264 mg/dL, and a slope of 1.0 for apoliporotein-A-I (20). Reproducibility was 3.68% and 2.95% for apolipoprotein-A-1 concentrations of 56.4 and 164.2 mg/dL, respectively, and 4.94% and 4.13% for apolipoprotein-B100 concentrations of 49.7 and 146.3 mg/dL, respectively. Triglycerides were measured enzymatically, with correction for endogenous glycerol, using a Hitachi 917 analyzer and reagents and calibrators from Roche Diagnostics; reproducibility was 1.52% and 1.49% for triglyceride concentrations of 82.5 and 178.8 mg/dL, respectively (21). In addition, full nuclear MR-based lipoprotein profiling is available on the full study cohort (LipoScience).

High-sensitivity C-reactive protein (hsCRP) was measured using a validated immunoturbidimetric method (Denka Seiken) with reproducibility of 2.16% and 3.34% for hsCRP concentrations of 1.94 and 11.42 mg/L, respectively (22). Lp(a) was evaluated with an apo(a)-independent assay with reproducibility of 2.47% and 1.45% for lipoprotein concentrations of 18.5 and 53.3 mg/dL, respectively (23). Homocysteine was determined enzymatically (Catch) with reproducibility of 4.72% and 3.06% at concentrations of 6.0 and 13.3 µmol/L, respectively and hemoglobin A1c was measured using turbidimetric immunoinhibition directly from packed red blood cells (Roche Diagnostics) with reproducibility of 3.63% and 3.77% at levels of 5.2% and 8.8%, respectively (24). Creatinine was measured by a rate-blanked method based on the Jaffe reaction using Roche Diagnostics reagents with reproducibility of 3.67% and 1.60% at concentrations of 1.17 and 6.40 mg/dL, respectively. Fibrinogen was measured by a mass-based immunoturbidimetric assay (DiaSorin) with reproducibility of 5.20% and 3.99% at concentrations of 99.1 and 273.7 mg/dL, respectively (25). Finally, sICAM was measured by quantitative sandwich ELISA (R&D Systems) with reproducibility of 8.89% and 6.39% at concentrations of 171.8 and 289.1 µg/L, respectively (26).

Of samples received by the core laboratory, 27 748 (98%) underwent successful evaluation for all biomarkers.


   dna extraction and genotyping procedures
Top
Abstract
Introduction
cohort assembly and prospective...
endpoint validation
baseline blood collection,...
dna extraction and genotyping...
participant confidentiality
statistical considerations and...
relationship of the wghs...
References
 
Genomic DNA extraction was performed on buffy-coat samples obtained at baseline from each participant (made possible by funding from Roche Diagnostics, the Doris Duke Charitable Foundation, and the Leducq Foundation). The MagNA Pure LC System (Roche Molecular Biochemicals) based on magnetic bead technology was used according to manufacturer’s specifications to perform all DNA isolation steps. The integrity of the isolated DNA was checked randomly on 1% agarose gel together with molecular weight marker III (Roche Molecular Biochemicals). DNA yields were calculated from the OD260 nm measurement, and purity assessed by calculating the ratio of OD260 nm to OD280 nm (24).

SNP genotyping of these DNA samples is performed using the Illumina Infinium II assay (27) to query a genome-wide set of 315 176 haplotype-tagging SNP markers (the Human HAP300 panel) (28). We added to this a focused panel of 45 882 missense and haplotype-tagging SNPs selected to enhance coverage of genomic regions in which we have a strong a priori interest owing to presence of genes believed to be of relevance to cancer as well as metabolic, cardiovascular, and inflammatory diseases (Human HAP300 Duo-plus). DNA samples are genotyped in batches of 95 WGHS participants with 1 CEPH (Centre d’Etude de Polymorphism Humain) DNA (NA10846) included to monitor genotyping consistency and plate orientation. Genotyping reactions use 750 µg of genomic DNA where possible, although in some cases successful genotyping has been performed with as little as 45 µg of DNA. The Infinium II process was implemented using Illumina Infinium Robot Control software and monitored using the Illumina Infinium laboratory information management system. The hardware platform consists of 4 Tecan EVO liquid-handling robots, 8 hybridization ovens, 3 Illumina BeadStation confocal scanners, and dual-processor workstations with access to >1 TB of disk array storage to monitor workflow and generate high-quality reduced data. Genotype calls are generated and subjected to quality control using Illumina BeadStudio v3.1 software.


   participant confidentiality
Top
Abstract
Introduction
cohort assembly and prospective...
endpoint validation
baseline blood collection,...
dna extraction and genotyping...
participant confidentiality
statistical considerations and...
relationship of the wghs...
References
 
As in the WHS, participant confidentiality in the WGHS is maintained throughout all aspects of each study. Investigators within the WGHS have no access to any direct patient identification information; these data are held confidentially by staff members of the WHS who are involved in patient contact and follow-up, but not in any data analysis or interpretation. Separate data files are kept for participant clinical covariate and endpoint data, plasma phenotyping data, and genomic data. Blood samples sent to the plasma phenotyping laboratory and to the genetic laboratories are labeled only with a sample ID number that cannot be tracked by laboratory personnel to any patient identification variables or to any clinical covariate data. All GWAS data included in the WGHS are maintained on a separate and fully protected computer system that is isolated and distinct from computing systems used for the parent study. A unique and fully distinct participant ID number is used in the WGHS, making direct linkage to the WHS for scientific investigators impossible. WGHS participants are provided a mechanism to withdraw consent for any reason and at any time. Should individual consent be withdrawn, any remaining stored blood samples, plasma, buffy coat, and DNA traceable to that participant are destroyed. These procedures have proven highly effective for the protection of patient confidentiality; with 300 000 person years of follow-up accrued to date, no breach of these procedures has ever led to the inadvertent unmasking of any participant’s identity.


   statistical considerations and power for the wghs
Top
Abstract
Introduction
cohort assembly and prospective...
endpoint validation
baseline blood collection,...
dna extraction and genotyping...
participant confidentiality
statistical considerations and...
relationship of the wghs...
References
 
Controversy exists within the genetic epidemiology community regarding the most appropriate methods to analyze data that derive from GWAS. As described in detail elsewhere, the complexity inherent in measuring up to 360 000 SNPs in each of more than 25 000 study participants within the WGHS will raise issues of data processing and quality control, as well as fundamental issues regarding correction for multiple hypothesis testing (3)(4)(29)(30).

With regard to data processing and quality control, before performing any genetic analyses, all SNPs within the WGHS are evaluated for high call rates and the percentage of missing SNPs for each individual calculated. For SNPs with adequate data, Hardy-Weinberg disequilibria are evaluated to identify potential genotyping errors. We also compare the Illumina-based SNP data for each individual participant for a panel of approximately 70 common SNPs that have previously been ascertained in the WHS population using alternative genotyping technologies; this step is used as a secondary check to ensure accurate specimen labeling before any analyses. Finally, we use principal-component analysis to examine the data for any evidence of population stratification.

With regard to issues of multiple hypothesis testing, initial analyses within the WGHS will seek to define those relationships between individual SNPs, haplotypes, or genetic pathways that determine either incident clinical events or intermediate phenotypic traits. For all initial analyses, the WGHS will follow recent guidelines (29)(30) in which the P-value for genome-wide significance is predetermined to be at a level of 10–7 or smaller, a conservative approach consistent with Bonferroni correction. For this approach, sample size and power for the WGHS to reach the level of statistical significance are presented in Fig. 1 for clinical events such as myocardial infarction or diabetes and in Fig. 2 for continuous intermediate phenotypes such as HDL cholesterol. As shown, the large size of the WGHS provides more than adequate power at a genome-wide level of significance for clinical endpoints with 500 or more incident events, as well as extremely high power to detect genetic differences on an additive model for intermediate phenotypes for which the polymorphism of interest explains as little as 0.15% of the variance (see Fig. legends for detail). Compared to smaller GWAS already underway, the WGHS is well positioned to address gene-gene and gene-environment interactions across a wide range of clinical outcomes and environmental exposures.


Figure 1
View larger version (32K):
[in this window]
[in a new window]

 
Figure 1. Estimates of power in the WGHS for clinical endpoints.

The large size of the WGHS provides a unique opportunity to identify true genetic associations by virtue of ample power for statistical significance at the genome wide level (P value approximately 10–7). For incident disease, power is related to the magnitude of the effect of the minor allele of a polymorphism expressed as a relative risk, and its frequency in the population. For example, for a common disorder such as osteoporosis, for which >3500 cases will occur in the WHS, there is at least 80% power at the genome-wide level for odds-ratios of 1.3 or greater (0.75 or less) associated with the minor allele of polymorphisms with minor allele frequency of about 10% or greater. For a less common disorder such as myocardial infarction, for which approximately 500 cases will occur, there is 80% power at the genome-wide level for genetic effects, with relative risk of about 1.6 or greater (0.6 or less) for the minor allele.


Figure 2
View larger version (29K):
[in this window]
[in a new window]

 
Figure 2. Estimates of power in the WGHS for continuous intermediate phenotypes.

For quantitative traits such as body mass index or plasma biomarkers such as HDL cholesterol or C-reactive protein, power is closely related to the proportion of variance explained by a given SNP (thin lines). In the WGHS, SNPs explaining as little as 0.15% of the variance will have 80% power at the genome wide level (P value approximately 10–7). Thus, for a minor allele frequency of approximately 10%, this effect corresponds to a normalized shift in the mean value of the quantitative trait due to the minor allele of approximately 0.08 (β in Fig.), or for the case of LDL-C, about 2.7 mg/dL.


   relationship of the wghs to other gwas
Top
Abstract
Introduction
cohort assembly and prospective...
endpoint validation
baseline blood collection,...
dna extraction and genotyping...
participant confidentiality
statistical considerations and...
relationship of the wghs...
References
 
The sample size and epidemiologic scope make the WGHS a unique genetic resource. In addition, multiple other GWAS are underway with similar goals. Thus, through collaborative networks of investigators, observations made in one GWAS can be tested in multiple other settings, a crucial issue for generalizability and estimating population-specific attributable risks. Presentation standards for genotype–phenotype association studies that will allow such replication have recently been enumerated by National Cancer Institute/National Human Genome Research Institute Working Group on Replication in Association Studies and are structurally integrated into the WGHS (31). Examples of other ongoing GWAS with aims similar to those of the WGHS include the federally funded Framingham SNP Health Association Resource of approximately 9000 residents of Framingham, MA, as well as GWAS data being generated from unique populations such as the Atherosclerotic Risk in Communities study, the Jackson Heart Study, the MultiEthnic Study of Atherosclerosis, and the Cardiovascular Health Study. To encourage collaborative work, investigators within the WGHS have participated in the NIH-funded Genetic Analysis Information Network as well as in the Pharmacogenetics Research Network and the Pharmacogenomics and Risks of Cardiovascular Disease collaboration. The goals of the WGHS also parallel those of several international GWAS programs including the Wellcome Trust Case Control Consortium of 14 000 cases of 7 common diseases and 3000 shared controls, and the DeCode Genetics programs in Iceland (4)(9).


   Acknowledgments
 
Grant/funding Support: The WGHS is a collaborative study supported by both federal and nonfederal entities. Primary support for the underlying WHS derives from investigator-initiated RO1 grants from the National Heart Lung and Blood Institute and the National Cancer Institute (Bethesda, MD) (J.E.B., PI). Primary support for the plasma phenotyping of all WGHS participants was supported by investigator-initiated grants from the Donald W Reynolds Foundation, Las Vegas, NV (P.M.R., PI). The process of DNA extraction was supported by investigator-initiated grants from the Doris Duke Charitable Foundation, NY, NY (P.M.R., PI) and the Fondation Leducq (Paris, France) (P.M.R., PI). The genome-wide scans using the Illumina based Human-Hap300 BeadChip technology are being conducted by Amgen (Cambridge, MA). Primary financial support for the genome-wide scans is provided by Amgen, with additional financial support provided by the National Heart, Lung, and Blood Institute, the Donald W. Reynolds Foundation, and the Leducq Foundation.

Financial Disclosures: Alex Parker is an employee of Amgen, Inc.

Acknowledgments: The WGHS Investigators are indebted to Joseph P. Miletich for his foresight in understanding the role of genetics in common diseases affecting women, to the staff of the Women’s Health Study, and to the dedicated and conscientious women who are participating in this study.


   Footnotes
 
1 Nonstandard abbreviations: GWAS, genome-wide association study; WGHS, Women’s Genome Health Study; WHS, Women’s Health Study; SNP, single-nucleotide polymorphism.


   References
Top
Abstract
Introduction
cohort assembly and prospective...
endpoint validation
baseline blood collection,...
dna extraction and genotyping...
participant confidentiality
statistical considerations and...
relationship of the wghs...
References
 

  1. Christensen K, Murray JC. What genome-wide association studies can do for medicine. N Engl J Med 2007;356:1094-1097.[Free Full Text]
  2. Lander ES. The new genomics: global views of biology. Science (Wash DC) 1996;274:536-539.[Free Full Text]
  3. Hirschorn JN, Daly MJ. Genome-wide association studies for common disease and complex traits. Nature (Lond) Rev Genet 2005;6:95-108.
  4. . Wellcome Trust Case Control Consortium. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature (Lond) 2007;447:661-678.[CrossRef][Web of Science][Medline] [Order article via Infotrieve]
  5. Scott LJ, Mohlke KL, Bonnycastle LL, Willer CJ, Li Y, Duren WL, et al. A genome-wide association study of type 2 diabetes in Finns detects multiple susceptibility variants. Science (Wash DC) 2007;316:1341-1345.[Abstract/Free Full Text]
  6. Saxena R, Voight BF, Lyssenko V, Burtt NP, de Bakker PI, Chen H, et al. Genome-wide association analysis identifies loci for type 2 diabetes and triglyceride levels. Science (Wash DC) 2007;316:1331-1336.[Abstract/Free Full Text]
  7. Samani NJ, Erdmann J, Hall AS, Hengstenberg C, Mangino M, Mayer B, et al. Genome-wide association analysis of coronary artery disease. N Engl J Med 2007;357:443-453.[Abstract/Free Full Text]
  8. McPherson R, Pertsemlidis A, Kavaslar N, Stewart A, Roberts R, Cox DR, et al. A common allele on chromosome 9 associated with coronary heart disease. Science (Wash DC) 2007;316:1488-1491.[Abstract/Free Full Text]
  9. Helgadottir A, Thorleifsson G, Manolescu A, Gretarsdottir S, Blondal T, Jonasdottir A, et al. A common variant on chromosome 9p21 affects the risk of myocardial infarction. Science (Wash DC) 2007;316:1491-1493.[Abstract/Free Full Text]
  10. . The International HapMap Consortium. A haplotype map of the human genome. Nature (Lond) 2005;437:1299-1320.[CrossRef][Medline] [Order article via Infotrieve]
  11. Rexrode KM, Lee I-M, Cook NR, Hennekens CH, Buring JE. Baseline characteristics of participants in the Women’s Health Study. J Womens Health Gend Based Med 2000;9:19-27.[CrossRef][Web of Science][Medline] [Order article via Infotrieve]
  12. Ridker PM, Cook NR, Lee I-M, Gordon D, Gaziano JM, Manson JE, et al. A randomized trial of low-dose aspirin in the primary prevention of cardiovascular disease in women. N Engl J Med 2005;352:1293-1304.[Abstract/Free Full Text]
  13. Lee I-M, Cook NR, Gaziano JM, Gordon D, Ridker PM, Manson JE, et al. Vitamin E in the primary prevention of cardiovascular disease and cancer. The Women’s Health Study: a randomized controlled trial. JAMA 2005;294:56-65.[Abstract/Free Full Text]
  14. Cook NR, Lee I-M, Gaziano JM, Gordon D, Ridker PM, Manson JE, et al. Low-dose aspirin in the primary prevention of cancer. The Women’s Health Study: a randomized controlled trial. JAMA 2005;294:47-55.[Abstract/Free Full Text]
  15. Pradhan AD, Manson JE, Rifai N, Buring JE, Ridker PM. C-reactive protein, interleukin 6, and risk of developing type 2 diabetes mellitus. JAMA 2001;286:327-334.[Abstract/Free Full Text]
  16. Sesso HD, Buring JE, Rifai N, Blake GJ, Gaziano JM, Ridker PM. C-reactive protein and the risk of developing hypertension. JAMA 2003;290:2945-2951.[Abstract/Free Full Text]
  17. Kang JH, Cook N, Manson J, Buring JE, Grodstein F. Low-dose aspirin and cognitive function in the Women’s Health Study cognitive cohort. BMJ 2007;12(334(7601)):987[e-pub 2007 Apr 27].
  18. Pradhan AD, Shrivastava S, Cook NR, Rifai N, Creager MA, Ridker PM. Symptomatic peripheral arterial disease in women: Nontraditional biomarkers of elevated risk. Circulation 2007 (in press)..
  19. Ridker PM, Buring JE, Rifai N, Cook NR. Development and validation of improved algorithms for the assessment of global cardiovascular risk in women. The Reynolds Risk Score. JAMA 2007;297:611-619.[Abstract/Free Full Text]
  20. Ridker PM, Rifai N, Cook NR, Bradwin G, Buring JE. Non-HDL cholesterol, apolipoproteins A-I and B100, standard lipid measures, lipid ratios, and CRP as risk factors for cardiovascular disease in women. JAMA 2005;294:326-333.[Abstract/Free Full Text]
  21. Bansal S, Buring JE, Rifai M, Sacks FM, Ridker PM. Fasting compared with nonfasting triglycerides and risk of cardiovascular events in women. JAMA 2007;298:309-316.[Abstract/Free Full Text]
  22. Ridker PM, Rifai N, Rose L, Buring JE, Cook NR. Comparison of C-reactive protein and low-density lipoprotein cholesterol levels in the prediction of first cardiovascular events. N Engl J Med 2002;347:1557-1565.[Abstract/Free Full Text]
  23. . SukDanik J, Rifai N, Buring JE, Ridker PM. Lipoprotein (a), measured with an assay independent of apolipoprotein (a) isoform size, and risk of future cardiovascular events among initially healthy women. JAMA 2006;296:1363-1370.[Abstract/Free Full Text]
  24. Zee RY, Mora S, Cheng S, Erlich HA, Lindpaintner K, Rifai N, et al. Homocysteine, 5,10-methylenetetrahydrofolate reductase 677C>T polymorphism, nutrient intake, and incident cardiovascular disease in 24,968 initially healthy women. Clin Chem 2007;53:845-851.[Abstract/Free Full Text]
  25. Mora S, Rifai N, Buring JE, Ridker PM. Additive value of immunoassay-measured fibrinogen and high-sensitivity C-reactive protein levels for predicting incident cardiovascular events. Circulation 2006;114:381-387.[Abstract/Free Full Text]
  26. Ridker PM, Hennekens CH, Buring JE, Rifai N. C-reactive protein and other markers of inflammation in the prediction of cardiovascular disease in women. N Engl J Med 2000;342:836-843.[Abstract/Free Full Text]
  27. Gunderson KL, Steemers FJ, Ren H, Ng P, Zhou L, Tsan C, et al. Whole-genome genotyping. Methods Enzymol 2006;410:359-376.[Medline] [Order article via Infotrieve]
  28. Gunderson KL, Kuhn KM, Steemers FJ, Ng P, Murray SS, Shen R. Whole-genome genotyping of haplotype tag single nucleotide polymorphisms. Pharmacogenomics 2006;7:641-648.[CrossRef][Web of Science][Medline] [Order article via Infotrieve]
  29. Wang YSW, Barratt BJ, Clayton DG, Todd JA. Genome-wide association studies: theoretical and practical concerns. Nature (Lond) Rev Genetics 2005;6:109-118.[CrossRef]
  30. Balding DJ. A tutorial on statistical methods for population association studies. Nature (Lond) Rev Genetics 2006;7:781-791.[CrossRef]
  31. . NCI-NHGRI Working Group on Replication in Association Studies. Replicating genotype-phenotype associations: what constitutes replication of a genotype-phenotype association, and how best can it be achieved?. Nature (Lond) 2007;447:655-660.[CrossRef][Medline] [Order article via Infotrieve]



The following articles in journals at HighWire Press have cited this article:


Home page
J Am Coll CardiolHome page
P. Libby, P. M. Ridker, G. K. Hansson, and for the Leducq Transatlantic Network on Atherothro
Inflammation in Atherosclerosis From Pathophysiology to Practice.
J. Am. Coll. Cardiol., December 1, 2009; 54(23): 2129 - 2138.
[Abstract] [Full Text] [PDF]


Home page
Circ Cardiovasc GenetHome page
A. Dehghan, Q. Yang, A. Peters, S. Basu, J. C. Bis, A. R. Rudnicka, M. Kavousi, M.-H. Chen, J. Baumert, G. D.O. Lowe, et al.
Association of Novel Genetic Loci With Circulating Fibrinogen Levels: A Genome-Wide Association Study in 6 Population-Based Cohorts
Circ Cardiovasc Genet, April 1, 2009; 2(2): 125 - 133.
[Abstract] [Full Text] [PDF]


Home page
Circ Cardiovasc GenetHome page
G. Pare, D. I. Chasman, A. N. Parker, R. R.Y. Zee, A. Malarstig, U. Seedorf, R. Collins, H. Watkins, A. Hamsten, J. P. Miletich, et al.
Novel Associations of CPS1, MUT, NOX4, and DPEP1 With Plasma Homocysteine in a Healthy Population: A Genome-Wide Evaluation of 13 974 Participants in the Women's Genome Health Study
Circ Cardiovasc Genet, April 1, 2009; 2(2): 142 - 150.
[Abstract] [Full Text] [PDF]


Home page
Circ Cardiovasc GenetHome page
J. S. Danik, G. Pare, D. I. Chasman, R. Y.L. Zee, D. J. Kwiatkowski, A. Parker, J. P. Miletich, and P. M Ridker
Novel Loci, Including Those Related to Crohn Disease, Psoriasis, and Inflammation, Identified in a Genome-Wide Association Study of Fibrinogen in 17 686 Women: The Women's Genome Health Study
Circ Cardiovasc Genet, April 1, 2009; 2(2): 134 - 141.
[Abstract] [Full Text] [PDF]


Home page
Circ Cardiovasc GenetHome page
R. Y.L. Zee, R. J. Glynn, S. Cheng, L. Steiner, L. Rose, and P. M Ridker
An Evaluation of Candidate Genes of Inflammation and Thrombosis in Relation to the Risk of Venous Thromboembolism: The Women's Genome Health Study
Circ Cardiovasc Genet, February 1, 2009; 2(1): 57 - 62.
[Abstract] [Full Text] [PDF]


Home page
Circ Cardiovasc GenetHome page
P. M. Ridker, G. Pare, A. N. Parker, R. Y.L. Zee, J. P. Miletich, and D. I. Chasman
Polymorphism in the CETP Gene Region, HDL Cholesterol, and Risk of Future Myocardial Infarction: Genomewide Analysis Among 18 245 Initially Healthy Women From the Women's Genome Health Study
Circ Cardiovasc Genet, February 1, 2009; 2(1): 26 - 33.
[Abstract] [Full Text] [PDF]


Home page
ANN INTERN MEDHome page
N. P. Paynter, D. I. Chasman, J. E. Buring, D. Shiffman, N. R. Cook, and P. M. Ridker
Cardiovascular Disease Risk Prediction With and Without Knowledge of Genetic Variation at Chromosome 9p21.3
Ann Intern Med, January 20, 2009; 150(2): 65 - 72.
[Abstract] [Full Text] [PDF]


Home page
Clin. Chem.Home page
D. I. Chasman, G. Pare, and P. M Ridker
Population-Based Genomewide Genetic Analysis of Common Clinical Chemistry Analytes
Clin. Chem., January 1, 2009; 55(1): 39 - 51.
[Abstract] [Full Text] [PDF]


Home page
JAMAHome page
T. A. Pearson and T. A. Manolio
How to Interpret a Genome-wide Association Study
JAMA, March 19, 2008; 299(11): 1335 - 1344.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
clinchem.2007.099366v1
54/2/249    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Web of Science (3)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Ridker, P. M
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Ridker, P. M


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS