Abstract
Although the rapid advancements in high throughput genotyping technology have made genome-wide association studies possible, these studies remain an expensive undertaking, especially when considering the large sample sizes necessary to find the small to moderate effect sizes that define complex diseases. It is therefore prudent to utilize all possible information contained in a genome-wide scan. We propose a straightforward analytical approach that tests often unused SNP data without sacrificing statistical validity. We simulate genotype miscalls under a variety of models consistent with observed miscall rates and test for departures from HWE using the standard Pearson's χ2-test. We find that true disease susceptibility loci subjected to various patterns of genotype miscalls can be largely out of HWE and, thus, be candidates for removal before association testing. These loci, we demonstrate, can maintain sufficient statistical power even under extreme error models. We additionally show that random miscalls of null SNPs, independent of the phenotype, do not induce bias in case-control or cohort studies, and we suggest that a significant HWE test should not prevent a SNP from being tested when conducting genome-wide association studies in these scenarios. However, association findings for SNPs that are out of HWE must be treated more carefully than 'regular' findings, for example, by re-genotyping the SNP in the same study using a different genotyping technology.
Original language | English |
---|---|
Pages (from-to) | 1676-1682 |
Number of pages | 7 |
Journal | European Journal of Human Genetics |
Volume | 17 |
Issue number | 12 |
DOIs | |
State | Published - 2009 |
Bibliographical note
Funding Information:We thank three anonymous reviewers for helpful comments and suggestions. This work was supported in part by grant no. HL07427 (to DWF) from the National Institutes of Health, Bethesda, MD, USA.
ASJC Scopus subject areas
- Genetics
- Genetics(clinical)