TY - JOUR
T1 - A data-driven method for identifying rare variants with heterogeneous trait effects
AU - Zhang, Qunyuan
AU - Irvin, Marguerite R.
AU - Arnett, Donna K.
AU - Province, Michael A.
AU - Borecki, Ingrid
PY - 2011/11
Y1 - 2011/11
N2 - Collapsing multiple variants into one variable and testing their collective effect is a useful strategy for rare variant association analysis. Direct collapsing, however, is not valid or may significantly lose power when a group of variants to be collapsed have heterogeneous effects on target traits (i.e. some positive and some negative). This could be especially true for quantitative traits (such as blood pressure and body mass index), regardless of whether subjects are sampled randomly from a population or selectively from two extreme tails of the trait distribution. To deal with this problem, we propose a novel, data-driven method, the P-value Weighted Sum Test (PWST), which allows each variant to be individually weighted according to the evidence of association from the data itself. Specifically, both significance and direction of individual variant effects are used to calculate a single weighted sum score based on rescaled left-tail P-values from single-variant analysis, after which a permutation test of association is performed between the score and the trait. Our simulation under different sampling strategies shows that PWST significantly increases statistical power when there are heterogeneous variant effects. The appeal of the PWST approach is illustrated in an application to sequence data by detecting the collective effect of variants in the peroxisome proliferator-activated receptor alpha (PPARα) gene on triglycerides (TG) response to fenofibrate treatment from 300 subjects in the Genetics of Lipid Lowering and Diet Network study.
AB - Collapsing multiple variants into one variable and testing their collective effect is a useful strategy for rare variant association analysis. Direct collapsing, however, is not valid or may significantly lose power when a group of variants to be collapsed have heterogeneous effects on target traits (i.e. some positive and some negative). This could be especially true for quantitative traits (such as blood pressure and body mass index), regardless of whether subjects are sampled randomly from a population or selectively from two extreme tails of the trait distribution. To deal with this problem, we propose a novel, data-driven method, the P-value Weighted Sum Test (PWST), which allows each variant to be individually weighted according to the evidence of association from the data itself. Specifically, both significance and direction of individual variant effects are used to calculate a single weighted sum score based on rescaled left-tail P-values from single-variant analysis, after which a permutation test of association is performed between the score and the trait. Our simulation under different sampling strategies shows that PWST significantly increases statistical power when there are heterogeneous variant effects. The appeal of the PWST approach is illustrated in an application to sequence data by detecting the collective effect of variants in the peroxisome proliferator-activated receptor alpha (PPARα) gene on triglycerides (TG) response to fenofibrate treatment from 300 subjects in the Genetics of Lipid Lowering and Diet Network study.
KW - Collapsing
KW - Heterogeneous effects
KW - Quantitative trait
KW - Rare variant
KW - Sum test
UR - http://www.scopus.com/inward/record.url?scp=80054750776&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=80054750776&partnerID=8YFLogxK
U2 - 10.1002/gepi.20618
DO - 10.1002/gepi.20618
M3 - Article
C2 - 21818776
AN - SCOPUS:80054750776
SN - 0741-0395
VL - 35
SP - 679
EP - 685
JO - Genetic Epidemiology
JF - Genetic Epidemiology
IS - 7
ER -