Analysis of rare variant population structure in Europeans explains differential stratification of gene-based tests.

Eur J Hum Genet 2014 Sep 8;22(9):1137-44. Epub 2014 Jan 8.

1] Department of Biostatistics and Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, USA [2] Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA.

There is substantial interest in the role of rare genetic variants in the etiology of complex human diseases. Several gene-based tests have been developed to simultaneously analyze multiple rare variants for association with phenotypic traits. The tests can largely be partitioned into two classes - 'burden' tests and 'joint' tests - based on how they accumulate evidence of association across sites. We used the empirical joint site frequency spectra of rare, nonsynonymous variation from a large multi-population sequencing study to explore the effect of realistic rare variant population structure on gene-based tests. We observed an important difference between the two test classes: their susceptibility to population stratification. Focusing on European samples, we found that joint tests, which allow variants to have opposite directions of effect, consistently showed higher levels of P-value inflation than burden tests. We determined that the differential stratification was caused by two specific patterns in the interpopulation distribution of rare variants, each correlating with inflation in one of the test classes. The pattern that inflates joint tests is more prevalent in real data, explaining the higher levels of inflation in these tests. Furthermore, we show that the different sources of inflation between tests lead to heterogeneous responses to genomic control correction and the number of variants analyzed. Our results indicate that care must be taken when interpreting joint and burden analyses of the same set of rare variants, in particular, to avoid mistaking inflated P-values in joint tests for stronger signals of true associations.

Download full-text PDF

Source
http://dx.doi.org/10.1038/ejhg.2013.297DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4135410PMC
September 2014
7 Reads

Publication Analysis

Top Keywords

tests
12
rare variants
12
joint tests
12
gene-based tests
12
higher levels
8
test classes
8
inflation tests
8
differential stratification
8
rare variant
8
population structure
8
variant population
8
rare
6
variants
6
joint
5
consistently higher
4
stratification caused
4
interpopulation distribution
4
distribution rare
4
opposite directions
4
directions consistently
4

References

(Supplied by CrossRef)

A Keinan et al.
Science 2012

M Reppell et al.
Bioinformatics 2012

MR Nelson et al.
Science 2012

JA Tennessen et al.
Science 2012

JK Pritchard et al.
Am J Hum Genet 2001

EA Stahl et al.
Nat Genet 2012

JR Huyghe et al.
Nat Genet 2013

K Liu et al.
Genet Epidemiol 2013

B Li et al.
Am J Hum Genet 2008

M Zawistowski et al.
Am J Hum Genet 2010

AP Morris et al.
Genet Epidemiol 2010

Similar Publications