Publications by authors named "Alisa Manning"

94 Publications

Multi-Ancestry Genome-wide Association Study Accounting for Gene-Psychosocial Factor Interactions Identifies Novel Loci for Blood Pressure Traits.

HGG Adv 2021 Jan 31;2(1). Epub 2020 Oct 31.

Interfaculty Institute for Genetics and Functional Genomics, University Medicine Greifswald, Greifswald, 17489, Germany.

Psychological and social factors are known to influence blood pressure (BP) and risk of hypertension and associated cardiovascular diseases. To identify novel BP loci, we carried out genome-wide association meta-analyses of systolic, diastolic, pulse, and mean arterial BP taking into account the interaction effects of genetic variants with three psychosocial factors: depressive symptoms, anxiety symptoms, and social support. Analyses were performed using a two-stage design in a sample of up to 128,894 adults from 5 ancestry groups. In the combined meta-analyses of Stages 1 and 2, we identified 59 loci (p value <5e-8), including nine novel BP loci. The novel associations were observed mostly with pulse pressure, with fewer observed with mean arterial pressure. Five novel loci were identified in African ancestry, and all but one showed patterns of interaction with at least one psychosocial factor. Functional annotation of the novel loci supports a major role for genes implicated in the immune response (), synaptic function and neurotransmission (), as well as genes previously implicated in neuropsychiatric or stress-related disorders (). These findings underscore the importance of considering psychological and social factors in gene discovery for BP, especially in non-European populations.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.xhgg.2020.100013DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8562625PMC
January 2021

Predicting diabetes risk in diverse populations: what next?

Lancet Diabetes Endocrinol 2021 Dec 28;9(12):808-810. Epub 2021 Oct 28.

Center for Public Health Genomics and Department of Public Health Sciences, University of Virginia, Charlottesville 800717, VA, USA. Electronic address:

View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.1016/S2213-8587(21)00287-4DOI Listing
December 2021

GEM: scalable and flexible gene-environment interaction analysis in millions of samples.

Bioinformatics 2021 10;37(20):3514-3520

Department of Medicine, Clinical and Translational Epidemiology Unit, Mongan Institute, Massachusetts General Hospital, Boston, MA 02114, USA.

Motivation: Gene-environment interaction (GEI) studies are a general framework that can be used to identify genetic variants that modify the effects of environmental, physiological, lifestyle or treatment effects on complex traits. Moreover, accounting for GEIs can enhance our understanding of the genetic architecture of complex diseases and traits. However, commonly used statistical software programs for GEI studies are either not applicable to testing certain types of GEI hypotheses or have not been optimized for use in large samples.

Results: Here, we develop a new software program, GEM (Gene-Environment interaction analysis in Millions of samples), which supports the inclusion of multiple GEI terms, adjustment for GEI covariates and robust inference, while allowing multi-threading to reduce computation time. GEM can conduct GEI tests as well as joint tests of genetic main and interaction effects for both continuous and binary phenotypes. Through simulations, we demonstrate that GEM scales to millions of samples while addressing limitations of existing software programs. We additionally conduct a gene-sex interaction analysis on waist-hip ratio in 352 768 unrelated individuals from the UK Biobank, identifying 24 novel loci in the joint test that have not previously been reported in combined or sex-specific analyses. Our results demonstrate that GEM can facilitate the next generation of large-scale GEI studies and help advance our understanding of the genetic architecture of complex diseases and traits.

Availability And Implementation: GEM is freely available as an open source project at https://github.com/large-scale-gxe-methods/GEM.

Supplementary Information: Supplementary data are available at Bioinformatics online.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btab223DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8545347PMC
October 2021

Gene-environment interaction analysis incorporating sex, cardiometabolic diseases, and multiple deprivation index reveals novel genetic associations with COVID-19 severity.

medRxiv 2021 Aug 23. Epub 2021 Aug 23.

Increasing evidence indicates that specific genetic variants influence the severity of outcomes after infection with COVID-19. However, it is not clear whether the effect of these genetic factors is independent of the risk due to more established non-genetic demographic and metabolic risk factors such as male sex, poor cardiometabolic health, and low socioeconomic status. We sought to identify interactions between genetic variants and non-genetic risk factors influencing COVID-19 severity via a genome-wide interaction study in the UK Biobank. Of 378,051 unrelated individuals of European ancestry, 2,402 were classified as having experienced severe COVID-19, defined as hospitalization or death due to COVID-19. Exposures included sex, cardiometabolic risk factors (obesity and type 2 diabetes [T2D], tested jointly), and multiple deprivation index. Multiplicative interaction was tested using a logistic regression model, conducting both an interaction test and a joint test of genetic main and interaction effects. Five independent variants reached genome-wide significance in the joint test, one of which also reached significance in the interaction test. One of these, rs2268616 in the gene, showed stronger effects in males and in individuals with T2D. None of the five variants showed effects on a similarly-defined phenotype in a lookup in the COVID-19 Host Genetics Initiative. These results reveal potential additional genetic loci contributing to COVID-19 severity and demonstrate the value of including non-genetic risk factors in an interaction testing approach for genetic discovery.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.1101/2021.08.13.21261910DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8404897PMC
August 2021

Genomic Risk Prediction for Breast Cancer in Older Women.

Cancers (Basel) 2021 Jul 14;13(14). Epub 2021 Jul 14.

Department of Epidemiology and Preventive Medicine, School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC 3004, Australia.

Genomic risk prediction models for breast cancer (BC) have been predominantly developed with data from women aged 40-69 years. Prospective studies of older women aged ≥70 years have been limited. We assessed the effect of a 313-variant polygenic risk score (PRS) for BC in 6339 older women aged ≥70 years (mean age 75 years) enrolled into the ASPREE trial, a randomized double-blind placebo-controlled clinical trial investigating the effect of daily 100 mg aspirin on disability-free survival. We evaluated incident BC diagnoses over a median follow-up time of 4.7 years. A multivariable Cox regression model including conventional BC risk factors was applied to prospective data, and re-evaluated after adding the PRS. We also assessed the association of rare pathogenic variants (PVs) in BC susceptibility genes (). The PRS, as a continuous variable, was an independent predictor of incident BC (hazard ratio (HR) per standard deviation (SD) = 1.4, 95% confidence interval (CI) 1.3-1.6) and hormone receptor (ER/PR)-positive disease (HR = 1.5 (CI 1.2-1.9)). Women in the top quintile of the PRS distribution had over two-fold higher risk of BC than women in the lowest quintile (HR = 2.2 (CI 1.2-3.9)). The concordance index of the model without the PRS was 0.62 (95% CI 0.56-0.68), which improved after addition of the PRS to 0.65 (95% CI 0.59-0.71). Among 41 (0.6%) carriers of PVs in BC susceptibility genes, we observed no incident BC diagnoses. Our study demonstrates that a PRS predicts incident BC risk in women aged 70 years and older, suggesting potential clinical utility extends to this older age group.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.3390/cancers13143533DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8305131PMC
July 2021

Genome-wide gene-diet interaction analysis in the UK Biobank identifies novel effects on hemoglobin A1c.

Hum Mol Genet 2021 Aug;30(18):1773-1783

Programs in Metabolism and Medical & Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA.

Diet is a significant modifiable risk factor for type 2 diabetes (T2D), and its effect on disease risk is under partial genetic control. Identification of specific gene-diet interactions (GDIs) influencing risk biomarkers such as glycated hemoglobin (HbA1c) is a critical step towards precision nutrition for T2D prevention, but progress has been slow due to limitations in sample size and accuracy of dietary exposure measurement. We leveraged the large UK Biobank (UKB) cohort and a diverse group of dietary exposures, including 30 individual dietary traits and 8 empirical dietary patterns, to conduct genome-wide interaction studies in ~340 000 European-ancestry participants to identify novel GDIs influencing HbA1c. We identified five variant-dietary trait pairs reaching genome-wide significance (P < 5 × 10-8): two involved dietary patterns (meat pattern with rs147678157 and a fruit & vegetable-based pattern with rs3010439) and three involved individual dietary traits (bread consumption with rs62218803, dried fruit consumption with rs140270534 and milk type [dairy vs. other] with 4:131148078_TAGAA_T). These were affected minimally by adjustment for geographical and lifestyle-related confounders, and four of the five variants lacked genetic main effects that would have allowed their detection in a traditional genome-wide association study for HbA1c. Notably, multiple loci near transient receptor potential subfamily M genes (TRPM2 and TRPM3) interacted with carbohydrate-containing food groups. These interactions were further characterized using non-European UKB subsets and alternative measures of glycaemia (fasting glucose and follow-up HbA1c measurements). Our results highlight GDIs influencing HbA1c for future investigation, while reinforcing known challenges in detecting and replicating GDIs.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.1093/hmg/ddab109DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8411984PMC
August 2021

Multi-ancestry genome-wide gene-sleep interactions identify novel loci for blood pressure.

Mol Psychiatry 2021 Apr 15. Epub 2021 Apr 15.

Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands.

Long and short sleep duration are associated with elevated blood pressure (BP), possibly through effects on molecular pathways that influence neuroendocrine and vascular systems. To gain new insights into the genetic basis of sleep-related BP variation, we performed genome-wide gene by short or long sleep duration interaction analyses on four BP traits (systolic BP, diastolic BP, mean arterial pressure, and pulse pressure) across five ancestry groups in two stages using 2 degree of freedom (df) joint test followed by 1df test of interaction effects. Primary multi-ancestry analysis in 62,969 individuals in stage 1 identified three novel gene by sleep interactions that were replicated in an additional 59,296 individuals in stage 2 (stage 1 + 2 P < 5 × 10), including rs7955964 (FIGNL2/ANKRD33) that increases BP among long sleepers, and rs73493041 (SNORA26/C9orf170) and rs10406644 (KCTD15/LSM14A) that increase BP among short sleepers (P < 5 × 10). Secondary ancestry-specific analysis identified another novel gene by long sleep interaction at rs111887471 (TRPC3/KIAA1109) in individuals of African ancestry (P = 2 × 10). Combined stage 1 and 2 analyses additionally identified significant gene by long sleep interactions at 10 loci including MKLN1 and RGL3/ELAVL3 previously associated with BP, and significant gene by short sleep interactions at 10 loci including C2orf43 previously associated with BP (P < 10). 2df test also identified novel loci for BP after modeling sleep that has known functions in sleep-wake regulation, nervous and cardiometabolic systems. This study indicates that sleep and primary mechanisms regulating BP may interact to elevate BP level, suggesting novel insights into sleep-related BP regulation.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.1038/s41380-021-01087-0DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8517040PMC
April 2021

Clinical and metabolomic predictors of regression to normoglycemia in a population at intermediate cardiometabolic risk.

Cardiovasc Diabetol 2021 02 27;20(1):56. Epub 2021 Feb 27.

Clinical and Translational Epidemiology Unit, Massachusetts General Hospital, 100 Cambridge, Boston, MA, USA.

Background: Impaired fasting glucose (IFG) is a prevalent and potentially reversible intermediate stage leading to type 2 diabetes that increases risk for cardiometabolic complications. The identification of clinical and molecular factors associated with the reversal, or regression, from IFG to a normoglycemia state would enable more efficient cardiovascular risk reduction strategies. The aim of this study was to identify clinical and biological predictors of regression to normoglycemia in a non-European population characterized by high rates of type 2 diabetes.

Methods: We conducted a prospective, population-based study among 9637 Mexican individuals using clinical features and plasma metabolites. Among them, 491 subjects were classified as IFG, defined as fasting glucose between 100 and 125 mg/dL at baseline. Regression to normoglycemia was defined by fasting glucose less than 100 mg/dL in the follow-up visit. Plasma metabolites were profiled by Nuclear Magnetic Resonance. Multivariable cox regression models were used to examine the associations of clinical and metabolomic factors with regression to normoglycemia. We assessed the predictive capability of models that included clinical factors alone and models that included clinical factors and prioritized metabolites.

Results: During a median follow-up period of 2.5 years, 22.6% of participants (n = 111) regressed to normoglycemia, and 29.5% progressed to type 2 diabetes (n = 145). The multivariate adjusted relative risk of regression to normoglycemia was 1.10 (95% confidence interval [CI] 1.25 to 1.32) per 10 years of age increase, 0.94 (95% CI 0.91-0.98) per 1 SD increase in BMI, and 0.91 (95% CI 0.88-0.95) per 1 SD increase in fasting glucose. A model including information from age, fasting glucose, and BMI showed a good prediction of regression to normoglycemia (AUC = 0.73 (95% CI 0.66-0.78). The improvement after adding information from prioritized metabolites (TG in large HDL, albumin, and citrate) was non-significant (AUC = 0.74 (95% CI 0.68-0.80), p value = 0.485).

Conclusion: In individuals with IFG, information from three clinical variables easily obtained in the clinical setting showed a good prediction of regression to normoglycemia beyond metabolomic features. Our findings can serve to inform and design future cardiovascular prevention strategies.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12933-021-01246-1DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7916268PMC
February 2021

Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program.

Nature 2021 02 10;590(7845):290-299. Epub 2021 Feb 10.

The Broad Institute of MIT and Harvard, Cambridge, MA, USA.

The Trans-Omics for Precision Medicine (TOPMed) programme seeks to elucidate the genetic architecture and biology of heart, lung, blood and sleep disorders, with the ultimate goal of improving diagnosis, treatment and prevention of these diseases. The initial phases of the programme focused on whole-genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here we describe the TOPMed goals and design as well as the available resources and early insights obtained from the sequence data. The resources include a variant browser, a genotype imputation server, and genomic and phenotypic data that are available through dbGaP (Database of Genotypes and Phenotypes). In the first 53,831 TOPMed samples, we detected more than 400 million single-nucleotide and insertion or deletion variants after alignment with the reference genome. Additional previously undescribed variants were detected through assembly of unmapped reads and customized analysis in highly variable loci. Among the more than 400 million detected variants, 97% have frequencies of less than 1% and 46% are singletons that are present in only one individual (53% among unrelated individuals). These rare variants provide insights into mutational processes and recent human evolutionary history. The extensive catalogue of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and noncoding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and reach of genome-wide association studies to include variants down to a frequency of approximately 0.01%.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.1038/s41586-021-03205-yDOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7875770PMC
February 2021

The First Genome-Wide Association Study for Type 2 Diabetes in Youth: The Progress in Diabetes Genetics in Youth (ProDiGY) Consortium.

Diabetes 2021 04 21;70(4):996-1005. Epub 2021 Jan 21.

Department of Epidemiology, University of Colorado School of Public Health, Aurora, CO.

The prevalence of type 2 diabetes in youth has increased substantially, yet the genetic underpinnings remain largely unexplored. To identify genetic variants predisposing to youth-onset type 2 diabetes, we formed ProDiGY, a multiethnic collaboration of three studies (TODAY, SEARCH, and T2D-GENES) with 3,006 youth case subjects with type 2 diabetes (mean age 15.1 ± 2.9 years) and 6,061 diabetes-free adult control subjects (mean age 54.2 ± 12.4 years). After stratifying by principal component-clustered ethnicity, we performed association analyses on ∼10 million imputed variants using a generalized linear mixed model incorporating a genetic relationship matrix to account for population structure and adjusting for sex. We identified seven genome-wide significant loci, including the novel locus rs10992863 in ( = 3.2 × 10; odds ratio [OR] = 1.23). Known loci identified in our analysis include rs7903146 in ( = 8.0 × 10; OR 1.58), rs72982988 near ( = 4.4 × 10; OR 1.53), rs200893788 in ( = 1.1 × 10; OR 1.32), rs2237892 in ( = 4.8 × 10; OR 1.59), rs937589119 in ( = 3.1 × 10; OR 1.34), and rs113748381 in ( = 4.1 × 10; OR 1.04). Secondary analysis with 856 diabetes-free youth control subjects uncovered an additional locus in ( = 3.2 × 10; OR 2.1) and consistent direction of effect for diabetes risk. In conclusion, we identified both known and novel loci in the first genome-wide association study of youth-onset type 2 diabetes.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.2337/db20-0443DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7980197PMC
April 2021

Discovery of rare variants associated with blood pressure regulation through meta-analysis of 1.3 million individuals.

Nat Genet 2020 12 23;52(12):1314-1332. Epub 2020 Nov 23.

Department of Clinical Biochemistry, Herlev and Gentofte Hospital, Copenhagen University Hospital, Herlev, Denmark.

Genetic studies of blood pressure (BP) to date have mainly analyzed common variants (minor allele frequency > 0.05). In a meta-analysis of up to ~1.3 million participants, we discovered 106 new BP-associated genomic regions and 87 rare (minor allele frequency ≤ 0.01) variant BP associations (P < 5 × 10), of which 32 were in new BP-associated loci and 55 were independent BP-associated single-nucleotide variants within known BP-associated regions. Average effects of rare variants (44% coding) were ~8 times larger than common variant effects and indicate potential candidate causal genes at new and known loci (for example, GATA5 and PLCB3). BP-associated variants (including rare and common) were enriched in regions of active chromatin in fetal tissues, potentially linking fetal development with BP regulation in later life. Multivariable Mendelian randomization suggested possible inverse effects of elevated systolic and diastolic BP on large artery stroke. Our study demonstrates the utility of rare-variant analyses for identifying candidate genes and the results highlight potential therapeutic targets.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.1038/s41588-020-00713-xDOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7610439PMC
December 2020

Genetic Loci and Physiologic Pathways Involved in Gestational Diabetes Mellitus Implicated Through Clustering.

Diabetes 2021 01 13;70(1):268-281. Epub 2020 Oct 13.

Diabetes Unit, Endocrine Division, Massachusetts General Hospital, Boston, MA.

Hundreds of common genetic variants acting through distinguishable physiologic pathways influence the risk of type 2 diabetes (T2D). It is unknown to what extent the physiology underlying gestational diabetes mellitus (GDM) is distinct from that underlying T2D. In this study of >5,000 pregnant women from three cohorts, we aimed to identify physiologically related groups of maternal variants associated with GDM using two complementary approaches that were based on Bayesian nonnegative matrix factorization (bNMF) clustering. First, we tested five bNMF clusters of maternal T2D-associated variants grouped on the basis of physiology outside of pregnancy for association with GDM. We found that cluster polygenic scores representing genetic determinants of reduced β-cell function and abnormal hepatic lipid metabolism were associated with GDM; these clusters were not associated with infant birth weight. Second, we derived bNMF clusters of maternal variants on the basis of pregnancy physiology and tested these clusters for association with GDM. We identified a cluster that was strongly associated with GDM as well as associated with higher infant birth weight. The effect size for this cluster's association with GDM appeared greater than that for T2D. Our findings imply that the genetic and physiologic pathways that lead to GDM differ, at least in part, from those that lead to T2D.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.2337/db20-0772DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7876560PMC
January 2021

Efficient gene-environment interaction tests for large biobank-scale sequencing studies.

Genet Epidemiol 2020 11 30;44(8):908-923. Epub 2020 Aug 30.

Human Genetics Center, Department of Epidemiology, Human Genetics and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, Texas.

Complex human diseases are affected by genetic and environmental risk factors and their interactions. Gene-environment interaction (GEI) tests for aggregate genetic variant sets have been developed in recent years. However, existing statistical methods become rate limiting for large biobank-scale sequencing studies with correlated samples. We propose efficient Mixed-model Association tests for GEne-Environment interactions (MAGEE), for testing GEI between an aggregate variant set and environmental exposures on quantitative and binary traits in large-scale sequencing studies with related individuals. Joint tests for the aggregate genetic main effects and GEI effects are also developed. A null generalized linear mixed model adjusting for covariates but without any genetic effects is fit only once in a whole genome GEI analysis, thereby vastly reducing the overall computational burden. Score tests for variant sets are performed as a combination of genetic burden and variance component tests by accounting for the genetic main effects using matrix projections. The computational complexity is dramatically reduced in a whole genome GEI analysis, which makes MAGEE scalable to hundreds of thousands of individuals. We applied MAGEE to the exome sequencing data of 41,144 related individuals from the UK Biobank, and the analysis of 18,970 protein coding genes finished within 10.4 CPU hours.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.1002/gepi.22351DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7754763PMC
November 2020

A Long Non-coding RNA, , Is an Effector Transcript at the Chromosome 8p23.1- Metabolic Traits and Type 2 Diabetes Risk Locus.

Front Genet 2020 10;11:615. Epub 2020 Jul 10.

Center for Molecular Medicine & Genetics, Wayne State University, Detroit, MI, United States.

Aims: Causal transcripts at genomic loci associated with type 2 diabetes (T2D) are mostly unknown. The chr8p23.1 variant rs4841132, associated with an insulin-resistant diabetes risk phenotype, lies in the second exon of a long non-coding RNA (lncRNA) gene, , located 175 kilobases from , which encodes a key protein regulating insulin-mediated hepatic glycogen storage in humans. We hypothesized that regulates expression of in human hepatocytes.

Methods: We tested our hypothesis using Stellaris fluorescent hybridization to assess subcellular localization of ; small interfering RNA (siRNA) knockdown of , followed by RT-PCR to quantify and expression; RNA-seq to quantify the whole-transcriptome gene expression response to knockdown; and an insulin-stimulated assay to measure hepatocyte glycogen deposition before and after knockdown.

Results: We found that siRNA knockdown decreased transcript levels by approximately 80%, increased mRNA levels by 1.7-fold, and increased glycogen deposition by >50% in primary human hepatocytes. An A/G heterozygous carrier (vs. three G/G carriers) had reduced abundance due to reduced transcription of the A allele and increased PPP1R3B expression and glycogen deposition.

Conclusion: We show that the lncRNA is a negative regulator of expression and glycogen deposition in human hepatocytes and a causal transcript at an insulin-resistant T2D risk locus.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.3389/fgene.2020.00615DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7367044PMC
July 2020

Deriving stratified effects from joint models investigating gene-environment interactions.

BMC Bioinformatics 2020 Jun 18;21(1):251. Epub 2020 Jun 18.

Department of Computational Biology, USR 3756 CNRS, Institut Pasteur, Paris, France.

Background: Models including an interaction term and performing a joint test of SNP and/or interaction effect are often used to discover Gene-Environment (GxE) interactions. When the environmental exposure is a binary variable, analyses from exposure-stratified models which consist of estimating genetic effect in unexposed and exposed individuals separately can be of interest. In large-scale consortia focusing on GxE interactions in which only the joint test has been performed, it may be challenging to get summary statistics from both exposure-stratified and marginal (i.e not accounting for interaction) models.

Results: In this work, we developed a simple framework to estimate summary statistics in each stratum of a binary exposure and in the marginal model using summary statistics from the "joint" model. We performed simulation studies to assess our estimators' accuracy and examined potential sources of bias, such as correlation between genotype and exposure and differing phenotypic variances within exposure strata. Results from these simulations highlight the high theoretical accuracy of our estimators and yield insights into the impact of potential sources of bias. We then applied our methods to real data and demonstrate our estimators' retained accuracy after filtering SNPs by sample size to mitigate potential bias.

Conclusions: These analyses demonstrated the accuracy of our method in estimating both stratified and marginal summary statistics from a joint model of gene-environment interaction. In addition to facilitating the interpretation of GxE screenings, this work could be used to guide further functional analyses. We provide a user-friendly Python script to apply this strategy to real datasets. The Python script and documentation are available at https://gitlab.pasteur.fr/statistical-genetics/j2s.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-020-03569-4DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7302007PMC
June 2020

Gene-educational attainment interactions in a multi-ancestry genome-wide meta-analysis identify novel blood pressure loci.

Mol Psychiatry 2021 06 5;26(6):2111-2125. Epub 2020 May 5.

Health Disparities Research Section, Laboratory of Epidemiology and Population Sciences, National Institute on Aging, National Institutes of Health, Baltimore, MD, 21224, USA.

Educational attainment is widely used as a surrogate for socioeconomic status (SES). Low SES is a risk factor for hypertension and high blood pressure (BP). To identify novel BP loci, we performed multi-ancestry meta-analyses accounting for gene-educational attainment interactions using two variables, "Some College" (yes/no) and "Graduated College" (yes/no). Interactions were evaluated using both a 1 degree of freedom (DF) interaction term and a 2DF joint test of genetic and interaction effects. Analyses were performed for systolic BP, diastolic BP, mean arterial pressure, and pulse pressure. We pursued genome-wide interrogation in Stage 1 studies (N = 117 438) and follow-up on promising variants in Stage 2 studies (N = 293 787) in five ancestry groups. Through combined meta-analyses of Stages 1 and 2, we identified 84 known and 18 novel BP loci at genome-wide significance level (P < 5 × 10). Two novel loci were identified based on the 1DF test of interaction with educational attainment, while the remaining 16 loci were identified through the 2DF joint test of genetic and interaction effects. Ten novel loci were identified in individuals of African ancestry. Several novel loci show strong biological plausibility since they involve physiologic systems implicated in BP regulation. They include genes involved in the central nervous system-adrenal signaling axis (ZDHHC17, CADPS, PIK3C2G), vascular structure and function (GNB3, CDON), and renal function (HAS2 and HAS2-AS1, SLIT3). Collectively, these findings suggest a role of educational attainment or SES in further dissection of the genetic architecture of BP.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.1038/s41380-020-0719-3DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7641978PMC
June 2021

Multi-ancestry sleep-by-SNP interaction analysis in 126,926 individuals reveals lipid loci stratified by sleep duration.

Nat Commun 2019 11 12;10(1):5121. Epub 2019 Nov 12.

Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, Netherlands.

Both short and long sleep are associated with an adverse lipid profile, likely through different biological pathways. To elucidate the biology of sleep-associated adverse lipid profile, we conduct multi-ancestry genome-wide sleep-SNP interaction analyses on three lipid traits (HDL-c, LDL-c and triglycerides). In the total study sample (discovery + replication) of 126,926 individuals from 5 different ancestry groups, when considering either long or short total sleep time interactions in joint analyses, we identify 49 previously unreported lipid loci, and 10 additional previously unreported lipid loci in a restricted sample of European-ancestry cohorts. In addition, we identify new gene-sleep interactions for known lipid loci such as LPL and PCSK9. The previously unreported lipid loci have a modest explained variance in lipid levels: most notable, gene-short-sleep interactions explain 4.25% of the variance in triglyceride level. Collectively, these findings contribute to our understanding of the biological mechanisms involved in sleep-associated adverse lipid profiles.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.1038/s41467-019-12958-0DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6851116PMC
November 2019

Impact of Rare and Common Genetic Variants on Diabetes Diagnosis by Hemoglobin A1c in Multi-Ancestry Cohorts: The Trans-Omics for Precision Medicine Program.

Am J Hum Genet 2019 10 26;105(4):706-718. Epub 2019 Sep 26.

National Heart, Lung, and Blood Institute and Boston University's Framingham Heart Study, Framingham MA 01702, USA; Population Sciences Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD 20814, USA.

Hemoglobin A1c (HbA1c) is widely used to diagnose diabetes and assess glycemic control in individuals with diabetes. However, nonglycemic determinants, including genetic variation, may influence how accurately HbA1c reflects underlying glycemia. Analyzing the NHLBI Trans-Omics for Precision Medicine (TOPMed) sequence data in 10,338 individuals from five studies and four ancestries (6,158 Europeans, 3,123 African-Americans, 650 Hispanics, and 407 East Asians), we confirmed five regions associated with HbA1c (GCK in Europeans and African-Americans, HK1 in Europeans and Hispanics, FN3K and/or FN3KRP in Europeans, and G6PD in African-Americans and Hispanics) and we identified an African-ancestry-specific low-frequency variant (rs1039215 in HBG2 and HBE1, minor allele frequency (MAF) = 0.03). The most associated G6PD variant (rs1050828-T, p.Val98Met, MAF = 12% in African-Americans, MAF = 2% in Hispanics) lowered HbA1c (-0.88% in hemizygous males, -0.34% in heterozygous females) and explained 23% of HbA1c variance in African-Americans and 4% in Hispanics. Additionally, we identified a rare distinct G6PD coding variant (rs76723693, p.Leu353Pro, MAF = 0.5%; -0.98% in hemizygous males, -0.46% in heterozygous females) and detected significant association with HbA1c when aggregating rare missense variants in G6PD. We observed similar magnitude and direction of effects for rs1039215 (HBG2) and rs76723693 (G6PD) in the two largest TOPMed African American cohorts, and we replicated the rs76723693 association in the UK Biobank African-ancestry participants. These variants in G6PD and HBG2 were monomorphic in the European and Asian samples. African or Hispanic ancestry individuals carrying G6PD variants may be underdiagnosed for diabetes when screened with HbA1c. Thus, assessment of these variants should be considered for incorporation into precision medicine approaches for diabetes diagnosis.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.ajhg.2019.08.010DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6817529PMC
October 2019

A multi-ancestry genome-wide study incorporating gene-smoking interactions identifies multiple new loci for pulse pressure and mean arterial pressure.

Hum Mol Genet 2019 08;28(15):2615-2633

Icelandic Heart Association, Kopavogur, Iceland.

Elevated blood pressure (BP), a leading cause of global morbidity and mortality, is influenced by both genetic and lifestyle factors. Cigarette smoking is one such lifestyle factor. Across five ancestries, we performed a genome-wide gene-smoking interaction study of mean arterial pressure (MAP) and pulse pressure (PP) in 129 913 individuals in stage 1 and follow-up analysis in 480 178 additional individuals in stage 2. We report here 136 loci significantly associated with MAP and/or PP. Of these, 61 were previously published through main-effect analysis of BP traits, 37 were recently reported by us for systolic BP and/or diastolic BP through gene-smoking interaction analysis and 38 were newly identified (P < 5 × 10-8, false discovery rate < 0.05). We also identified nine new signals near known loci. Of the 136 loci, 8 showed significant interaction with smoking status. They include CSMD1 previously reported for insulin resistance and BP in the spontaneously hypertensive rats. Many of the 38 new loci show biologic plausibility for a role in BP regulation. SLC26A7 encodes a chloride/bicarbonate exchanger expressed in the renal outer medullary collecting duct. AVPR1A is widely expressed, including in vascular smooth muscle cells, kidney, myocardium and brain. FHAD1 is a long non-coding RNA overexpressed in heart failure. TMEM51 was associated with contractile function in cardiomyocytes. CASP9 plays a central role in cardiomyocyte apoptosis. Identified only in African ancestry were 30 novel loci. Our findings highlight the value of multi-ancestry investigations, particularly in studies of interaction with lifestyle factors, where genomic and lifestyle differences may contribute to novel findings.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.1093/hmg/ddz070DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6644157PMC
August 2019

Exome sequencing of 20,791 cases of type 2 diabetes and 24,440 controls.

Nature 2019 06 22;570(7759):71-76. Epub 2019 May 22.

Division of Genome Research, Center for Genome Science, National Institute of Health, Chungcheongbuk-do, South Korea.

Protein-coding genetic variants that strongly affect disease risk can yield relevant clues to disease pathogenesis. Here we report exome-sequencing analyses of 20,791 individuals with type 2 diabetes (T2D) and 24,440 non-diabetic control participants from 5 ancestries. We identify gene-level associations of rare variants (with minor allele frequencies of less than 0.5%) in 4 genes at exome-wide significance, including a series of more than 30 SLC30A8 alleles that conveys protection against T2D, and in 12 gene sets, including those corresponding to T2D drug targets (P = 6.1 × 10) and candidate genes from knockout mice (P = 5.2 × 10). Within our study, the strongest T2D gene-level signals for rare variants explain at most 25% of the heritability of the strongest common single-variant signals, and the gene-level effect sizes of the rare variants that we observed in established T2D drug targets will require 75,000-185,000 sequenced cases to achieve exome-wide significance. We propose a method to interpret these modest rare-variant associations and to incorporate these associations into future target or gene prioritization efforts.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.1038/s41586-019-1231-2DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6699738PMC
June 2019

Multi-ancestry genome-wide gene-smoking interaction study of 387,272 individuals identifies new loci associated with serum lipids.

Nat Genet 2019 04 29;51(4):636-648. Epub 2019 Mar 29.

Human Genomics Laboratory, Pennington Biomedical Research Center, Baton Rouge, LA, USA.

The concentrations of high- and low-density-lipoprotein cholesterol and triglycerides are influenced by smoking, but it is unknown whether genetic associations with lipids may be modified by smoking. We conducted a multi-ancestry genome-wide gene-smoking interaction study in 133,805 individuals with follow-up in an additional 253,467 individuals. Combined meta-analyses identified 13 new loci associated with lipids, some of which were detected only because association differed by smoking status. Additionally, we demonstrate the importance of including diverse populations, particularly in studies of interactions with lifestyle factors, where genomic and lifestyle differences by ancestry may contribute to novel findings.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.1038/s41588-019-0378-yDOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6467258PMC
April 2019

Protein-coding variants implicate novel genes related to lipid homeostasis contributing to body-fat distribution.

Nat Genet 2019 03 18;51(3):452-469. Epub 2019 Feb 18.

Department of Biostatistics and Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, USA.

Body-fat distribution is a risk factor for adverse cardiovascular health consequences. We analyzed the association of body-fat distribution, assessed by waist-to-hip ratio adjusted for body mass index, with 228,985 predicted coding and splice site variants available on exome arrays in up to 344,369 individuals from five major ancestries (discovery) and 132,177 European-ancestry individuals (validation). We identified 15 common (minor allele frequency, MAF ≥5%) and nine low-frequency or rare (MAF <5%) coding novel variants. Pathway/gene set enrichment analyses identified lipid particle, adiponectin, abnormal white adipose tissue physiology and bone development and morphology as important contributors to fat distribution, while cross-trait associations highlight cardiometabolic traits. In functional follow-up analyses, specifically in Drosophila RNAi-knockdowns, we observed a significant increase in the total body triglyceride levels for two genes (DNAH10 and PLXND1). We implicate novel genes in fat distribution, stressing the importance of interrogating low-frequency and protein-coding variants.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.1038/s41588-018-0334-2DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6560635PMC
March 2019

Multiancestry Genome-Wide Association Study of Lipid Levels Incorporating Gene-Alcohol Interactions.

Am J Epidemiol 2019 06;188(6):1033-1054

Department of Epidemiology and Biostatistics, Imperial College London, London, United Kingdom.

A person's lipid profile is influenced by genetic variants and alcohol consumption, but the contribution of interactions between these exposures has not been studied. We therefore incorporated gene-alcohol interactions into a multiancestry genome-wide association study of levels of high-density lipoprotein cholesterol, low-density lipoprotein cholesterol, and triglycerides. We included 45 studies in stage 1 (genome-wide discovery) and 66 studies in stage 2 (focused follow-up), for a total of 394,584 individuals from 5 ancestry groups. Analyses covered the period July 2014-November 2017. Genetic main effects and interaction effects were jointly assessed by means of a 2-degrees-of-freedom (df) test, and a 1-df test was used to assess the interaction effects alone. Variants at 495 loci were at least suggestively associated (P < 1 × 10-6) with lipid levels in stage 1 and were evaluated in stage 2, followed by combined analyses of stage 1 and stage 2. In the combined analysis of stages 1 and 2, a total of 147 independent loci were associated with lipid levels at P < 5 × 10-8 using 2-df tests, of which 18 were novel. No genome-wide-significant associations were found testing the interaction effect alone. The novel loci included several genes (proprotein convertase subtilisin/kexin type 5 (PCSK5), vascular endothelial growth factor B (VEGFB), and apolipoprotein B mRNA editing enzyme, catalytic polypeptide 1 (APOBEC1) complementation factor (A1CF)) that have a putative role in lipid metabolism on the basis of existing evidence from cellular and experimental models.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.1093/aje/kwz005DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6545280PMC
June 2019

Multi-ancestry study of blood lipid levels identifies four loci interacting with physical activity.

Nat Commun 2019 01 22;10(1):376. Epub 2019 Jan 22.

Laboratory of Genetics and Molecular Cardiology, Heart Institute (InCor), University of São Paulo Medical School, São Paulo, 01246903, SP, Brazil.

Many genetic loci affect circulating lipid levels, but it remains unknown whether lifestyle factors, such as physical activity, modify these genetic effects. To identify lipid loci interacting with physical activity, we performed genome-wide analyses of circulating HDL cholesterol, LDL cholesterol, and triglyceride levels in up to 120,979 individuals of European, African, Asian, Hispanic, and Brazilian ancestry, with follow-up of suggestive associations in an additional 131,012 individuals. We find four loci, in/near CLASP1, LHX1, SNTA1, and CNTNAP2, that are associated with circulating lipid levels through interaction with physical activity; higher levels of physical activity enhance the HDL cholesterol-increasing effects of the CLASP1, LHX1, and SNTA1 loci and attenuate the LDL cholesterol-increasing effect of the CNTNAP2 locus. The CLASP1, LHX1, and SNTA1 regions harbor genes linked to muscle function and lipid metabolism. Our results elucidate the role of physical activity interactions in the genetic contribution to blood lipid levels.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.1038/s41467-018-08008-wDOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6342931PMC
January 2019

Novel genetic associations for blood pressure identified via gene-alcohol interaction in up to 570K individuals across multiple ancestries.

PLoS One 2018 18;13(6):e0198166. Epub 2018 Jun 18.

Icelandic Heart Association, Kopavogur, Iceland.

Heavy alcohol consumption is an established risk factor for hypertension; the mechanism by which alcohol consumption impact blood pressure (BP) regulation remains unknown. We hypothesized that a genome-wide association study accounting for gene-alcohol consumption interaction for BP might identify additional BP loci and contribute to the understanding of alcohol-related BP regulation. We conducted a large two-stage investigation incorporating joint testing of main genetic effects and single nucleotide variant (SNV)-alcohol consumption interactions. In Stage 1, genome-wide discovery meta-analyses in ≈131K individuals across several ancestry groups yielded 3,514 SNVs (245 loci) with suggestive evidence of association (P < 1.0 x 10-5). In Stage 2, these SNVs were tested for independent external replication in ≈440K individuals across multiple ancestries. We identified and replicated (at Bonferroni correction threshold) five novel BP loci (380 SNVs in 21 genes) and 49 previously reported BP loci (2,159 SNVs in 109 genes) in European ancestry, and in multi-ancestry meta-analyses (P < 5.0 x 10-8). For African ancestry samples, we detected 18 potentially novel BP loci (P < 5.0 x 10-8) in Stage 1 that warrant further replication. Additionally, correlated meta-analysis identified eight novel BP loci (11 genes). Several genes in these loci (e.g., PINX1, GATA4, BLK, FTO and GABBR2) have been previously reported to be associated with alcohol consumption. These findings provide insights into the role of alcohol consumption in the genetic architecture of hypertension.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0198166PLOS
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6005576PMC
January 2019

Publisher Correction: Protein-altering variants associated with body mass index implicate pathways that control energy intake and expenditure in obesity.

Nat Genet 2018 05;50(5):766-767

Department of Genetic Epidemiology, University of Regensburg, Regensburg, Germany.

In the version of this article originally published, one of the two authors with the name Wei Zhao was omitted from the author list and the affiliations for both authors were assigned to the single Wei Zhao in the author list. In addition, the ORCID for Wei Zhao (Department of Biostatistics and Epidemiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA) was incorrectly assigned to author Wei Zhou. The errors have been corrected in the HTML and PDF versions of the article.
View Article and Find Full Text PDF

Download full-text PDF

Source
http://dx.doi.org/10.1038/s41588-018-0082-3DOI Listing
May 2018
-->