Joint analysis of functional genomic data and genome-wide association studies of 18 human traits

Published 2013 in bioRxiv

ABSTRACT

Annotations of gene structures and regulatory elements can inform genome-wide association studies (GWAS). However, choosing the relevant annotations for interpreting an association study of a given trait remains challenging. We describe a statistical model that uses association statistics computed across the genome to identify classes of genomic element that are enriched or depleted for loci that influence a trait. The model naturally incorporates multiple types of annotations. We applied the model to GWAS of 18 human traits, including red blood cell traits, platelet traits, glucose levels, lipid levels, height, BMI, and Crohn’s disease. For each trait, we evaluated the relevance of 450 different genomic annotations, including protein-coding genes, enhancers, and DNase-I hypersensitive sites in over a hundred tissues and cell lines. We show that the fraction of phenotype-associated SNPs that influence protein sequence ranges from around 2% (for platelet volume) up to around 20% (for LDL cholesterol); that repressed chromatin is significantly depleted for SNPs associated with several traits; and that cell type-specific DNase-I hypersensitive sites are enriched for SNPs associated with several traits (for example, the spleen in platelet volume). Finally, by re-weighting each GWAS using information from functional genomics, we increase the number of loci with high-confidence associations by around 5%.

PUBLICATION RECORD

Publication year
2013
Venue
bioRxiv
Publication date
2013-11-19
Fields of study
Biology, Medicine, Computer Science
Identifiers
DOI 10.1101/000752 arXiv 1311.4843 PMID 24702953 PMCID PMC3980523
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar, PubMed

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease
2016cited by this paper
R: A language and environment for statistical computing.
2014cited by this paper
An integrated encyclopedia of DNA elements in the human genome
2014cited by this paper
Transcriptome and genome sequencing uncovers functional variation in humans
2013cited by this paper
The genomic signature of trait-associated variants
2013influential reference
Integrated Enrichment Analysis of Variants and Pathways in Genome-Wide Association Studies Indicates Central Role for IL-2 Signaling Genes in Type 1 Diabetes, and Cytokine Signaling Genes in Crohn's Disease
2013cited by this paper
Discovery and refinement of loci associated with lipid levels
2013cited by this paper
A Bayesian Method to Incorporate Hundreds of Functional Characteristics with Association Evidence to Improve Variant Prioritization
2013cited by this paper
Bayesian Test for Colocalisation between Pairs of Genetic Association Studies Using Summary Statistics
2013cited by this paper
Chromatin stretch enhancer states drive cell-specific gene regulation and harbor human disease risk variants
2013cited by this paper
Predicting Cell Types and Genetic Variations Contributing to Disease by Combining GWAS and Epigenetic Data
2013influential reference
Maps of open chromatin highlight cell type–restricted patterns of regulatory sequence variation at hematological trait loci
2013cited by this paper
Systematic functional regulatory assessment of disease-associated variants
2013cited by this paper
Super-enhancers in the control of cell identity and disease.
2013cited by this paper
A Unified Framework for Association Analysis with Multiple Related Phenotypes
2013cited by this paper
All SNPs Are Not Created Equal: Genome-Wide Association Studies Reveal a Consistent Pattern of Enrichment among Functionally Annotated SNPs
2013cited by this paper
Fast and accurate imputation of summary statistics enhances evidence of functional enrichment
2013cited by this paper
Seventy-five genetic loci influencing the human red blood cell
2012cited by this paper
Dissecting the regulatory architecture of gene expression QTLs
2012cited by this paper
A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistance
2012cited by this paper
Integrative annotation of chromatin elements from ENCODE data
2012influential reference
The Metabochip, a Custom Genotyping Array for Genetic Studies of Metabolic, Cardiovascular, and Anthropometric Traits
2012cited by this paper
Breast cancer risk-associated SNPs modulate the affinity of chromatin for FOXA1 and alter gene expression
2012cited by this paper
The accessible chromatin landscape of the human genome
2012cited by this paper
Five years of GWAS discovery.
2012cited by this paper
Bayesian refinement of association signals for 14 loci in 3 common diseases
2012cited by this paper
Synthesizing genome-wide association studies and expression microarray reveals novel genes that act in the human growth plate to modulate height.
2012cited by this paper
Association analyses of 249 , 796 individuals reveal 18 new loci associated with body mass index
2012cited by this paper
An Integrated Encyclopedia of DNA Elements in the Human Genome
2012influential reference
Systematic Localization of Common Disease-Associated Variation in Regulatory DNA
2012cited by this paper
Chromatin marks identify critical cell types for fine mapping complex trait variants
2012cited by this paper
Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits
2012cited by this paper
Genome-wide meta-analysis identifies 56 bone mineral density loci and reveals 14 loci associated with risk of fracture
2012cited by this paper
Mapping and analysis of chromatin state dynamics in nine human cell types
2012cited by this paper
Maps of Open Chromatin Guide the Functional Follow-Up of Genome-Wide Association Signals: Application to Hematological Traits
2011cited by this paper
Exploration of empirical Bayes hierarchical modeling for the analysis of genome-wide association study data.
2011cited by this paper
Integrating Autoimmune Risk Loci with Gene-Expression Data Identifies Specific Pathogenic Immune Cell Subsets.
2011cited by this paper
DNaseI sensitivity QTLs are a major determinant of human expression variation
2011cited by this paper
New gene functions in megakaryopoiesis and platelet formation
2011cited by this paper
Integrating autoimmune risk loci with gene-expression data identifies specific pathogenic immune cell subsets.
2011cited by this paper
The Elements of Statistical Learning: Data Mining, Inference, and Prediction
2010cited by this paper
Trait-Associated SNPs Are More Likely to Be eQTLs: Annotation to Enhance Discovery from GWAS
2010cited by this paper
A map of human genome variation from population-scale sequencing
2010influential reference
Hundreds of variants clustered in genomic loci and biological pathways affect human height
2010cited by this paper
Biological, Clinical, and Population Relevance of 95 Loci for Blood Lipids
2010influential reference
Candidate Causal Regulatory Effects by Integration of Expression QTLs with Complex Trait Genetic Associations
2010cited by this paper
Potential etiologic and functional implications of genome-wide association loci for human diseases and traits
2009influential reference
Learning a Prior on Regulatory Potential from eQTL Data
2009cited by this paper
Bayes factors for genome‐wide association studies: comparison with P‐values
2009cited by this paper
The Wellcome Trust Case Control Consortium, U.K.
2008cited by this paper
High-Resolution Mapping of Expression-QTLs Yields Insight into Human Gene Regulation
2008cited by this paper
Statistical independence of the colocalized association signals for type 1 diabetes and RPS26 gene expression on chromosome 12q13
2008cited by this paper
Enriching the analysis of genomewide association studies with hierarchical modeling.
2007cited by this paper
Fibrogenesis in Crohn's Disease
2007cited by this paper
Hierarchical Bayes prioritization of marker associations from a genome‐wide association scan for further investigation
2007cited by this paper
Megakaryocyte development and platelet production
2006cited by this paper
The elements of statistical learning: data mining, inference and prediction
2005cited by this paper
Association studies for quantitative traits in structured populations
2002influential reference
The Elements of Statistical Learning: Data Mining, Inference, and Prediction
2001cited by this paper
Genetic Influences on Muscle Strength, Lean Body Mass, and Bone Mineral Density: A Twin Study
1997cited by this paper

CITED BY

Causal splicing variants revealed by deep-learning integration of single-cell sQTL mapping under influenza infection
2026cites this paper
keju: powerful and accurate inference in Massively Parallel Reporter Assays
2026cites this paper
Donor-matched iPSC model reveals context-dependent T2D genetic signals in fibro-adipogenic progenitors
2026cites this paper
Single-cell multiome and spatial profiling reveals pancreas cell type-specific gene regulatory programs driving type 1 diabetes progression
2025cites this paper
Combining functional annotation and multi-trait fine-mapping methods improves fine-mapping resolution at glycaemic trait loci
2025cites this paper
Rethinking GWAS: how lessons from genetic screens and artificial intelligence could reveal biological mechanisms
2025cites this paper
Integrative single-cell multi-omics profiling of human pancreatic islets identifies T1D-associated genes and regulatory signals
2025cites this paper
Dysregulated Gene Expression: A Candidate Mechanism for Anxiety Disorders
2025cites this paper
fSuSiE enables fine-mapping of QTLs from genome-scale molecular profiles
2025cites this paper
Amyotrophic Lateral Sclerosis-associated 3′ UTR enhancer embedded within CAV1 risk gene
2025cites this paper
Integrative Genomics Refines Tissues, Candidate Genes and Putative Regulatory Links Involved in the Humic Adaptation of Keystone Freshwater Fish
2025cites this paper
Single-cell multiome and spatial profiling reveals pancreas cell type–specific gene regulatory programs of type 1 diabetes progression
2025cites this paper
Evaluation of epistasis detection methods for quantitative phenotypes
2025cites this paper
Towards improved fine-mapping of candidate causal variants
2025cites this paper
A review of post-GWAS studies in schizophrenia
2025cites this paper
The Promise of Genetics in Alcohol Use Disorders and the Problems of Phenotype, Polygenetic Architecture, Ancestry, and Comorbidity.
2025cites this paper
The impact of background selection in mutation-selection-drift balance models of complex trait evolution
2025cites this paper
An obesogenic FTO allele causes accelerated development, growth and insulin resistance in human skeletal muscle cells
2025cites this paper
BTS: a scalable Bayesian Tissue Score for prioritizing GWAS variants and their functional contexts across >1000s of omics datasets
2025cites this paper
Translational genomics of osteoarthritis in 1,962,069 individuals
2025cites this paper
Heterogeneity of acute myeloid leukemia patients explored through single-cell and single-sample gene regulatory networks.
2025cites this paper
Transcripts with high distal heritability mediate genetic effects on complex metabolic traits
2025cites this paper
BTS: scalable Bayesian Tissue Score for prioritizing GWAS variants and their functional contexts across omics data
2025cites this paper
Genome-wide association study on longitudinal and cross-sectional traits of child health and development in a Japanese population
2025cites this paper
MIRAGE: A Bayesian statistical method for gene-level rare-variant analysis incorporating functional annotations.
2025cites this paper
Integrative Genomic and Functional Approaches Identify FUOM as a Key Driver and Therapeutic Target in Cervical Cancer
2025cites this paper
Genetics and Analysis of Quantitative Traits
2025cites this paper
Multi-dimensional annotation of porcine variants using genomic and epigenomic features in pigs
2025influential citation
Disease-associated variants are enriched for altering cell-type-specific gene co-expression relationships
2025cites this paper
Multi-omics approaches for understanding gene-environment interactions in noncommunicable diseases: techniques, translation, and equity issues
2025cites this paper
A Deep Dive into Statistical Modeling of RNA Splicing QTLs Reveals New Variants that Explain Neurodegenerative Disease
2024cites this paper
The goldmine of GWAS summary statistics: a systematic review of methods and tools
2024cites this paper
The genetics and epidemiology of N- and O-immunoglobulin A glycomics
2024cites this paper
Untangling the genetics of beta cell dysfunction and death in type 1 diabetes
2024cites this paper
Understanding the genetic complexity of puberty timing across the allele frequency spectrum
2024influential citation
Genetic links between ovarian ageing, cancer risk and de novo mutation rates
2024cites this paper
Effectiveness of the Use of Genetic Markers of Meat Productivity in the Kazakh White-Headed Breed Identified Using Genome-Wide Association Study
2024cites this paper
Predicting cell type-specific epigenomic profiles accounting for distal genetic effects
2024cites this paper
A distinct class of pan-cancer susceptibility genes revealed by an alternative polyadenylation transcriptome-wide association study
2024influential citation
Polygenic scores and their applications in kidney disease
2024cites this paper
Genetic analyses point to alterations in immune-related pathways underpinning the association between psychiatric disorders and COVID-19
2024cites this paper
Characterization of caffeine response regulatory variants in vascular endothelial cells
2024cites this paper
The genetics and epidemiology of N- and O- IgA glycomics
2024cites this paper
Gene expression variation underlying tissue-specific responses to copper stress in Drosophila melanogaster
2024cites this paper
Placental expression quantitative trait loci in an East Asian population
2024cites this paper
LOGOWheat: deep learning–based prediction of regulatory effects for noncoding variants in wheats
2024cites this paper
A multi-omic atlas of human embryonic skeletal development
2024cites this paper
Causality Between ADHD, ASD, and CVDs: A Two-Step, Two-Sample Mendelian Randomization Investigation
2024cites this paper
First insight of the genome-wide association study and genomic prediction into enteritis disease (Vibrio harveyi) resistance trait in the lined seahorse (Hippocampus erectus)
2024cites this paper
An ancient polymorphic regulatory region within the BDNF gene associated with obesity modulates anxiety-like behaviour in mice and humans
2024cites this paper
Transcripts with high distal heritability mediate genetic effects on complex metabolic traits
2024cites this paper
Enhancing disease risk gene discovery by integrating transcription factor-linked trans-variants into transcriptome-wide association analyses
2024cites this paper
Improved estimation of functional enrichment in SNP heritability using feasible generalized least squares
2024cites this paper
Crop-GPA: an integrated platform of crop gene-phenotype associations
2024cites this paper
Molecular Characterization of the Distal Lung: Novel Insights from COPD Omics.
2024cites this paper
Unveiling blood pressure‐associated genes in aortic cells through integrative analysis of GWAS and RNA modification‐associated variants
2024cites this paper
From GWASs toward Mechanistic Understanding with Case Studies in Dermatogenetics.
2024cites this paper
Toward telomere-to-telomere cat genomes for precision medicine and conservation biology
2024cites this paper
Fine mapping of candidate effector genes for heart rate
2024cites this paper
Multi-ancestry genome-wide association analyses improve resolution of genes and pathways influencing lung function and chronic obstructive pulmonary disease risk
2023cites this paper
Toward a comprehensive catalog of regulatory elements
2023cites this paper
Liver-Specific Polygenic Risk Score Is Associated with Alzheimer’s Disease Diagnosis
2023cites this paper
Interaction-integrated linear mixed model reveals 3D-genetic basis underlying Autism.
2023cites this paper
A distinct class of pan cancer susceptibility genes revealed by alternative polyadenylation transcriptome wide association study
2023cites this paper
Multi‐omics cannot replace sample size in genome‐wide association studies
2023cites this paper
Multi-ancestry transcriptome-wide association analyses yield insights into tobacco use biology and drug repurposing
2023cites this paper
Integration of genetic fine-mapping and multi-omics data reveals candidate effector genes for hypertension
2023cites this paper
Translating non-coding genetic associations into a better understanding of immune-mediated disease
2023cites this paper
Deep learning predicts the impact of regulatory variants on cell-type-specific enhancers in the brain
2023cites this paper
Priors, population sizes, and power in genome-wide hypothesis tests
2023cites this paper
Multi-omics data integration methods and their applications in psychiatric disorders.
2023cites this paper
Exploring the genetic basis of coronary artery disease using functional genomics.
2023cites this paper
Evaluating 17 methods incorporating biological function with GWAS summary statistics to accelerate discovery demonstrates a tradeoff between high sensitivity and high positive predictive value
2023cites this paper
PALM: a powerful and adaptive latent model for prioritizing risk variants with functional annotations
2023cites this paper
Population-scale skeletal muscle single-nucleus multi-omic profiling reveals extensive context specific genetic regulation
2023cites this paper
An integrative single-cell multi-omics profiling of human pancreatic islets identifies T1D associated genes and regulatory signals
2023cites this paper
Enhancing Disease Risk Gene Discovery by Integrating Transcription Factor-Linked Trans-located Variants into Transcriptome-Wide Association Analyses
2023cites this paper
Plasma Proteomics to Identify Drug Targets for Ischemic Heart Disease
2023cites this paper
Decoding mutational hotspots in human disease through the gene modules governing thymic regulatory T cells
2023cites this paper
Linking non-coding variants to function in microglia in Alzheimer’s disease
2023influential citation
The Genetics of Coronary Artery Disease: A Vascular Perspective
2023cites this paper
Single-cell genomics improves the discovery of risk variants and genes of atrial fibrillation
2023cites this paper
Bayesian multivariate genetic analysis improves translational insights
2023cites this paper
Multitissue H3K27ac profiling of GTEx samples links epigenomic variation to disease
2023cites this paper
Improving the discovery of rare variants associated with alcohol problems by leveraging machine learning phenotype prediction and functional information
2023cites this paper
Functional characterization of Alzheimer’s disease genetic variants in microglia
2023cites this paper
Additional Evidence for the Relationship between Type 2 Diabetes Mellitus and Stroke through Observational and Genetic Analyses.
2023cites this paper
Childhood-onset asthma is characterized by airway epithelial hillock-to-squamous differentiation in early life
2023cites this paper
Genetic variation in chromatin state across multiple tissues in Drosophila melanogaster
2023cites this paper
Single-cell chromatin accessibility and transcriptomic characterization of Behcet’s disease
2023cites this paper
A whole-genome reference panel of 14,393 individuals for East Asian populations accelerates discovery of rare functional variants
2023cites this paper
Multiple genetic variants at the SLC30A8 locus affect local super-enhancer activity and influence pancreatic β-cell survival and function
2023cites this paper
Multi-omics analysis in primary T cells elucidates mechanisms behind disease-associated genetic loci
2023cites this paper
Modeling tissue co-regulation estimates tissue-specific contributions to disease
2023cites this paper
Evaluating significance of European-associated index SNPs in the East Asian population for 31 complex phenotypes
2023cites this paper
Interpreting non-coding disease-associated human variants using single-cell epigenomics
2023cites this paper
Understanding the genetic complexity of puberty timing across the allele frequency spectrum
2023cites this paper
Genome-wide mapping of regulatory variants for temperature- and salinity-adaptive genes reveals genetic basis of genotype-by-environment interaction in Crassostrea ariakensis.
2023cites this paper
Modeling tissue co-regulation to estimate tissue-specific contributions to disease
2023cites this paper
graph-GPA 2.0: improving multi-disease genetic analysis with integration of functional annotation data
2023cites this paper