DOMINO: a novel algorithm for network-based identification of active modules with reduced rate of false calls

Published 2020 in bioRxiv

ABSTRACT

Algorithms for active module identification (AMI) are central to analysis of omics data. Such algorithms receive a gene network and nodes’ activity scores as input and report sub-networks that show significant over-representation of accrued activity signal (‘active modules’), thus representing biological processes that presumably play key roles in the analyzed biological conditions. Although such methods exist for almost two decades, only a handful of studies attempted to compare the biological signals captured by different methods. Here, we systematically evaluated six popular AMI methods on gene expression (GE) and GWAS data. Notably, we observed that GO terms enriched in modules detected by these methods on the real data were often also enriched on modules found on randomly permuted input data. This indicated that AMI methods frequently report modules that are not specific to the biological context measured by the analyzed omics dataset. To tackle this bias, we designed a permutation-based method that evaluates the empirical significance of GO terms reported as enriched in modules. We used the method to fashion five novel performance criteria for evaluating AMI methods. Last, we developed DOMINO, a novel AMI algorithm, that outperformed the other six algorithms in extensive testing on GE and GWAS data. Software is available at https://github.com/Shamir-Lab.

PUBLICATION RECORD

Publication year
2020
Venue
bioRxiv
Publication date
2020-03-11
Fields of study
Biology, Computer Science
Identifiers
DOI 10.1101/2020.03.10.984963
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

A reference map of the human binary protein interactome
2020cited by this paper
Toward a gold standard for benchmarking gene set enrichment analysis
2020cited by this paper
NetMix: A Network-Structured Mixture Model for Reduced-Bias Estimation of Altered Subnetworks
2020cited by this paper
Assessment of network module identification across complex diseases
2019cited by this paper
Genetics of Common, Complex Coronary Artery Disease.
2019cited by this paper
Defining the Genetic, Genomic, Cellular, and Diagnostic Architectures of Psychiatric Disorders.
2019influential reference
Integrative Bioinformatics: History and Future
2019cited by this paper
The Gene Ontology Resource: 20 years and still GOing strong
2018cited by this paper
Network module identification-A widespread theoretical bias and best practices.
2018cited by this paper
CBFβ-SMMHC Inhibition Triggers Apoptosis by Disrupting MYC Chromatin Dynamics in Acute Myeloid Leukemia.
2018cited by this paper
Systematic Evaluation of Molecular Networks for Discovery of Disease Genes.
2018cited by this paper
The unfolded protein response regulator ATF6 promotes mesodermal differentiation
2018cited by this paper
Developing a network view of type 2 diabetes risk pathways through integration of genetic, genomic and functional data
2018cited by this paper
Quantifying the impact of public omics data
2018cited by this paper
Fine-mapping type 2 diabetes loci to single-variant resolution using high-density imputation and islet-specific epigenome maps
2018cited by this paper
Network and Pathway Analysis of Toxicogenomics Data
2018cited by this paper
Luminal lncRNAs Regulation by ERα-Controlled Enhancers in a Ligand-Independent Manner in Breast Cancer Cells
2018cited by this paper
Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations
2018cited by this paper
Patient-iPSC-Derived Kidney Organoids Show Functional Validation of a Ciliopathic Renal Phenotype and Reveal Underlying Pathogenetic Mechanisms.
2018cited by this paper
Engagement of DNA and H3K27me3 by the CBX8 chromodomain drives chromatin association
2018cited by this paper
Biobank-driven genomic discovery yields new insight into atrial fibrillation biology
2018cited by this paper
Integrating genetic and protein-protein interaction networks maps a functional wiring diagram of a cell.
2018cited by this paper
Network Analysis as a Grand Unifier in Biomedical Data Science
2018cited by this paper
Regulation of Cellular Senescence by Polycomb Chromatin Modifiers through Distinct DNA Damage-and Histone Methylation-Dependent Pathways
2018cited by this paper
Ror2 Signaling and Its Relevance in Breast Cancer Progression
2017cited by this paper
An Expanded View of Complex Traits: From Polygenic to Omnigenic.
2017cited by this paper
Comparison of statistical methods for subnetwork detection in the integration of gene expression and protein interaction network
2017cited by this paper
Association analysis identifies 65 new breast cancer risk loci
2017cited by this paper
Identification of 153 new loci associated with heel bone mineral density and functional involvement of GPC6 in osteoporosis
2017cited by this paper
Network propagation: a universal amplifier of genetic associations
2017cited by this paper
Association analyses based on false discovery rate implicate new loci for coronary artery disease
2017cited by this paper
Tissue-specific regulatory circuits reveal variable modular perturbations across complex diseases
2016cited by this paper
Genome-wide association study implicates immune activation of multiple integrin genes in inflammatory bowel disease
2016cited by this paper
Fast and Rigorous Computation of Gene and Pathway Scores from SNP-Based Summary Statistics
2016cited by this paper
Gene and Network Analysis of Common Variants Reveals Novel Associations in Multiple Complex Diseases
2016cited by this paper
AKAP 6 and MIR 2113 in cognitive decline 1 Title Page Association of AKAP 6 and MIR 2113 with cognitive performance in a population based sample of older adults
2016cited by this paper
The STRING database in 2017: quality-controlled protein–protein association networks, made broadly accessible
2016cited by this paper
RFX transcription factors are essential for hearing in mice
2015cited by this paper
A Fast , Adaptive Variant of the Goemans-Williamson Scheme for the Prize-Collecting Steiner Tree Problem
2015cited by this paper
Acute TNF-induced repression of cell identity genes is mediated by NFκB-directed redistribution of cofactors from super-enhancers
2015cited by this paper
Pathway and network analysis of cancer genomes
2015cited by this paper
MAGMA: Generalized Gene-Set Analysis of GWAS Data
2015cited by this paper
Network-Based Analysis of Schizophrenia Genome-Wide Association Data to Detect the Joint Functional Association Signals
2015cited by this paper
A large genome-wide association study of age-related macular degeneration highlights contributions of rare and common variants
2015cited by this paper
Pan-Cancer Network Analysis Identifies Combinations of Rare Somatic Mutations across Pathways and Protein Complexes
2014cited by this paper
Biological Insights From 108 Schizophrenia-Associated Genetic Loci
2014cited by this paper
Regulation of NF-κB by TNF family cytokines.
2014cited by this paper
Drug Target Prediction and Repositioning Using an Integrated Network-Based Approach
2013cited by this paper
Community Detection in Networks with Node Attributes
2013cited by this paper
Genotype to phenotype via network analysis.
2013cited by this paper
Integrative approaches for finding modular structure in biological networks
2013influential reference
Five years of GWAS discovery.
2012cited by this paper
Mammalian MAPK signal transduction pathways activated by stress and inflammation: a 10-year update.
2012cited by this paper
Efficient algorithms for extracting biological key pathways with global constraints
2012cited by this paper
International Cancer Genome Consortium Data Portal—a one-stop shop for cancer genomics data
2011cited by this paper
Underestimated Effect Sizes in GWAS: Fundamental Limitations of Single SNP Analysis for Dichotomous Phenotypes
2011cited by this paper
REVIGO Summarizes and Visualizes Long Lists of Gene Ontology Terms
2011influential reference
Cell Type–Specific Transcriptome Analysis Reveals a Major Role for Zeb1 and miR-200b in Mouse Inner Ear Morphogenesis
2011cited by this paper
Hundreds of variants clustered in genomic loci and biological pathways affect human height
2010cited by this paper
The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function
2010cited by this paper
Automated Network Analysis Identifies Core Pathways in Glioblastoma
2010influential reference
Biological, Clinical, and Population Relevance of 95 Loci for Blood Lipids
2010cited by this paper
Network medicine: a network-based approach to human disease
2010cited by this paper
Network Properties of Complex Human Disease Genes Identified through Genome-Wide Association Studies
2009cited by this paper
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data
2009cited by this paper
Protein networks in disease.
2008cited by this paper
Fast unfolding of communities in large networks
2008cited by this paper
Comprehensive genomic characterization defines human glioblastoma genes and core pathways
2008cited by this paper
From the Cover: Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles
2005cited by this paper
Network biology: understanding the cell's functional organization
2004cited by this paper
Maximizing the spread of influence through a social network
2003cited by this paper
Cytoscape: a software environment for integrated models of biomolecular interaction networks.
2003cited by this paper
Biological Networks: The Tinkerer as an Engineer
2003cited by this paper
Discovering regulatory and signalling circuits in molecular interaction networks
2002influential reference
DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions
2002cited by this paper
Semantic Similarity Measures as Tools for Exploring the Gene Ontology
2002cited by this paper
Community structure in social and biological networks
2001cited by this paper
Stress Signals Utilize Multiple Pathways To Stabilize p53
2000cited by this paper
The prize collecting Steiner tree problem: theory and practice
2000influential reference
Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language
1999cited by this paper
From molecular to modular cell biology
1999cited by this paper
Controlling the false discovery rate: a practical and powerful approach to multiple testing
1995influential reference
Bioinformatics Applications Note Systems Biology Bionet: an R-package for the Functional Analysis of Biological Networks
year unknowncited by this paper
Open Peer Review Invited Referee Responses
year unknowncited by this paper

CITED BY

Finding disease modules for cancer and COVID-19 in gene co-expression networks with the Core&Peel method
2020cites this paper