Algorithms for active module identification (AMI) are central to analysis of omics data. Such algorithms receive a gene network and nodes’ activity scores as input and report sub-networks that show significant over-representation of accrued activity signal (‘active modules’), thus representing biological processes that presumably play key roles in the analyzed biological conditions. Although such methods exist for almost two decades, only a handful of studies attempted to compare the biological signals captured by different methods. Here, we systematically evaluated six popular AMI methods on gene expression (GE) and GWAS data. Notably, we observed that GO terms enriched in modules detected by these methods on the real data were often also enriched on modules found on randomly permuted input data. This indicated that AMI methods frequently report modules that are not specific to the biological context measured by the analyzed omics dataset. To tackle this bias, we designed a permutation-based method that evaluates the empirical significance of GO terms reported as enriched in modules. We used the method to fashion five novel performance criteria for evaluating AMI methods. Last, we developed DOMINO, a novel AMI algorithm, that outperformed the other six algorithms in extensive testing on GE and GWAS data. Software is available at https://github.com/Shamir-Lab.
DOMINO: a novel algorithm for network-based identification of active modules with reduced rate of false calls
Hagai Levi,Ran Elkon,R. Shamir
Published 2020 in bioRxiv
ABSTRACT
PUBLICATION RECORD
- Publication year
2020
- Venue
bioRxiv
- Publication date
2020-03-11
- Fields of study
Biology, Computer Science
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-84 of 84 references · Page 1 of 1
CITED BY
Showing 1-1 of 1 citing papers · Page 1 of 1