Exploring Gene Expression Data with Class Scores

P. Pavlidis,Darrin P. Lewis,William Stafford Noble

Published 2001 in Pacific Symposium on Biocomputing

ABSTRACT

We address a commonly asked question about gene expression data sets: "What functional classes of genes are most interesting in the data?" In the methods we present, expression data is partitioned into classes based on existing annotation schemes. Each class is then given three separately derived "interest" scores. The first score is based on an assessment of the statistical significance of gene expression changes experienced by members of the class, in the context of the experimental design. The second is based on the co-expression of genes in the class. The third score is based on the learnability of the classification. We show that all three methods reveal significant classes in each of three different gene expression data sets. Many classes are identified by one method but not the others, indicating that the methods are complementary. The classes identified are in many cases of clear relevance to the experiment. Our results suggest that these class scoring methods are useful tools for exploring gene expression data.

PUBLICATION RECORD

Publication year
2001
Venue
Pacific Symposium on Biocomputing
Publication date
2001-12-01
Fields of study
Biology, Medicine, Computer Science
Identifiers
DOI 10.1142/9789812799623_0044 PMID 11928500
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar, PubMed

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Proceedings of the Pacific Symposium on Biocomputing '96. Hawaii, USA, 3-6 January 1996.
1996cited by this paper
FEBS Letters
1987cited by this paper

CITED BY

Transcriptional Dynamics and Key Regulators of Adipogenesis in Mouse Embryonic Stem Cells: Insights from Robust Rank Aggregation Analysis
2024cites this paper
Insights into gemcitabine resistance in pancreatic cancer: association with metabolic reprogramming and TP53 pathogenicity in patient derived xenografts
2024cites this paper
Resolving missing protein problems using functional class scoring
2022cites this paper
Network-based methods for gene function prediction.
2021cites this paper
Efficient gene set analysis of high-throughput data: From omics to pathway architecture of health and disease
2020cites this paper
SITC cancer immunotherapy resource document: a compass in the land of biomarker discovery
2020cites this paper
The interpretation of gene coexpression in systems biology : [supplementary material]
2019cites this paper
Advanced bioinformatics methods for practical applications in proteomics
2019cites this paper
Gene Set Analysis: As Applied to Public Health and Biomedical Studies
2017cites this paper
Advancing Clinical Proteomics via Analysis Based on Biological Complexes: A Tale of Five Paradigms.
2016cites this paper
H response to treatment with the anticarcinogen 3 Identification of novel transcriptional networks in
2016influential citation
Using predictive specificity to determine when gene set analysis is biologically meaningful
2016cites this paper
Design principles for clinical network-based proteomics.
2016cites this paper
Using predictive specificity to determine when gene set analysis is biologically meaningful
2016cites this paper
MIRA: mutual information-based reporter algorithm for metabolic networks
2015cites this paper
Hierarchy in gene expression is predictive of risk, progression, and outcome in adult acute myeloid leukemia
2015cites this paper
Advances in network-based metabolic pathway analysis and gene expression data integration
2015cites this paper
Pathway-Based Analysis of Genome-Wide siRNA Screens Reveals the Regulatory Landscape of App Processing
2015cites this paper
Correction: Pathway-Based Analysis of Genome-Wide siRNA Screens Reveals the Regulatory Landscape of App Processing
2015cites this paper
MIRA: mutual information-based reporter algorithm for metabolic networks
2014cites this paper
Contemporary Network Proteomics and Its Requirements
2013cites this paper
Networks and multivariate statistics as applied to biological datasets and wine-related omics
2013cites this paper
Enhancing the utility of Proteomics Signature Profiling (PSP) with Pathway Derived Subnets (PDSs), performance analysis and specialised ontologies
2013cites this paper
Hierarchy of gene expression data is predictive of future breast cancer outcome
2013cites this paper
Gene set analysis methods: statistical models and methodological differences
2013cites this paper
Computational proteomics using network-based strategies
2013cites this paper
Gene batteries and synexpression groups applied in a multivariate statistical approach to dose-response analysis of toxicogenomic data.
2013cites this paper
Comparative Network-Based Recovery Analysis and Proteomic Profiling of Neurological Changes in Valproic Acid-Treated Mice
2013cites this paper
Influence of Quercetin-Rich Food Intake on microRNA Expression in Lung Cancer Tissues
2012cites this paper
A pathway analysis method for genome‐wide association studies
2012cites this paper
GSA-PCA: gene set generation by principal component analysis of the Laplacian matrix of a metabolic network
2012cites this paper
Ten Years of Pathway Analysis: Current Approaches and Outstanding Challenges
2012cites this paper
Data driven linear algebraic methods for analysis of molecular pathways : application to disease progression in shock / trauma
2012cites this paper
Bayesian gene set analysis for identifying significant biological pathways
2011cites this paper
Potential biological insights revealed by an integrated assessment of proteomic and transcriptomic data in human colorectal cancer.
2011cites this paper
Molecular profiling reveals frequent gain of MYCN and anaplasia‐specific loss of 4q and 14q in wilms tumor
2011influential citation
Functional Pathway Analysis for Understanding Immunologic Signature of Rejection: Current Approaches and Outstanding Challenges
2011cites this paper
Pathway Analysis in Drug Discovery
2011cites this paper
Contribution to Statistical Techniques for Identifying Differentially Expressed Genes in Microarray Data
2011cites this paper
The limitations of simple gene set enrichment analysis assuming gene independence
2011cites this paper
Pathway-based modeling and diagnosis of cancer development and progression
2010cites this paper
Data mining for discovery of clinical and genomic disease markers
2010cites this paper
Large-scale inference
2010influential citation
Bayesian Gene Set Analysis
2010cites this paper
A nonparametric approach for relevance determination
2010cites this paper
Bayesian Nonparametric Variable Selection as an Exploratory Tool for Finding Genes that Matter
2010cites this paper
Integrative Biomarker Discovery for Breast Cancer Metastasis from Gene Expression and Protein Interaction Data Using Error-tolerant Pattern Mining
2010cites this paper
Identification of functional modules that correlate with phenotypic difference: the influence of network topology
2010cites this paper
Investigation of low-dose ritonavir on human peripheral blood mononuclear cells using gene expression whole genome microarrays.
2010cites this paper
The evolving role of mass spectrometry in cancer biomarker discovery
2009cites this paper
Immune profile and mitotic index of metastatic melanoma lesions enhance clinical staging in predicting patient survival
2009cites this paper
Gene set enrichment analysis made simple
2009cites this paper
Optimization of cDNA microarray image analysis methods
2009cites this paper
Seeking unique and common biological themes in multiple gene lists or datasets: pathway pattern extraction pipeline for pathway-level comparative analysis
2009influential citation
Prior biological knowledge-based approaches for the analysis of genome-wide expression profiles using gene sets and pathways
2009cites this paper
Robust extraction of functional signals from gene set analysis using a generalized threshold free scoring function
2009cites this paper
Getting Started in Gene Expression Microarray Analysis
2009cites this paper
Analysis of Gene Expression in Parkinson's Disease: Possible Involvement of Neurotrophic Support and Axon Guidance in Dopaminergic Cell Death
2009cites this paper
Nicotinic Acetylcholine Receptors and Modulation of Learning in 4- and 27-Month-Old Rabbits
2008cites this paper
Gene Set Expression Comparison kit for BRB-ArrayTools
2008cites this paper
Implication du BDNF dans l'étiopathogenèse et le traitement des troubles anxio-dépressifs : aspects précliniques.
2008cites this paper
SLEPR: A Sample-Level Enrichment-Based Pathway Ranking Method — Seeking Biological Themes through Pathway-Level Consistency
2008cites this paper
Abnormal Indices of Cell Cycle Activity in Schizophrenia and their Potential Association with Oligodendrocytes
2008cites this paper
Tools for Interpreting Large-scale Protein Profiling in Microbiology
2008cites this paper
Dissection of transcriptional regulation networks and prediction of gene functions in Saccharomyces cerevisiae
2008cites this paper
Large-scale estimates of cellular origins of mRNAs: enhancing the yield of transcriptome analyses.
2008influential citation
Inferring Pathway Activity toward Precise Disease Classification
2008cites this paper
Gene-set approach for expression pattern analysis
2008cites this paper
IGF axis gene expression patterns are prognostic of survival in epithelial ovarian cancer.
2007cites this paper
Functional genomic analysis reveals cross-talk between peroxisome proliferator-activated receptor gamma and calcium signaling in human colorectal cancer cells.
2007cites this paper
Variations in oligodendrocyte-related gene expression across multiple cortical regions: implications for the pathophysiology of schizophrenia.
2007cites this paper
Gene expression Annotation-based distance measures for patient subgroup discovery in clinical microarray studies
2007cites this paper
Fun&Co: identification of key functional differences in transcriptomes
2007cites this paper
Prediction of Co-Regulated Gene Groups through Gene Ontology
2007cites this paper
Inferring biological functions and associated transcriptional regulators using gene set expression coherence analysis
2007cites this paper
Lack of serotonin1B receptor expression leads to age-related motor dysfunction, early onset of brain molecular aging and reduced longevity
2007cites this paper
Transcriptome analysis of cold syndrome using microarray.
2007cites this paper
Probabilistic path ranking based on adjacent pairwise coexpression for metabolic transcripts analysis
2007cites this paper
The Maize Zmsmu2 Gene Encodes a Putative RNA-Splicing Factor That Affects Protein Synthesis and RNA Processing during Endosperm Development1[W][OA]
2007cites this paper
Interpretation of gene expression microarray experiments
2007cites this paper
Activation of MAPK pathways links LMNA mutations to cardiomyopathy in Emery-Dreifuss muscular dystrophy.
2007cites this paper
From Gene Expression to Metabolic Fluxes
2007cites this paper
Annotation-based Distance Measures for Patient Subgroup Discovery in Clinical Microarray Studies
2007cites this paper
Network-based classification of breast cancer metastasis
2007cites this paper
7 Ontologies and functional genomics
2006cites this paper
Identification of novel transcriptional networks in response to treatment with the anticarcinogen 3H-1,2-dithiole-3-thione.
2006influential citation
Model-Based Inference of Transcriptional Regulatory Mechanisms from DNA Microarray Data
2006cites this paper
Contextual Analysis of Gene Expression Data
2006cites this paper
Development and application of methods for the analysis of microarray gene expression data
2006cites this paper
A Regression-based K nearest neighbor algorithm for gene function prediction from heterogeneous data
2006cites this paper
The Metabolic Response of Heterotrophic Arabidopsis Cells to Oxidative Stress1[W]
2006influential citation
Genome Wide Gene Expression Studies in Mood Disorders
2006cites this paper
Peeling Off the Hidden Genetic Heterogeneities of Cancers Based on Disease-Relevant Functional Modules
2006cites this paper
Expectation-maximization algorithms for fuzzy assignment of genes to cellular pathways.
2006cites this paper
C. elegansRecommender Algorithm to Identify Coexpressed Genes in
2006cites this paper
On testing the significance of sets of genes
2006cites this paper
Role of SMU Homologues in Pre-mRNA Splicing During Maize and Arabidopsis Development
2006cites this paper
ADGO: analysis of differentially expressed gene sets using composite GO annotation
2006cites this paper
Molecular decomposition of complex clinical phenotypes using biologically structured analysis of microarray data
2005cites this paper
Expression dynamics of a cellular metabolic network
2005cites this paper