Clustering Based on Conditional Distributions in an Auxiliary Space

Published 2002 in Neural Computation

ABSTRACT

We study the problem of learning groups or categories that are local in the continuous primary space but homogeneous by the distributions of an associated auxiliary random variable over a discrete auxiliary space. Assuming that variation in the auxiliary space is meaningful, categories will emphasize similarly meaningful aspects of the primary space. From a data set consisting of pairs of primary and auxiliary items, the categories are learned by minimizing a Kullback-Leibler divergence-based distortion between (implicitly estimated) distributions of the auxiliary data, conditioned on the primary data. Still, the categories are defined in terms of the primary space. An online algorithm resembling the traditional Hebb-type competitive learning is introduced for learning the categories. Minimizing the distortion criterion turns out to be equivalent to maximizing the mutual information between the categories and the auxiliary data. In addition, connections to density estimation and to the distributional clustering paradigm are outlined. The method is demonstrated by clustering yeast gene expression data from DNA chips, with biological knowledge about the functional classes of the genes as the auxiliary data.

PUBLICATION RECORD

Publication year
2002
Venue
Neural Computation
Publication date
2002-01-01
Fields of study
Mathematics, Computer Science, Medicine
Identifiers
DOI 10.1162/089976602753284509 PMID 11747539
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar, PubMed

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Elements of Information Theory
2005cited by this paper
Unsupervised Recursive Sequence Processing
2003cited by this paper
Generalized relevance learning vector quantization
2002cited by this paper
Bankruptcy analysis with self-organizing maps in learning metrics
2001cited by this paper
Knowledge-based analysis of microarray gene expression data by using support vector machines.
2000cited by this paper
The information bottleneck method
2000influential reference
Flexible discriminant and mixture models
2000cited by this paper
Mutual Information in Learning Feature Transformations
2000influential reference
Deriving cluster analytic distance functions from Gaussian mixture models
1999cited by this paper
Where the abstract feature maps of the brain might come from.
1999cited by this paper
Learning the Similarity of Documents: An Information-Geometric Approach to Document Retrieval and Categorization
1999cited by this paper
Exploiting Generative Models in Discriminative Classifiers
1998cited by this paper
Learning from Dyadic Data
1998cited by this paper
Cluster analysis and display of genome-wide expression patterns.
1998influential reference
A methodology for information theoretic feature extraction
1998cited by this paper
Mutual information maximization: models of cortical self-organization.
1996influential reference
Self-Organizing Maps
1995cited by this paper
Winner-take-all networks for physiological models of competitive learning
1994cited by this paper
Physiological interpretationm of the self-organizing map algorithm
1993cited by this paper
Physiological interpretation of the Self-Organizing Map algorithm
1993cited by this paper
Self-organizing neural network that discovers surfaces in random-dot stereograms
1992influential reference
Maximum Likelihood Competitive Learning
1989cited by this paper
Self-organization and associative memory: 3rd edition
1989cited by this paper
Self-Organization and Associative Memory
1988cited by this paper
Vector quantization in speech coding
1985influential reference
Simplified neuron model as a principal component analyzer
1982cited by this paper
Asymptotically optimal block quantization
1979cited by this paper
Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper
1977influential reference
On the development of feature detectors in the visual cortex with applications to learning and reaction-diffusion systems
1976influential reference
A model of visuomotor mechanisms in the frog optic tectum
1976influential reference
Development of Specificity in the Cat Visual Cortex
1975cited by this paper
A theory for the development of feature detecting cells in visual cortex
1975cited by this paper
Statistics of Directional Data
1972cited by this paper
Some methods for classification and analysis of multivariate observations
1967cited by this paper
Cluster analysis of multivariate data : efficiency versus interpretability of classifications
1965cited by this paper

CITED BY

A Possible World-Based Fusion Estimation Model for Uncertain Data Clustering in WBNs
2021cites this paper
Penalized maximum likelihood estimator for mixture of von Mises–Fisher distributions
2020cites this paper
Learning a metric when clustering data points in the presence of constraints
2019cites this paper
Bregman Divergence Bounds and Universality Properties of the Logarithmic Loss
2018cites this paper
A Random Walk Approach to Query Informative Constraints for Clustering
2018cites this paper
A spatial correlation model for the horizontal non-isotropic ocean ambient noise vector field
2017cites this paper
Gaussian Lower Bound for the Information Bottleneck Limit
2017cites this paper
Semi-supervised Information Fusion for Clustering, Classification and Detection Applications
2017cites this paper
A statistical framework for online learning using adjustable model selection criteria
2016cites this paper
Mixture Models for Multidimensional Positive Data Clustering with Applications to Image Categorization and Retrieval
2015cites this paper
Advances in Analysis and Exploration in Medical Imaging
2014influential citation
Active selection of clustering constraints: a sequential approach
2014cites this paper
HMM-based hybrid meta-clustering ensemble for temporal data
2014cites this paper
Self-Supervised MRI Tissue Segmentation by Discriminative Clustering
2014cites this paper
Bayesian Estimation of the von-Mises Fisher Mixture Model with Variational Inference
2014cites this paper
CLUSTERING CONSTRAINED BY DEPENDENCIES
2013cites this paper
eu CLUSTERING CONSTRAINED BY DEPENDENCIES
2013cites this paper
Leveraging Subjective Human Annotation for Clustering Historic Newspaper Articles
2012cites this paper
Clustering at Presence of Side Information via Weighted Constraints Ratio Gap Maximization
2012cites this paper
Exploratory Data Analysis using Clusters and Stories
2012cites this paper
Computational intelligence in biomedical imaging: multidimensional analysis of spatio-temporal patterns
2011cites this paper
Probabilistic analysis of the human transcriptome with side information
2011cites this paper
Learning Parameters of the K-Means Algorithm From Subjective Human Annotation
2011cites this paper
Towards general semi-supervised clustering using a cognitive reinforcement K-Iteration fast learning artificial neural network (R-Kflann).
2010cites this paper
Recent Advances in Nonlinear Dimensionality Reduction, Manifold and Topological Learning
2010cites this paper
Neural Maps and Learning Vector Quantization - Theory and Applications
2009cites this paper
Schemas of Clustering
2009cites this paper
Modeling of mutual dependencies
2008cites this paper
Rough Text Assisting Text Mining: Focus on Document Clustering Validity
2008cites this paper
Data Compression - A Generic Principle of Pattern Recognition?
2008influential citation
On the Contribution of Compression to Visual Pattern Recognition
2008influential citation
Probabilistic approach to detecting dependencies between data sets
2008cites this paper
A Survey of Clustering with Instance Level Constraints 1 2
2007cites this paper
Fuzzy vector quantization with the particle swarm optimization: A study in fuzzy granulation-degranulation information processing
2007cites this paper
Validating module network learning algorithms using simulated data
2007cites this paper
Assessment of self-organizing map variants for clustering with application to redistribution of emotional speech patterns
2007cites this paper
Association Learning in SOMs for Fuzzy-Classification
2007cites this paper
METHODS FOR EXPLORING GENOMIC DATA SETS : APPLICATION TO HUMAN ENDOGENOUS RETROVIRUSES
2007cites this paper
Co-clustering by similarity refinement
2007cites this paper
A Kernel Approach for Semisupervised Metric Learning
2007cites this paper
Methods for exploring genomic data sets : application to human endogenous retroviruses
2007cites this paper
Prototype based machine learning for clinical proteomics
2006cites this paper
Clustering with kernel-based self-organized maps trained with supervised bias
2006cites this paper
Fuzzy Labeled Self-Organizing Map with Label-Adjusted Prototypes
2006cites this paper
Chapter 6 Dependency exploration and learning metrics
2006cites this paper
Relaxational metric adaptation and its application to semi-supervised clustering and content-based image retrieval
2006cites this paper
UNDERSTANDING COMPLEX DATASETS: DATA MINING WITH MATRIX DECOMPOSITIONS
2006cites this paper
Discovery of gene interactions in regulatory networks using genomic data mining and computational intelligence methods
2006cites this paper
Locally linear metric adaptation with application to semi-supervised clustering and image retrieval
2006cites this paper
On Clustering Validity Measures and the Rough Set Theory
2006cites this paper
Neural Networks for Clustering Analysis of Molecular Data
2006cites this paper
Fuzzy classification by fuzzy labeled neural gas
2006cites this paper
ANAKAΛYΨH TΩN (AITIΩ∆ΩN) ΣXEΣEΩN AΛΛHΛEΠI∆PAΣHΣ ΣTO ∆IKTYO PYΘMIΣHΣ ΓONI∆IΩN, ME XPHΣH ΠPOHΓMENΩN MEΘO∆ΩN TEXNHTHΣ NOHMOΣYNHΣ, BAΣIZOMENEΣ ΣTHN EΞOPYΞH ΠΛHPOΦOPIAΣ AΠO ∆E∆OMENA ΣYNOΛIKHΣ ΓONI∆IΩMATIKHΣ KΛIMAKOΣ
2006cites this paper
Information Bottleneck for Non Co-Occurrence Data
2006cites this paper
Gaussian Mixture Approach to Detect Drift
2006cites this paper
Discriminative components of data
2005cites this paper
Kernel-Based Metric Adaptation with Pairwise Constraints
2005cites this paper
Semi-supervised graph clustering: a kernel approach
2005cites this paper
Mutual Information Clustering for Efficient Mining of Fuzzy Association Rules with Application to Gene Expression Data Analysis
2005cites this paper
Semisupervised metric learning by kernel matrix adaptation
2005cites this paper
A Unified Probabilistic Framework for Clustering Correlated Heterogeneous Web Objects
2005cites this paper
Clustering on the Unit Hypersphere using von Mises-Fisher Distributions
2005cites this paper
Self-organizing neural networks for sequence processing
2005cites this paper
Associative clustering for exploring dependencies between functional genomics data sets
2005cites this paper
Merge SOM for temporal data
2005cites this paper
Semisupervised metric learning by kernel matrix adaptation
2005cites this paper
Discriminative clustering
2005influential citation
Non-parametric dependent components
2005cites this paper
Classification using non-standard metrics
2005cites this paper
From learning metrics towards dependency exploration
2005cites this paper
Clustering and prototype based classification
2005influential citation
Locally Linear Metric Adaptation with Application to Image Retrieval
2005cites this paper
A Common Lisp Application to Discover Kripke Models : Redescribing Biological Processes from Time-Course Data ∗
2005cites this paper
Discriminative Clustering of Yeast Stress Response
2005cites this paper
Semi-supervised clustering: probabilistic models, algorithms and experiments
2005cites this paper
The Minimum Information Principlein Learning and Neural DataAnalysis
2005cites this paper
Improved learning of Riemannian metrics for exploratory analysis [Neural Networks 17 (8–9) 1087–1100]
2005cites this paper
Data exploration with learning metrics
2004cites this paper
Self-organizing maps and clustering methods for matrix data
2004cites this paper
A general framework for unsupervised processing of structured data
2004cites this paper
Semisupervised Clustering for Intelligent User Management
2004cites this paper
Recursive self-organizing network models
2004cites this paper
Extracting Relevant Structures
2004cites this paper
An Analysis of Model-based Clustering , Competitive Learning , and Information Bottleneck
2004influential citation
Self-organizing context learning
2004cites this paper
Frequency Sensitive Competitive Learning for Balanced Clustering on High-dimensional Hyperspheres
2004cites this paper
From insights to innovations : data mining, visualization, and user interfaces
2004cites this paper
Associative Clustering
2004cites this paper
Theory and applications of neural maps
2004cites this paper
Improved learning of Riemannian metrics for exploratory analysis
2004influential citation
Margin-based active learning and background knowledge in text mining
2004cites this paper
Growing kernel-based self-organized maps trained with supervised bias
2004cites this paper
Learning metrics
2004cites this paper
A general framework for unsupervisedprocessing of structuredd ata
2004cites this paper
Frequency-sensitive competitive learning for scalable balanced clustering on high-dimensional hyperspheres
2004cites this paper
Locally linear metric adaptation for semi-supervised clustering
2004cites this paper
Kernel-based Self-organized Maps Trained with Supervised Bias for Gene Expression Data Analysis
2004cites this paper
Principle of Learning Metrics for Exploratory Data Analysis
2004influential citation
Some Research Problems in Metric Learning and Manifold Learning
2004cites this paper
Semi-supervised Clustering: Learning with Limited User Feedback
2004cites this paper