Supervising Unsupervised Learning

Published 2017 in Neural Information Processing Systems

ABSTRACT

We introduce a framework to transfer knowledge acquired from a repository of (heterogeneous) supervised datasets to new unsupervised datasets. Our perspective avoids the subjectivity inherent in unsupervised learning by reducing it to supervised learning, and provides a principled way to evaluate unsupervised algorithms. We demonstrate the versatility of our framework via simple agnostic bounds on unsupervised problems. In the context of clustering, our approach helps choose the number of clusters and the clustering algorithm, remove outliers, and provably circumvent Kleinberg's impossibility result. Experimental results across hundreds of problems demonstrate improved performance on unsupervised data with simple algorithms, despite the fact that our problems come from heterogeneous domains. Additionally, our framework lets us leverage deep networks to learn common features from many such small datasets, and perform zero-shot learning.
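
The reduction described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the two toy clusterers, the one-dimensional datasets, and the names `split_at_mean`, `split_at_largest_gap`, and `select_algorithm` are all assumptions made for illustration. It only shows the general idea of scoring each candidate clustering algorithm against a repository of labeled datasets (here with the plain Rand index) and reusing the best-scoring one on new unlabeled data.

```python
from itertools import combinations


def rand_index(labels_a, labels_b):
    """Fraction of point pairs on which two partitions agree (Rand index)."""
    pairs = list(combinations(range(len(labels_a)), 2))
    agree = sum(
        (labels_a[i] == labels_a[j]) == (labels_b[i] == labels_b[j])
        for i, j in pairs
    )
    return agree / len(pairs)


def split_at_mean(xs):
    """Hypothetical toy clusterer: two clusters, split at the mean value."""
    mean = sum(xs) / len(xs)
    return [int(x > mean) for x in xs]


def split_at_largest_gap(xs):
    """Hypothetical toy clusterer: split at the widest gap between sorted values."""
    s = sorted(xs)
    # Each entry is (gap width, value just below the gap).
    gaps = [(s[i + 1] - s[i], s[i]) for i in range(len(s) - 1)]
    _, cut = max(gaps)
    return [int(x > cut) for x in xs]


def select_algorithm(candidates, repository):
    """Return the candidate with the highest mean Rand index over the
    labeled repository, a list of (points, ground_truth_labels) pairs."""
    def mean_score(algo):
        scores = [rand_index(algo(xs), ys) for xs, ys in repository]
        return sum(scores) / len(scores)
    return max(candidates, key=mean_score)


# Assumed toy repository of labeled 1-D datasets (illustrative data only).
repository = [
    ([0, 1, 5, 6, 7, 8, 9, 10, 11], [0, 0, 1, 1, 1, 1, 1, 1, 1]),
    ([0, 1, 2, 10, 11, 12], [0, 0, 0, 1, 1, 1]),
]

chosen = select_algorithm([split_at_mean, split_at_largest_gap], repository)

# Apply the selected algorithm to a new dataset that has no labels.
new_points = [3.0, 3.5, 4.0, 4.2, 12.0]
new_clusters = chosen(new_points)
```

On this toy repository the gap-based clusterer reproduces both ground-truth partitions while the mean-based one splits the larger cluster of the first dataset, so the gap-based clusterer is selected. The same selection loop could in principle range over values of the number of clusters instead of over algorithms.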

PUBLICATION RECORD

Publication year
2017
Venue
Neural Information Processing Systems
Publication date
2017-09-14
Fields of study
Mathematics, Computer Science
Identifiers
arXiv 1709.05262
Source metadata
Semantic Scholar

REFERENCES

Learning how to learn
2019, cited by this paper
Probabilistic Matrix Factorization for Automated Machine Learning
2017, cited by this paper
ProtoNN: Compressed and Accurate kNN for Resource-scarce Devices
2017, cited by this paper
ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression
2017, cited by this paper
Learning Disentangled Representations with Semi-Supervised Deep Generative Models
2017, cited by this paper
Learning-Theoretic Foundations of Algorithm Configuration for Combinatorial Partitioning Problems
2016, cited by this paper
Domain Separation Networks
2016, cited by this paper
Unsupervised Domain Adaptation with Residual Transfer Networks
2016, cited by this paper
Efficient and Robust Automated Machine Learning
2015, cited by this paper
Deep Compression: Compressing Deep Neural Network with Pruning, Trained Quantization and Huffman Coding
2015, cited by this paper
Compressing Neural Networks with the Hashing Trick
2015, cited by this paper
Semi-supervised Learning with Deep Generative Models
2014, cited by this paper
Efficient Representations for Lifelong Learning and Autoencoding
2014, cited by this paper
ImageNet Large Scale Visual Recognition Challenge
2014, cited by this paper
Unsupervised Domain Adaptation by Backpropagation
2014, cited by this paper
Transfer Learning in a Transductive Setting
2013, cited by this paper
ADADELTA: An Adaptive Learning Rate Method
2012, influential reference
Auto-WEKA: combined selection and hyperparameter optimization of classification algorithms
2012, cited by this paper
Practical Bayesian Optimization of Machine Learning Algorithms
2012, cited by this paper
Scikit-learn: Machine Learning in Python
2011, cited by this paper
A Survey on Transfer Learning
2010, cited by this paper
A Uniqueness Theorem for Clustering
2009, cited by this paper
Measures of Clustering Quality: A Working Set of Axioms for Clustering
2008, cited by this paper
Penalized and weighted K-means for clustering with scattered objects and prior information in high-throughput biological data
2007, cited by this paper
Toward efficient agnostic learning
2004, cited by this paper
An Impossibility Theorem for Clustering
2002, influential reference
The Biology and Technology of Intelligent Autonomous Agents
1995, cited by this paper
Lifelong robot learning
1993, cited by this paper
Silhouettes: a graphical aid to the interpretation and validation of cluster analysis
1987, cited by this paper
Comparing partitions
1985, cited by this paper

CITED BY

Machine Learning Algorithms for Predictive Maintenance: A Systematic Literature Mapping
2025, cites this paper
Weakly Supervised Active Learning for Abstract Screening Leveraging LLM-Based Pseudo-Labeling
2025, cites this paper
ReMlX: Resilience for ML Ensembles using XAI at Inference against Faulty Training Data
2025, cites this paper
Online Performance Estimation with Unlabeled Data: A Bayesian Application of the Hui-Walter Paradigm
2024, cites this paper
Is Unsupervised Clustering Somehow Truer?
2024, cites this paper
From Latent to Engine Manifolds: Analyzing ImageBind's Multimodal Embedding Space
2024, cites this paper
Harnessing Explainability to Improve ML Ensemble Resilience
2024, cites this paper
A 3D-CAE-CNN model for Deep Representation Learning of 3D images
2022, cites this paper
Unsupervised Anomaly Detection in Time-series: An Extensive Evaluation and Analysis of State-of-the-art Methods
2022, cites this paper
When to Use What: An In-Depth Comparative Empirical Analysis of OpenIE Systems for Downstream Applications
2022, cites this paper
Explainable AI for RAMS
2022, cites this paper
Structural Analysis of Branch-and-Cut and the Learnability of Gomory Mixed Integer Cuts
2022, cites this paper
Review on the Application of Metalearning in Artificial Intelligence
2021, cites this paper
How to Train Your MAML to Excel in Few-Shot Classification
2021, cites this paper
How much data is sufficient to learn high-performing algorithms? Generalization guarantees for data-driven algorithm design
2021, cites this paper
Discovering and Interpreting Biased Concepts in Online Communities
2020, cites this paper
Meta-K: Towards Self-Supervised Prediction of Number of Clusters
2020, influential citation
Generalization in portfolio-based algorithm selection
2020, cites this paper
Discovering and Interpreting Conceptual Biases in Online Communities
2020, cites this paper
LiDAM: Semi-Supervised Learning with Localized Domain Adaptation and Iterative Matching
2020, cites this paper
A Comprehensive Overview and Survey of Recent Advances in Meta-Learning
2020, cites this paper
Meta-Learning in Neural Networks: A Survey
2020, cites this paper
Revisiting Meta-Learning as Supervised Learning
2020, cites this paper
New Approach based on Machine Learning for Short-Term Mortality Prediction in Neonatal Intensive Care Unit
2019, cites this paper
Meta-Learning to Cluster
2019, influential citation
Few-shot learning with adaptively initialized task optimizer: a practical meta-learning approach
2019, cites this paper
A Meta Understanding of Meta-Learning
2019, cites this paper
Private selection from private candidates
2018, cites this paper
Unsupervised Learning via Meta-Learning
2018, cites this paper
Meta-Learning Update Rules for Unsupervised Representation Learning
2018, cites this paper
Learning Unsupervised Learning Rules
2018, cites this paper