Semi-Supervised Class Discovery

Published 2020 in arXiv.org

ABSTRACT

One promising approach to dealing with datapoints that are outside of the initial training distribution (OOD) is to create new classes that capture similarities in the datapoints previously rejected as uncategorizable. Systems that generate labels can be deployed against an arbitrary amount of data, discovering classification schemes that through training create a higher quality representation of data. We introduce the Dataset Reconstruction Accuracy, a new and important measure of the effectiveness of a model's ability to create labels. We introduce benchmarks against this Dataset Reconstruction metric. We apply a new heuristic, class learnability, for deciding whether a class is worthy of addition to the training dataset. We show that our class discovery system can be successfully applied to vision and language, and we demonstrate the value of semi-supervised learning in automatically discovering novel classes.

PUBLICATION RECORD

Publication year
2020
Venue
arXiv.org
Publication date
2020-02-10
Fields of study
Mathematics, Computer Science
Identifiers
arXiv 2002.03480
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

Semi-supervised learning demonstrates value in automatically discovering novel classes.
Confidence 0.88

뀨 (7c402c1b98) extractionimjlk (vdp8mqzes2) reviewAnonymous (12632b8b5f) review박진우 (dztg5apj7m) review
The class discovery system is successfully applied to both vision and language domains.
Confidence 0.90

뀨 (7c402c1b98) extractionimjlk (vdp8mqzes2) reviewAnonymous (12632b8b5f) review박진우 (dztg5apj7m) review
Class learnability is introduced as a heuristic for deciding whether a candidate class is worthy of addition to the training dataset.
Confidence 0.95

뀨 (7c402c1b98) extractionimjlk (vdp8mqzes2) reviewAnonymous (12632b8b5f) review박진우 (dztg5apj7m) review
Dataset Reconstruction Accuracy is introduced as a new measure of a model's effectiveness at creating labels for out-of-distribution data.
Confidence 0.95

뀨 (7c402c1b98) extractionimjlk (vdp8mqzes2) reviewAnonymous (12632b8b5f) review박진우 (dztg5apj7m) review

CONCEPTS

class discovery system
method, system

An automated system that generates labels and identifies novel classification schemes from unlabeled or rejected data.

뀨 (7c402c1b98) extractionimjlk (vdp8mqzes2) reviewAnonymous (12632b8b5f) review박진우 (dztg5apj7m) review
class learnability
heuristic, selection criterion

A heuristic introduced in this paper to decide whether a newly discovered class is suitable for addition to the training dataset.

뀨 (7c402c1b98) extractionimjlk (vdp8mqzes2) reviewAnonymous (12632b8b5f) review박진우 (dztg5apj7m) review
dataset reconstruction accuracy
evaluation metric

A metric introduced in this paper to measure how effectively a model creates labels that reconstruct the dataset.

Aliases: DRA

뀨 (7c402c1b98) extractionimjlk (vdp8mqzes2) reviewAnonymous (12632b8b5f) review박진우 (dztg5apj7m) review
out-of-distribution data
data characteristic, problem setting

Datapoints that fall outside the initial training distribution and cannot be categorized by an existing model.

Aliases: OOD data, OOD datapoints

뀨 (7c402c1b98) extractionimjlk (vdp8mqzes2) reviewAnonymous (12632b8b5f) review박진우 (dztg5apj7m) review
semi-supervised learning
method, learning paradigm

A learning paradigm that uses both labeled and unlabeled data, applied here to discover novel classes automatically.

뀨 (7c402c1b98) extractionimjlk (vdp8mqzes2) reviewAnonymous (12632b8b5f) review박진우 (dztg5apj7m) review

REFERENCES

5分で分かる!? 有名論文ナナメ読み：Jacob Devlin et al. : BERT : Pre-training of Deep Bidirectional Transformers for Language Understanding
2020cited by this paper
S4L: Self-Supervised Semi-Supervised Learning
2019cited by this paper
AI-GAs: AI-generating algorithms, an alternate paradigm for producing general artificial intelligence
2019cited by this paper
MixMatch: A Holistic Approach to Semi-Supervised Learning
2019cited by this paper
Language Models are Unsupervised Multitask Learners
2019cited by this paper
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
2019cited by this paper
Continual Unsupervised Representation Learning
2019cited by this paper
Can You Trust Your Model's Uncertainty? Evaluating Predictive Uncertainty Under Dataset Shift
2019cited by this paper
An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction
2019influential reference
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
2019cited by this paper
Revisiting Self-Supervised Visual Representation Learning
2019cited by this paper
A Review of the Research on Dialogue Management of Task-Oriented Systems
2019cited by this paper
Deep Clustering for Unsupervised Learning of Visual Features
2018cited by this paper
Co-teaching: Robust training of deep neural networks with extremely noisy labels
2018cited by this paper
Unseen Class Discovery in Open-world Classification
2018cited by this paper
Unsupervised Representation Learning by Predicting Image Rotations
2018cited by this paper
A Simple Unified Framework for Detecting Out-of-Distribution Samples and Adversarial Attacks
2018cited by this paper
Automatic Goal Generation for Reinforcement Learning Agents
2017cited by this paper
Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms
2017cited by this paper
Learning Discrete Representations via Information Maximizing Self-Augmented Training
2017cited by this paper
Enhancing The Reliability of Out-of-distribution Image Detection in Neural Networks
2017cited by this paper
Revisiting Unreasonable Effectiveness of Data in Deep Learning Era
2017cited by this paper
A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks
2016cited by this paper
Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach
2016cited by this paper
TensorFlow: A system for large-scale machine learning
2016cited by this paper
Understanding deep learning requires rethinking generalization
2016cited by this paper
Deep Metric Learning via Facility Location
2016cited by this paper
Recovering the number of clusters in data sets with noise features using feature rescaling factors
2015cited by this paper
Unsupervised Visual Representation Learning by Context Prediction
2015cited by this paper
Deep Residual Learning for Image Recognition
2015cited by this paper
Convolutional Neural Networks for Sentence Classification
2014influential reference
GloVe: Global Vectors for Word Representation
2014cited by this paper
Towards Open World Recognition
2014cited by this paper
Distributed Representations of Words and Phrases and their Compositionality
2013cited by this paper
Toward Open Set Recognition
2013cited by this paper
Scikit-learn: Machine Learning in Python
2011cited by this paper
PowerPlay: Training an Increasingly General Problem Solver by Continually Searching for the Simplest Still Unsolvable Problem
2011cited by this paper
Convolutional Deep Belief Networks on CIFAR-10
2010cited by this paper
Latent Dirichlet Allocation
2009cited by this paper
Simultaneous Class Discovery and Classification of Microarray Data Using Spectral Analysis
2009cited by this paper
Self-taught learning: transfer learning from unlabeled data
2007cited by this paper
The mnist database of handwritten digits
2005cited by this paper
Integrating constraints and metric learning in semi-supervised clustering
2004cited by this paper
Curious model-building control systems
1991cited by this paper
Journal of Machine Learning Research () Submitted; Published Distance Dependent Chinese Restaurant Processes
year unknowncited by this paper

CITED BY

Unsupervised Class Incremental Learning using Empty Classes
2024cites this paper
Open-World Class Discovery with Kernel Networks
2020influential citation