Concept Matching for Low-Resource Classification
Federico Errica, Ludovic Denoyer, Bora Edizel, F. Petroni, Vassilis Plachouras, F. Silvestri, Sebastian Riedel
Published 2020 in the IEEE International Joint Conference on Neural Networks
ABSTRACT
In many applications that rely on machine learning, the availability of labelled data is of primary importance. When tackling new tasks, however, labels are usually missing and must be collected from scratch. In this work, we address the problem of learning classifiers when labels are very scarce. We do so by learning multiple vectors, called prototypes, that represent semantic concepts relevant to the task at hand. We propose a theoretically inspired mechanism that computes the probability of a match between each prototype and an input element, and we combine these probabilities to increase the expressiveness of the classifier. Moreover, by leveraging low-cost extra annotations in the training data, a simple error-boosting technique guides the learning process and yields substantial performance improvements. Empirical results confirm the benefits of the proposed approach on both balanced and unbalanced datasets. Our methodology is thus of practical use when gathering and labelling new examples is more expensive than annotating what we already have.
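As a rough illustration of the prototype-matching idea described in the abstract, the sketch below builds a toy classifier that scores an input against a set of learnable prototype vectors and combines the resulting matching probabilities. All names, dimensions, and the sigmoid-of-cosine-similarity matching function are assumptions for illustration; the paper's actual mechanism and training procedure are not reproduced here.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical dimensions (not taken from the paper)
d, n_protos, n_classes = 16, 8, 3

# Prototypes stand in for learnable concept vectors; W combines
# their matching probabilities into class scores.
prototypes = rng.normal(size=(n_protos, d))
W = rng.normal(size=(n_classes, n_protos))

def matching_probabilities(x, prototypes):
    """Probability that input x 'matches' each prototype.

    Modelled here as a sigmoid of cosine similarity; this is an
    assumption, not necessarily the paper's matching mechanism.
    """
    sims = prototypes @ x / (np.linalg.norm(prototypes, axis=1) * np.linalg.norm(x))
    return 1.0 / (1.0 + np.exp(-sims))

def classify(x):
    # Combine per-prototype matching probabilities into class logits.
    p = matching_probabilities(x, prototypes)
    logits = W @ p
    return int(np.argmax(logits))

x = rng.normal(size=d)
print(classify(x))  # prints a class index in {0, 1, 2}
```

In a real low-resource setting both `prototypes` and `W` would be trained jointly from the few available labels, which is where the paper's error-boosting signal would enter.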
PUBLICATION RECORD
- Publication year
2020
- Venue
IEEE International Joint Conference on Neural Networks
- Publication date
2020-06-01
- Fields of study
Mathematics, Computer Science
- Source metadata
Semantic Scholar