An efficient framework for learning sentence representations

Published 2018 in International Conference on Learning Representations

ABSTRACT

In this work we propose a simple and efficient framework for learning sentence representations from unlabelled data. Drawing inspiration from the distributional hypothesis and recent work on learning sentence representations, we reformulate the problem of predicting the context in which a sentence appears as a classification problem. Given a sentence and its context, a classifier distinguishes context sentences from other contrastive sentences based on their vector representations. This allows us to efficiently learn different types of encoding functions, and we show that the model learns high-quality sentence representations. We demonstrate that our sentence representations outperform state-of-the-art unsupervised and supervised representation learning methods on several downstream NLP tasks that involve understanding sentence semantics while achieving an order of magnitude speedup in training time.

PUBLICATION RECORD

Publication year
2018
Venue
International Conference on Learning Representations
Publication date
2018-02-15
Fields of study
Computer Science
Identifiers
arXiv 1803.02893
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Supervised Learning of Universal Sentence Representations from Natural Language Inference Data
2017influential reference
A Simple but Tough-to-Beat Baseline for Sentence Embeddings
2017influential reference
Revisiting Recurrent Networks for Paraphrastic Sentence Embeddings
2017cited by this paper
Learning Paraphrastic Sentence Embeddings from Back-Translated Bitext
2017cited by this paper
Generating Sentences by Editing Prototypes
2017cited by this paper
Curiosity-Driven Exploration by Self-Supervised Prediction
2017cited by this paper
Discourse-Based Objectives for Fast Unsupervised Sentence Representation Learning
2017cited by this paper
Unsupervised Learning of Sentence Representations using Convolutional Neural Networks
2016influential reference
Context Encoders: Feature Learning by Inpainting
2016cited by this paper
Learning Distributed Representations of Sentences from Unlabelled Data
2016influential reference
Learning Generic Sentence Representations Using Convolutional Neural Networks
2016influential reference
Fine-grained Analysis of Sentence Embeddings Using Auxiliary Prediction Tasks
2016cited by this paper
Unsupervised Learning of Visual Representations by Solving Jigsaw Puzzles
2016cited by this paper
Siamese CBOW: Optimizing Word Embeddings for Sentence Representations
2016cited by this paper
Autoencoding beyond pixels using a learned similarity metric
2015cited by this paper
Unsupervised Visual Representation Learning by Context Prediction
2015cited by this paper
Skip-Thought Vectors
2015influential reference
Neural Machine Translation of Rare Words with Subword Units
2015cited by this paper
Self-Adaptive Hierarchical Sentence Model
2015cited by this paper
Order-Embeddings of Images and Language
2015cited by this paper
Generating Sentences from a Continuous Space
2015cited by this paper
Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks
2015cited by this paper
Associating neural word embeddings with deep image representations using Fisher Vectors
2015cited by this paper
Towards Universal Paraphrastic Sentence Embeddings
2015cited by this paper
Gated Feedback Recurrent Neural Networks
2015influential reference
Distributed Representations of Sentences and Documents
2014cited by this paper
A SICK cure for the evaluation of compositional distributional semantic models
2014cited by this paper
GloVe: Global Vectors for Word Representation
2014cited by this paper
On Using Very Large Target Vocabulary for Neural Machine Translation
2014cited by this paper
Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)
2014cited by this paper
Deep visual-semantic alignments for generating image descriptions
2014cited by this paper
A Model of Coherence Based on Distributed Sentence Representation
2014cited by this paper
Convolutional Neural Networks for Sentence Classification
2014influential reference
SemEval-2014 Task 10: Multilingual Semantic Textual Similarity
2014cited by this paper
Microsoft COCO: Common Objects in Context
2014cited by this paper
UMBC_EBIQUITY-CORE: Semantic Textual Similarity Systems
2013cited by this paper
Distributed Representations of Words and Phrases and their Compositionality
2013cited by this paper
Linguistic Regularities in Continuous Space Word Representations
2013cited by this paper
Efficient Estimation of Word Representations in Vector Space
2013cited by this paper
Multilingual Distributed Representations without Word Alignment
2013cited by this paper
Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank
2013cited by this paper
Discriminative Improvements to Distributional Sentence Similarity
2013cited by this paper
Dynamic Pooling and Unfolding Recursive Autoencoders for Paraphrase Detection
2011cited by this paper
A Scalable Hierarchical Distributed Language Model
2008cited by this paper
Seeing Stars: Exploiting Class Relationships for Sentiment Categorization with Respect to Rating Scales
2005cited by this paper
Annotating Expressions of Opinions and Emotions in Language
2005influential reference
A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts
2004cited by this paper
Overview of the TREC 2003 Question Answering Track
2004cited by this paper
Unsupervised Construction of Large Paraphrase Corpora: Exploiting Massively Parallel News Sources
2004cited by this paper
Mining and summarizing customer reviews
2004cited by this paper

CITED BY

Revisiting Theory of Contrastive Learning for Domain Generalization
2025influential citation
Aligning Multimodal Representations through an Information Bottleneck
2025cites this paper
Disentanglement of Variations with Multimodal Generative Modeling
2025cites this paper
MARCEL: Multifaceted SpAtial-TempoRal ContrastivE Learning for Generic Spatial-Temporal Representations
2025cites this paper
Set-Theoretic Compositionality of Sentence Embeddings
2025cites this paper
Predicting learning performance using NLP: an exploratory study using two semantic textual similarity methods
2025cites this paper
Fine-Grained Alignment Network for Zero-Shot Cross-Modal Retrieval
2025cites this paper
Cropping outperforms dropout as an augmentation strategy for training self-supervised text embeddings
2025cites this paper
Modeling Language as a Sequence of Thoughts
2025cites this paper
FANoise: Singular Value-Adaptive Noise Modulation for Robust Multimodal Representation Learning
2025cites this paper
Know Yourself and Know Your Neighbour : A Syntactically Informed Self-Supervised Compositional Sentence Representation Learning Framework using a Recursive Hypernetwork
2025cites this paper
Foundation models for geospatial reasoning: assessing the capabilities of large language models in understanding geometries and topological spatial relations
2025cites this paper
Towards Efficient Contrastive PAC Learning
2025cites this paper
Category-guided multi-interest collaborative metric learning with representation uniformity constraints
2025cites this paper
FineXtrol: Controllable Motion Generation via Fine-Grained Text
2025cites this paper
Refining Sentence Embedding Model through Ranking Sentences Generation with Large Language Models
2025cites this paper
A Mutual Information Perspective on Knowledge Graph Embedding
2025cites this paper
Latent Reasoning via Sentence Embedding Prediction
2025cites this paper
Layer-wise contrastive learning BERT for sentence representation of GitHub
2025cites this paper
Cross-modal Counterfactual Explanations: Uncovering Decision Factors and Dataset Biases in Subjective Classification
2025cites this paper
Leaky Diffusion: Attribute Leakage in Text-Guided Image Generation
2025cites this paper
Element2Vec: Build Chemical Element Representation from Text for Property Prediction
2025cites this paper
Sentence Embedding Using Supervised Contrastive Learning on Hierarchical Categories
2025cites this paper
Challenging Assumptions in Learning Generic Text Style Embeddings
2025cites this paper
A social information sensitive model for conversational recommender systems
2025cites this paper
Less Mature is More Adaptable for Sentence-level Language Modeling
2025cites this paper
Similarity-Agnostic Contrastive Learning With Alterable Self-Supervision
2025cites this paper
Enriching Knowledge Distillation with Intra-Class Contrastive Learning
2025cites this paper
Development of Classification Method for Lecturer Area of Expertise Based on Scientific Publication Using BERT
2024cites this paper
Efficient Sentence Representation Learning via Knowledge Distillation with Maximum Coding Rate Reduction
2024cites this paper
Chinese Spelling Correction Based on Knowledge Enhancement and Contrastive Learning
2024cites this paper
Contrastive Unsupervised Representation Learning With Optimize-Selected Training Samples
2024cites this paper
Heterogeneous Contrastive Learning for Foundation Models and Beyond
2024cites this paper
Adaptive Reinforcement Tuning Language Models as Hard Data Generators for Sentence Representation
2024cites this paper
Embedding Dimension of Contrastive Learning and k-Nearest Neighbors
2024cites this paper
ReGCL: Rethinking Message Passing in Graph Contrastive Learning
2024cites this paper
Simple Temperature Cool-down in Contrastive Framework for Unsupervised Sentence Representation Learning
2024cites this paper
Common Sense Enhanced Knowledge-based Recommendation with Large Language Model
2024cites this paper
Joint contrastive learning for prompt-based few-shot language learners
2024cites this paper
DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning
2024cites this paper
Understanding Hyperbolic Metric Learning through Hard Negative Sampling
2024cites this paper
Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback
2024cites this paper
Poly-View Contrastive Learning
2024cites this paper
AUC-CL: A Batchsize-Robust Framework for Self-Supervised Contrastive Representation Learning
2024cites this paper
Learning Backdoors for Mixed Integer Linear Programs with Contrastive Learning
2024cites this paper
SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation
2024cites this paper
UNSEE: Unsupervised Non-contrastive Sentence Embeddings
2024cites this paper
Enhancing Electronic Information Retrieval through Masked Auto-encoder Based Word-level Augmentation
2024cites this paper
Nonparametric Clustering-Guided Cross-View Contrastive Learning for Partially View-Aligned Representation Learning
2024cites this paper
TabDeco: A Comprehensive Contrastive Framework for Decoupled Representations in Tabular Data
2024cites this paper
Information-Controllable Graph Contrastive Learning for Recommendation
2024cites this paper
Contrastive Abstraction for Reinforcement Learning
2024cites this paper
KDMCSE: Knowledge Distillation Multimodal Sentence Embeddings with Adaptive Angular margin Contrastive Learning
2024cites this paper
Multilingual Sentence-T5: Scalable Sentence Encoders for Multilingual Applications
2024cites this paper
Knowledge-Based Domain-Oriented Data Augmentation for Enhancing Unsupervised Sentence Embedding
2024cites this paper
The Impact of Training Methods on the Development of Pre-trained Language Models
2024cites this paper
SetCSE: Set Operations using Contrastive Learning of Sentence Embeddings
2024cites this paper
EMC2: Efficient MCMC Negative Sampling for Contrastive Learning with Global Convergence
2024cites this paper
Contrastive Learning for Clinical Outcome Prediction with Partial Data Sources
2024cites this paper
A Survey on Multi-modal Machine Translation: Tasks, Methods and Challenges
2024cites this paper
Improving Related Work Generation through Knowledge Aggregation from Citation Networks
2024cites this paper
Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning
2024cites this paper
Contrastive Learning of Asset Embeddings from Financial Time Series
2024cites this paper
Enhancing Topic Interpretability for Neural Topic Modeling Through Topic-Wise Contrastive Learning
2024cites this paper
Robust Similarity Learning with Difference Alignment Regularization
2024cites this paper
Leveraging Superfluous Information in Contrastive Representation Learning
2024cites this paper
View-Category Interactive Sharing Transformer for Incomplete Multi-View Multi-Label Learning
2024cites this paper
Enhancing Unsupervised Sentence Embeddings via Knowledge-Driven Data Augmentation and Gaussian-Decayed Contrastive Learning
2024cites this paper
VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks
2024cites this paper
CSSL: Contrastive Self-Supervised Learning for Dependency Parsing on Relatively Free Word Ordered and Morphologically Rich Low Resource Languages
2024cites this paper
False Negative Masking for Debiasing in Contrastive Learning
2024cites this paper
Not All Negatives are Equally Negative: Soft Contrastive Learning for Unsupervised Sentence Representations
2024cites this paper
Bayesian Self-Supervised Contrastive Learning
2023cites this paper
Unbiased and Efficient Self-Supervised Incremental Contrastive Learning
2023cites this paper
Tsetlin Machine Embedding: Representing Words Using Logical Expressions
2023cites this paper
Data-to-text Generation with Data Control and Multi-loss Fusion
2023cites this paper
Parameter-Effective Contrastive Learning and Isotropy of Different Layers For Better Sentence Embeddings
2023cites this paper
Modelling Text Similarity: A Survey
2023cites this paper
ClusCSE: Clustering-Based Contrastive Learning of Sentence Embeddings
2023cites this paper
PACT: Pretraining with Adversarial Contrastive Learning for Text Classification
2023cites this paper
Dialog‐based multi‐item recommendation using automatic evaluation
2023cites this paper
BERT Has More to Offer: BERT Layers Combination Yields Better Sentence Embeddings
2023cites this paper
Locally Differentially Private Embedding Models in Distributed Fraud Prevention Systems
2023cites this paper
Enriching Electronic Health Record with Semantic Features UtilisingPretrained Transformers
2023cites this paper
Optimal Sample Complexity of Contrastive Learning
2023influential citation
Contrastive Difference Predictive Coding
2023cites this paper
M³Seg: A Maximum-Minimum Mutual Information Paradigm for Unsupervised Topic Segmentation in ASR Transcripts
2023cites this paper
AdaptSSR: Pre-training User Model with Augmentation-Adaptive Self-Supervised Ranking
2023cites this paper
DistillCSE: Distilled Contrastive Learning for Sentence Embeddings
2023cites this paper
A Two-Stage Progressive Intent Clustering for Task-Oriented Dialogue
2023cites this paper
MetaCAR: Cross-Domain Meta-Augmentation for Content-Aware Recommendation
2023cites this paper
Expand BERT Representation with Visual Information via Grounded Language Learning with Multimodal Partial Alignment
2023cites this paper
An effective negative sampling approach for contrastive learning of sentence embedding
2023cites this paper
Sparse Contrastive Learning of Sentence Embeddings
2023cites this paper
Monitoring of Public Opinion on Typhoon Disaster Using Improved Clustering Model Based on Single-Pass Approach
2023cites this paper
Differentially private optimization for non-decomposable objective functions
2023cites this paper
A Novel Information-Theoretic Objective to Disentangle Representations for Fair Classification
2023cites this paper
DocSplit: Simple Contrastive Pretraining for Large Document Embeddings
2023cites this paper
Unsupervised contrastive graph learning for resting‐state functional MRI analysis and brain disorder detection
2023cites this paper
Graph Joint Representation Clustering via Penalized Graph Contrastive Learning
2023cites this paper