Universal Sentence Encoder

Daniel Matthew Cer,Yinfei Yang,Sheng-yi Kong,Nan Hua,Nicole Limtiaco,Rhomni St. John,Noah Constant,Mario Guajardo-Cespedes,Steve Yuan,C. Tar,Yun-Hsuan Sung,B. Strope,R. Kurzweil

Published 2018 in arXiv.org

ABSTRACT

We present models for encoding sentences into embedding vectors that specifically target transfer learning to other NLP tasks. The models are efficient and result in accurate performance on diverse transfer tasks. Two variants of the encoding models allow for trade-offs between accuracy and compute resources. For both variants, we investigate and report the relationship between model complexity, resource consumption, the availability of transfer task training data, and task performance. Comparisons are made with baselines that use word level transfer learning via pretrained word embeddings as well as baselines do not use any transfer learning. We find that transfer learning using sentence embeddings tends to outperform word level transfer. With transfer learning via sentence embeddings, we observe surprisingly good performance with minimal amounts of supervised training data for a transfer task. We obtain encouraging results on Word Embedding Association Tests (WEAT) targeted at detecting model bias. Our pre-trained sentence encoding models are made freely available for download and on TF Hub.

PUBLICATION RECORD

Publication year
2018
Venue
arXiv.org
Publication date
2018-03-29
Fields of study
Computer Science
Identifiers
arXiv 1803.11175
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Supervised Learning of Universal Sentence Representations from Natural Language Inference Data
2017cited by this paper
Efficient Natural Language Response Suggestion for Smart Reply
2017cited by this paper
Google Vizier: A Service for Black-Box Optimization
2017cited by this paper
Attention is All you Need
2017cited by this paper
SemEval-2017 Task 1: Semantic Textual Similarity Multilingual and Crosslingual Focused Evaluation
2017influential reference
TensorFlow: A system for large-scale machine learning
2016cited by this paper
Semantics derived automatically from language corpora contain human-like biases
2016cited by this paper
Skip-Thought Vectors
2015cited by this paper
Deep Unordered Composition Rivals Syntactic Methods for Text Classification
2015influential reference
A large annotated corpus for learning natural language inference
2015cited by this paper
GloVe: Global Vectors for Word Representation
2014influential reference
Convolutional Neural Networks for Sentence Classification
2014cited by this paper
Distributed Representations of Words and Phrases and their Compositionality
2013cited by this paper
Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank
2013cited by this paper
PETTIT iMpliciT anD expliciT sTigMaTizing aTTiTuDes anD sTereoTypes aBouT Depression
2011cited by this paper
Implicit and Explicit Stigmatizing Attitudes and Stereotypes About Depression
2011cited by this paper
Seeing Stars: Exploiting Class Relationships for Sentiment Categorization with Respect to Rating Scales
2005cited by this paper
Annotating Expressions of Opinions and Emotions in Language
2005cited by this paper
A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts
2004cited by this paper
Mining and summarizing customer reviews
2004cited by this paper
Are Emily and Greg More Employable than Lakisha and Jamal? A Field Experiment on Labor Market Discrimination
2003cited by this paper
Harvesting implicit group attitudes and beliefs from a demonstration web site
2002cited by this paper
Math = male, me = female, therefore math ≠ me.
2002cited by this paper
Learning Question Classifiers
2002cited by this paper
Math Male , Me Female , Therefore Math Me
2002cited by this paper
Measuring individual differences in implicit cognition: the implicit association test.
1998cited by this paper
Long Short-Term Memory
1997cited by this paper

CITED BY

Analyzing Cancer Patients' Experiences with Embedding-based Topic Modeling and LLMs
2026cites this paper
Semantic Novelty at Scale: Narrative Shape Taxonomy and Readership Prediction in 28,606 Books
2026cites this paper
On Plagiarism and Software Plagiarism
2026cites this paper
Not All Tokens Matter: Data-Centric Optimization for Efficient Code Summarization
2026cites this paper
Age Matters: Analyzing Age-Related Discussions in App Reviews
2026cites this paper
Adaptive Quality-Diversity Trade-offs for Large-Scale Batch Recommendation
2026cites this paper
Bootstrapping Embeddings for Low Resource Languages
2026cites this paper
COSTAR: Software Code Smell Detection Through Tree-Based Abstract Representation
2026cites this paper
To Tell or to Ask? Comparing the Effects of Targeted vs. Socratic AI Hints
2026cites this paper
Text-Muddler: an advanced adversarial paradigm for disrupting NLP-based neural architectures in sentiment analysis frameworks
2026cites this paper
Semantic Communities and Boundary-Spanning Lyrics in K-pop: A Graph-Based Unsupervised Analysis
2026cites this paper
AI-Augmented Instruction: Real-Time Misconception Detection
2026cites this paper
Sum Estimation via Vector Similarity Search
2026cites this paper
Cascaded Transformer for Robust and Scalable SLA Decomposition via Amortized Optimization
2026cites this paper
Harnessing multimodal large language models to interpret ecological momentary assessment-generated caregiving photographs
2026cites this paper
Accessibility rank: a machine learning approach for prioritizing accessibility user feedback
2025cites this paper
A Computational Framework to Identify Self-Aspects in Text
2025cites this paper
Experimental Evaluation of Dynamic Topic Modeling Algorithms
2025cites this paper
Semantic similarity estimation for domain specific data using BERT and other techniques
2025influential citation
TA: A Chinese Adversarial Samples Generation Approach Based on Multi-Strategy Perturbations
2025cites this paper
Search-based Selection of Metamorphic Relations for Optimized Robustness Testing of Large Language Models
2025influential citation
Enhancing Serendipity Recommendation System by Constructing Dynamic User Knowledge Graphs with Large Language Models
2025cites this paper
Relationships between numerical score and free text comments in student evaluations of teaching: A sentiment topic analysis reveals the influence of gender and culture
2025cites this paper
Ontology-based NLP tool for tracing software requirements and conceptual models: an empirical study
2025cites this paper
SMCLM: Semantically Meaningful Causal Language Modeling for Autoregressive Paraphrase Generation
2025cites this paper
Memory-Augmented Architecture for Long-Term Context Handling in Large Language Models
2025cites this paper
Large Language Models for Automating Clinical Data Standardization: HL7 FHIR Use Case
2025cites this paper
VIP: Visual Information Protection through Adversarial Attacks on Vision-Language Models
2025cites this paper
On The Role of Pretrained Language Models in General-Purpose Text Embeddings: A Survey
2025cites this paper
Automatic Text Summarization for Hindi Language Using Word Embeddings: A Critical Review
2025cites this paper
What You Read Isn't What You Hear: Linguistic Sensitivity in Deepfake Speech Detection
2025influential citation
Assessment and Integration of Large Language Models for Automated Electronic Health Record Documentation in Emergency Medical Services
2025cites this paper
Requirements Coverage-Guided Minimization for Natural Language Test Cases
2025influential citation
Enhancing Retrieval-Augmented Generation via Dual-Granularity Document Indexing
2025cites this paper
Saga: Understanding Stories in Mobile App Reviews
2025cites this paper
Predicting whole-brain neural dynamics from prefrontal cortex functional near-infrared spectroscopy signal during movie-watching
2025cites this paper
Estimating LLM Consistency: A User Baseline vs Surrogate Metrics
2025cites this paper
VITAL: More Understandable Feature Visualization through Distribution Alignment and Relevant Information Flow
2025cites this paper
Visual Question Answering: A Survey of Methods, Datasets, Evaluation, and Challenges
2025cites this paper
Accuracy is Not Agreement: Expert-Aligned Evaluation of Crash Narrative Classification Models
2025cites this paper
What is User Engagement?: A Systematic Review of 241 Research Articles in Human-Computer Interaction and Beyond
2025cites this paper
Large Language Models Are Qualified Benchmark Builders: Rebuilding Pre-Training Datasets for Advancing Code Intelligence Tasks
2025cites this paper
A Data‐Driven Methodology for Quality Aware Code Fixing
2025cites this paper
Pixel Motion as Universal Representation for Robot Control
2025cites this paper
High-dimensional structure underlying individual differences in naturalistic visual experience.
2025cites this paper
Rethinking Scene Segmentation. Advancing Automated Detection of Scene Changes in Literary Texts
2025influential citation
Analyzing Reddit Stories of Sexual Violence: Incidents, Effects, and Requests for Advice
2025cites this paper
GradEscape: A Gradient-Based Evader Against AI-Generated Text Detectors
2025cites this paper
Adimen Artifiziala Joko Patologikoaren Arriskua Aurresateko
2025cites this paper
An AI-driven Requirements Engineering Framework Tailored for Evaluating AI-Based Software
2025cites this paper
Context-Based Fake News Detection using Graph Based Approach: ACOVID-19 Use-case
2025cites this paper
Navigating the storm: the impact of the Russia–Ukraine war on EU’s quest for strategic autonomy
2025cites this paper
Black-Box Adversarial Attack on Dialogue Generation via Multi-Objective Optimization
2025cites this paper
CARTS: Collaborative Agents for Recommendation Textual Summarization
2025cites this paper
UFGraphFR: Graph Federation Recommendation System based on User Text description features
2025cites this paper
Vision Assist: Enhancing Accessibility for the Visually Impaired through Advanced Video Captioning
2025cites this paper
Enhancing Phishing Detection in Financial Systems through NLP
2025cites this paper
From words to visuals: a transformer-based multi-modal framework for emotion-driven tourism analytics
2025cites this paper
Branch Explorer: Leveraging Branching Narratives to Support Interactive 360° Video Viewing for Blind and Low Vision Users
2025cites this paper
Exploring Layer-wise Representations of English and Chinese Homonymy in Pre-trained Language Models
2025cites this paper
Uncertainty-driven Embedding Convolution
2025cites this paper
Evaluating Suprasegmental Features for Phonological Fusion and Spectrogram-Based Speech Command Recognition
2025cites this paper
Studying memory narratives with natural language processing.
2025cites this paper
A Survey of Large Language Models in Mental Health Disorder Detection on Social Media
2025cites this paper
Emotional arousal enhances narrative memories through functional integration of large-scale brain networks
2025cites this paper
Language-agnostic, Automated Assessment of Listeners’ Speech Recall Using Large Language Models
2025cites this paper
Evaluating Pretrained Embeddings for Automatic Short Answer Grading
2025cites this paper
Exploring Author Style in Nakba Short Stories: A Comparative Study of Transformer-Based Models
2025cites this paper
Confidence Elicitation: A New Attack Vector for Large Language Models
2025cites this paper
Context-aware code summarization with multi-relational graph neural network
2025cites this paper
Bankruptcy analysis using images and convolutional neural networks (CNN)
2025cites this paper
Unified Prompt Attack Against Text-to-Image Generation Models
2025cites this paper
Conversational linguistic features inform social-relational inference
2025cites this paper
Approximate Hausdorff Distance for Multi-Vector Databases
2025cites this paper
Is Multi-Agent Debate (MAD) the Silver Bullet? An Empirical Analysis of MAD in Code Summarization and Translation
2025cites this paper
Hard label adversarial attack with high query efficiency against NLP models
2025cites this paper
RoboFlamingo-Plus: Fusion of Depth and RGB Perception with Vision-Language Models for Enhanced Robotic Manipulation
2025cites this paper
Automated Software Requirements Prioritization Using Natural Language Processing
2025cites this paper
Performance Evaluation of Sentiment Analysis on Text and Emoji Data Using End-to-End, Transfer Learning, Distributed and Explainable AI Models
2025cites this paper
Sentiment Analysis of Arabic Tweets Using Large Language Models
2025influential citation
Universal Automatic Short Answer Grading (ASAG) Model: A Comprehensive Approach
2025cites this paper
Detecting Implicit Subjects in Text
2025cites this paper
Q-FAKER: Query-free Hard Black-box Attack via Controlled Generation
2025cites this paper
Exploring similarity patterns in a large scientific corpus
2025cites this paper
Do Automatic Comment Generation Techniques Fall Short? Exploring the Influence of Method Dependencies on Code Understanding
2025cites this paper
Transformer-Empowered Actor-Critic Reinforcement Learning for Sequence-Aware Service Function Chain Partitioning
2025cites this paper
Sentence Embeddings as an intermediate target in end-to-end summarisation
2025influential citation
Theoretical Guarantees for LT-TTD: A Unified Transformer-based Architecture for Two-Level Ranking Systems
2025cites this paper
A Blockchain-Based Anonymous Crime Reporting Platform Integrating Privacy-Preserving NLP for Enhanced Public Safety
2025cites this paper
Are LLMs complicated ethical dilemma analyzers?
2025cites this paper
Subjective Answer Sheet Evaluation
2025cites this paper
Predicting Reaction Time to Comprehend Scenes with Foveated Scene Understanding Maps
2025cites this paper
SLMEval: Entropy-Based Calibration for Human-Aligned Evaluation of Large Language Models
2025cites this paper
SBERT-based Deep Learning model for mapping of PEOs and POs with Justification Rubrics
2025cites this paper
Multilingual and Multi-Class Sentiment Classification Using Machine Learning, BERT, and GPT-4o-mini
2025cites this paper
How to Automate Feedback on Diagrammatic Reasoning with a Relevant Degree of Freedom?
2025cites this paper
A Novel Transformer Based Unified Framework for Sentence-Level Sentiment Analysis
2025cites this paper
AI-Generated Compromises for Coalition Formation
2025cites this paper
Hierarchical Article Classification: A Multi-Level Framework for Organizing Scholarly Literature
2025cites this paper
Parentheses insertion based sentence-level text adversarial attack
2025cites this paper