Efficient Estimation of Word Representations in Vector Space

Tomas Mikolov,Kai Chen,G. Corrado,J. Dean

Published 2013 in International Conference on Learning Representations

ABSTRACT

We propose two novel model architectures for computing continuous vector representations of words from very large data sets. The quality of these representations is measured in a word similarity task, and the results are compared to the previously best performing techniques based on different types of neural networks. We observe large improvements in accuracy at much lower computational cost, i.e. it takes less than a day to learn high quality word vectors from a 1.6 billion words data set. Furthermore, we show that these vectors provide state-of-the-art performance on our test set for measuring syntactic and semantic word similarities.

PUBLICATION RECORD

Publication year
2013
Venue
International Conference on Learning Representations
Publication date
2013-01-16
Fields of study
Computer Science
Identifiers
arXiv 1301.3781
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Combining Heterogeneous Models for Measuring Relational Similarity
2013cited by this paper
Distributed Representations of Words and Phrases and their Compositionality
2013cited by this paper
Linguistic Regularities in Continuous Space Word Representations
2013influential reference
A fast and simple algorithm for training neural probabilistic language models
2012cited by this paper
Large Scale Distributed Deep Networks
2012influential reference
Statistical Language Models Based on Neural Networks
2012cited by this paper
Improving Word Representations via Global Context and Multiple Word Prototypes
2012cited by this paper
SemEval-2012 Task 2: Measuring Degrees of Relational Similarity
2012cited by this paper
Natural Language Processing (Almost) from Scratch
2011cited by this paper
Adaptive Subgradient Methods for Online Learning and Stochastic Optimization
2011cited by this paper
Empirical Evaluation and Combination of Advanced Language Modeling Techniques
2011cited by this paper
Learning Word Vectors for Sentiment Analysis
2011cited by this paper
Dynamic Pooling and Unfolding Recursive Autoencoders for Paraphrase Detection
2011cited by this paper
Extensions of recurrent neural network language model
2011cited by this paper
Strategies for training large scale neural network language models
2011cited by this paper
The Microsoft Research Sentence Completion Challenge
2011influential reference
Recurrent neural network based language model
2010cited by this paper
Word Representations: A Simple and General Method for Semi-Supervised Learning
2010cited by this paper
A unified architecture for natural language processing: deep neural networks with multitask learning
2008cited by this paper
A Scalable Hierarchical Distributed Language Model
2008cited by this paper
Continuous space language models
2007cited by this paper
Large Language Models in Machine Translation
2007cited by this paper
Three new graphical models for statistical language modelling
2007cited by this paper
Scaling learning algorithms towards AI
2007cited by this paper
Language Modeling for Speech Recognition of Czech
2006influential reference
Hierarchical Probabilistic Neural Network Language Model
2005cited by this paper
Measuring Semantic Similarity by Latent Relational Analysis
2005cited by this paper
A Neural Probabilistic Language Model
2003cited by this paper
Finding Structure in Time
1990cited by this paper
Parallel Distributed Processing: Explorations in the Micro-structure of Cognition
1989cited by this paper
Learning internal representations by back-propagating errors
1986cited by this paper
Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations
1986cited by this paper
Distributed Representations
1986cited by this paper

CITED BY

Time & Frequency Domain Consistency Generic Learning for Fault Diagnosis of HST Bogie via Self-supervised Contrastive Pre-training
2026cites this paper
From patents to predictive analytics: Leveraging R-GCNs for technological opportunity discovery in converging industries
2026cites this paper
Natural Language Processing and Machine Learning Classification Model for Injury Mechanism in Trauma.
2026cites this paper
A Self-Reflection Mechanism for Reducing Hallucination in Vietnamese Legal Question Answering Systems
2026cites this paper
Automated Assignment of Community Reports With Early Fusion Multimodal Transformer
2026cites this paper
What Is Missing: Interpretable Ratings for Large Language Model Outputs
2026cites this paper
Detecting RAG Advertisements Across Advertising Styles
2026cites this paper
World Properties without World Models: Recovering Spatial and Temporal Structure from Co-occurrence Statistics in Static Word Embeddings
2026cites this paper
Universal Conceptual Structure in Neural Translation: Probing NLLB-200's Multilingual Geometry
2026influential citation
LIDS: LLM Summary Inference Under the Layered Lens
2026cites this paper
VectorMaton: Efficient Vector Search with Pattern Constraints via an Enhanced Suffix Automaton
2026cites this paper
Texterial: A Text-as-Material Interaction Paradigm for LLM-Mediated Writing
2026cites this paper
The Global Landscape of Environmental AI Regulation: From the Cost of Reasoning to a Right to Green AI
2026cites this paper
Towards OOD Generalization in Dynamic Graphs via Causal Invariant Learning
2026cites this paper
OmniRet: Efficient and High-Fidelity Omni Modality Retrieval
2026cites this paper
M$^2$: Dual-Memory Augmentation for Long-Horizon Web Agents via Trajectory Summarization and Insight Retrieval
2026influential citation
Convolutional autoencoder and embedding-based approach for deep clustering in knowledge graphs
2026cites this paper
Sentence representations for semantic textual similarity: A systematic review
2026cites this paper
Infact: Intelligent Fake News & Sentiment Analysis
2026cites this paper
Analyzing RISC-V Compiler Toolchain by Adopting Topic Modeling
2026cites this paper
Feasibility of Using Large Language Models for Structured Medication Extraction from Clinical Text: A Comparative Analysis of Zero-Shot and Few-Shot Paradigms
2026cites this paper
Longitudinal modality prediction learns gene regulatory patterns: insights from a single-cell competition
2026cites this paper
Double-Edged Sword of Online Communication Substantiveness in the Pursuit of Strategy Uniqueness
2026cites this paper
Leveraging Non-linear Dimension Reduction and Random Walk Co-occurrence for Node Embedding
2026cites this paper
Connectome-based predictive modeling of concurrent and longitudinal substance use vulnerability in adolescence.
2026cites this paper
A Lightweight Defense Mechanism against Next Generation of Phishing Emails using Distilled Attention-Augmented BiLSTM
2026cites this paper
KeemenaPreprocessing.jl: Unicode-Robust Cleaning, Multi-Level Tokenisation and Streaming Offset Bundling for Julia NLP
2026cites this paper
Integrating language model embeddings into the ACT-R cognitive modeling framework
2026cites this paper
CGSeg: Cross-Aggregation and Gated Fusion for Open-Vocabulary Semantic Segmentation
2026cites this paper
SHIELD: System for Harmful Explicit-Content Identification and Evaluation Through LLM-Driven Approach
2026cites this paper
Beyond Subtokens: A Rich Character Embedding for Low-resource and Morphologically Complex Languages
2026cites this paper
PVminer: A Domain-Specific Tool to Detect the Patient Voice in Patient Generated Data
2026cites this paper
CREDIT: Certified Ownership Verification of Deep Neural Networks Against Model Extraction Attacks
2026cites this paper
[b]=[d]-[t]+[p]: Self-supervised Speech Models Discover Phonological Vector Arithmetic
2026cites this paper
InsNet: Deep Indefinite Spectral Kernel Network
2026cites this paper
Energy transition policies and the corporate AI adoption: Evidence from new energy cities
2026cites this paper
Relative Chaoticity of Natural Languages
2026cites this paper
Complexity in the humanities and the humanities in complexity
2026cites this paper
Understanding and Generating Student Questions with LLMs in Collaborative Learning
2025cites this paper
A Preoperative Data Sentenceization Method for Postoperative Major Adverse Cardiovascular Event Prediction
2025cites this paper
An LLM-Based Multi-Modal Framework for Efficient and Realtime Detection of Sensitive Information Using Speech and Transcribed Data
2025cites this paper
Federated Multi-Modal Knowledge Graph Representation Learning with Optimal Transport Alignment
2025cites this paper
A Preliminary Study on Preoperative Data Sentenceization for Postoperative Major Adverse Cardiovascular Event Prediction
2025cites this paper
Conflict Event Actor Prediction Using Spatial Graph Neural Networks
2025cites this paper
LiteKG: A Lightweight LLM-Assisted Framework for Domain-Specific Knowledge Graph Construction
2025cites this paper
Student-to-job Ranking Framework Based on Knowledge Space Embedding
2025cites this paper
Extracting Latent Insights and Tagging Fall Injuries from Clinical Narratives Using Unsupervised Learning
2025cites this paper
Effective Neural Author-Topic Modeling by Leveraging Pre-Trained Language Models
2025cites this paper
Graph Neural Network (GNN) and its Application: A State-of-the-Art Survey
2025cites this paper
Chinese NER for UAV Fault Texts via Local–Global Joint Modeling and Diffusion-Based Semantic Denoising
2025cites this paper
Artificial Intelligence in Social Media: A Comprehensive Review of Models, Applications, and Ethical Issues
2025cites this paper
Quantum State Embeddings and Fidelity Clustering: Advancing Hindi Word Sense Disambiguation : 3rd International Conference on Computational Intelligence and Network Systems (CINS 2025), BITS Pilani, Dubai Campus, Dubai, UAE
2025cites this paper
A Mobile Application Approach to Deep Learning for Symptom-Based Disease Detection From User Textual
2025cites this paper
AIoT for Sentiment-Driven Automation: Leveraging RNNs for Intelligent IoT Device Control
2025cites this paper
LexiBoost: AI-Powered Automated Essay Scoring System with Comparative Model Analysis
2025cites this paper
Efficient Vector Search for RAG and Document Classification Using Principal Vectors
2025cites this paper
Automated Multilingual Content Delivery for the Visually Impaired via AI-Driven Document Parsing
2025cites this paper
CausaMap: A Semi-Supervised Map for Causal Text Mining
2025cites this paper
Utilizing Arithmetic Meta-Learning Method for Unsupervised Learning Purposes
2025cites this paper
Odia News Headlines Classification using Convolutional Neural Network
2025cites this paper
Nagy nyelvi modellek és zárt információs rendszerek a védelmi szférában
2025cites this paper
Specialized Word Embedding Model for the Detection of Suicidal Ideation Using Deep Learning
2025influential citation
Architecture for Semantic Profile Matching Based on Text Descriptors
2025cites this paper
Sentiment Analysis of Digital Ethics in YouTube Islamic Preaching Videos Using Support Vector Machine
2025cites this paper
Differentiable Distance Between Hierarchically-Structured Data
2025cites this paper
Towards Detecting Infinite Combos in Collectible Card Game
2025cites this paper
AI-Optimized Serverless Computing for Scalable Customer Sentiment Analysis in Open Banking APIs
2025cites this paper
Arabic Spam Email Detection Using Fine-Tuned Transformer Models
2025cites this paper
A Comparative Study of BiLSTM and LSTM Architectures for Sentiment Analysis on a Mobile Banking App Review Dataset
2025cites this paper
Research Seminar on Social Media: Mining, Modeling and Meaning
2025cites this paper
Effective and Efficient Similarity Search for DNA Sequences Through de Bruijn Sum Graph Embedding
2025cites this paper
MTL-CR: A Multitask Learning Approach for Code Representation
2025cites this paper
Combining Pre-Trained Language Models with Deep GCN for Text Classification
2025cites this paper
Dictionary-based Byte-Pair Encoding tokenizer for morphologically rich languages
2025cites this paper
Identifying the technology convergence using patent text information: A graph convolutional networks (GCN)-based approach
2022cites this paper
Predicting Tags for Stack Overflow Questions Using Classifier
2021cites this paper
Machine Learning with Applications
2021influential citation
Quantified language connectedness in schizophrenia-spectrum disorders Running title : language connectedness in schizophrenia
2021cites this paper
Evaluating Natural Language Generation via Unbalanced Optimal Transport
2020cites this paper
Artificial Intelligence Applications and Innovations: 16th IFIP WG 12.5 International Conference, AIAI 2020, Neos Marmaras, Greece, June 5–7, 2020, Proceedings, Part II
2020cites this paper
Neural Information Processing: 26th International Conference, ICONIP 2019, Sydney, NSW, Australia, December 12–15, 2019, Proceedings, Part I
2019influential citation
Neural Information Processing: 26th International Conference, ICONIP 2019, Sydney, NSW, Australia, December 12–15, 2019, Proceedings, Part II
2019influential citation
The Value of First Impressions: Leveraging Acquisition Data for Customer Management
2019cites this paper
What are you saying? Using topic to detect financial misreporting*
2018cites this paper
Continuous Distributed Representation of Biological Sequences for Deep Proteomics and Genomics
2015cites this paper
AI for Data Ingestion into IPAC Archives
year unknowncites this paper
Expert Systems With Applications
year unknowncites this paper
Attacking the First-Principle: A Black-Box, Query-Free Targeted Mimicry Attack on Binary Function Classifiers
year unknowncites this paper
: Advancing Natural Language Processing for
year unknowncites this paper
Proceedings of the Annual Meeting of the Cognitive Science Society
year unknowncites this paper
How Do Medical Professionals Perceive Artificial Intelligence: An How Do Medical Professionals Perceive Artificial Intelligence: An Analysis of Reddit Data Analysis of Reddit Data
year unknowncites this paper
ODU Digital Commons ODU Digital Commons
year unknowncites this paper
Supplementary Materials of Category-Specific Selective Feature Enhancement for Long-Tailed Multi-Label Image Classification
year unknowncites this paper
Crypto social media analysis: Deep learning framework for Text Classification and Question Answering
year unknowncites this paper
Decoding Benglish: Scalable Information Retrieval for Transliterated Code-Mixed Conversations
year unknowncites this paper
International Journal of Approximate Reasoning
year unknowncites this paper
SHARK: MODELING SEMANTIC HIERARCHY OF MEDICAL CODE VIA RESIDUAL K-MEANS QUANTIZATION
year unknowncites this paper
Unsupervised Text Classification with Neural Word Embeddings
year unknowncites this paper
A Review of Artificial Intelligence Techniques for Cybersecurity Threat Detection
year unknowncites this paper
Uncanny Semantics: How AI and Human Authors Use Language Differently in Academic Writing
year unknowncites this paper