Dynamic Word Embeddings

Published 2017 in International Conference on Machine Learning

ABSTRACT

We present a probabilistic language model for time-stamped text data that tracks the semantic evolution of individual words over time. The model represents words and contexts by latent trajectories in an embedding space. At each moment in time, the embedding vectors are inferred from a probabilistic version of word2vec [Mikolov et al., 2013]. These embedding vectors are connected in time through a latent diffusion process. We describe two scalable variational inference algorithms, skip-gram smoothing and skip-gram filtering, that allow us to train the model jointly over all times, thus learning from all data while simultaneously allowing word and context vectors to drift. Experimental results on three different corpora demonstrate that our dynamic model infers word embedding trajectories that are more interpretable and lead to higher predictive likelihoods than competing methods based on static models trained separately on time slices.
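
The two ingredients named in the abstract, embeddings that drift through a latent diffusion process and a probabilistic skip-gram likelihood at each time step, can be illustrated with a small sketch. The abstract does not give the exact form of either, so the code below makes loud assumptions: the diffusion is taken to be a plain Gaussian random walk, and the likelihood is taken to be a Bernoulli model over positive and negative (word, context) counts with a sigmoid link. The function names `sample_trajectories` and `slice_log_likelihood`, all parameters, and the counts are hypothetical. The sketch covers only the generative side and does not implement skip-gram smoothing or skip-gram filtering.

```python
import math
import random


def softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def sample_trajectories(n_words, dim, n_steps, init_std=1.0, step_std=0.05, seed=0):
    """Draw embeddings at time 0, then let every coordinate drift as a
    Gaussian random walk (an assumed, simple form of latent diffusion)."""
    rng = random.Random(seed)
    current = [[rng.gauss(0.0, init_std) for _ in range(dim)] for _ in range(n_words)]
    trajectory = [current]
    for _ in range(n_steps - 1):
        current = [[x + rng.gauss(0.0, step_std) for x in vec] for vec in current]
        trajectory.append(current)
    return trajectory


def slice_log_likelihood(word_vecs, context_vecs, pos_counts, neg_counts):
    """Bernoulli skip-gram log-likelihood for one time slice (assumed form).

    pos_counts[i][j]: how often word i was observed with context j.
    neg_counts[i][j]: how often the pair was drawn as a negative sample.
    """
    total = 0.0
    for i, u in enumerate(word_vecs):
        for j, v in enumerate(context_vecs):
            score = sum(a * b for a, b in zip(u, v))
            # log sigmoid(score) = -softplus(-score)
            total -= pos_counts[i][j] * softplus(-score)
            # log(1 - sigmoid(score)) = -softplus(score)
            total -= neg_counts[i][j] * softplus(score)
    return total


if __name__ == "__main__":
    words = sample_trajectories(n_words=3, dim=2, n_steps=4, seed=0)
    contexts = sample_trajectories(n_words=3, dim=2, n_steps=4, seed=1)
    pos = [[1, 0, 2], [0, 3, 0], [1, 1, 0]]  # made-up counts for illustration
    neg = [[1, 1, 1], [1, 1, 1], [1, 1, 1]]  # made-up negative-sample counts
    for t, (U, V) in enumerate(zip(words, contexts)):
        print(t, round(slice_log_likelihood(U, V, pos, neg), 3))
```

In this sketch a small `step_std` ties neighboring time slices together, which is one way to read the abstract's point that a joint model can learn from all data while still letting vectors drift; setting `step_std=0.0` collapses it to a single static embedding shared by all slices.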

PUBLICATION RECORD

Publication year: 2017
Venue: International Conference on Machine Learning
Publication date: 2017-02-27
Fields of study: Mathematics, Computer Science
Identifiers: arXiv:1702.08359

REFERENCES

Dynamic Bernoulli Embeddings for Language Evolution (2017; cited by this paper)
Variational Inference: A Review for Statisticians (2016; influential reference)
Dynamic Collaborative Filtering With Compound Poisson Factorization (2016; cited by this paper)
Diachronic Word Embeddings Reveal Statistical Laws of Semantic Change (2016; influential reference)
Bayesian Neural Word Embedding (2016; cited by this paper)
Semi-supervised Vocabulary-Informed Learning (2016; cited by this paper)
Visualizing Time-Dependent Data Using Dynamic t-SNE (2016; influential reference)
Exponential Family Embeddings (2016; cited by this paper)
Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks (2016; cited by this paper)
The Survival Filter: Joint Survival Analysis with a Latent Time Series (2015; cited by this paper)
Dynamic Poisson Factorization (2015; cited by this paper)
A Collaborative Kalman Filter for Time-Evolving Dyadic Processes (2014; cited by this paper)
Statistically Significant Detection of Linguistic Change (2014; cited by this paper)
Neural Word Embedding as Implicit Matrix Factorization (2014; cited by this paper)
Stochastic Backpropagation and Approximate Inference in Deep Generative Models (2014; influential reference)
Temporal Analysis of Language through Neural Language Models (2014; influential reference)
GloVe: Global Vectors for Word Representation (2014; cited by this paper)
Word Representations via Gaussian Embedding (2014; cited by this paper)
Adam: A Method for Stochastic Optimization (2014; cited by this paper)
Parsing with Compositional Vector Grammars (2013; cited by this paper)
Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank (2013; cited by this paper)
Black Box Variational Inference (2013; cited by this paper)
Efficient Estimation of Word Representations in Vector Space (2013; influential reference)
Linguistic Regularities in Continuous Space Word Representations (2013; influential reference)
The inverse of banded matrices (2013; cited by this paper)
Learning word embeddings efficiently with noise-contrastive estimation (2013; cited by this paper)
Distributed Representations of Words and Phrases and their Compositionality (2013; influential reference)
Auto-Encoding Variational Bayes (2013; cited by this paper)
A Hidden Markov Model for Collaborative Filtering (2012; cited by this paper)
Stochastic variational inference (2012; influential reference)
Word Epoch Disambiguation: Finding How Words Change Over Time (2012; cited by this paper)
Tracing semantic change with latent semantic analysis (2011; cited by this paper)
Quantitative Analysis of Culture Using Millions of Digitized Books (2010; cited by this paper)
Continuous Time Dynamic Topic Models (2008; cited by this paper)
A Neural Probabilistic Language Model (2003; cited by this paper)
Mirror descent and nonlinear projected subgradient methods for convex optimization (2003; cited by this paper)
A New Approach to Linear Filtering and Prediction Problems (2002; cited by this paper)
The Ordered Subsets Mirror Descent Optimization Method with Applications to Tomography (2001; cited by this paper)
An Introduction to Variational Methods for Graphical Models (1999; cited by this paper)
Welch & Bishop, An Introduction to the Kalman Filter (1994; cited by this paper)
On the theory of Brownian motion (1973; cited by this paper)

CITED BY

Semantic Substrate Theory: An Operator-Theoretic Framework for Geometric Semantic Drift (2026; cites this paper)
Improving Interpretability of Lexical Semantic Change with Neurobiological Features (2026; cites this paper)
The cell as a token: high-dimensional geometry in language models and cell embeddings (2025; cites this paper)
Chronotome: Real-Time Topic Modeling for Streaming Embedding Spaces (2025; cites this paper)
Posterior Sampling of Probabilistic Word Embeddings (2025; influential citation)
ConShift: Sense-based Language Variation Analysis using Flexible Alignment (2025; cites this paper)
An Efficient Classification Model for Cyber Text (2025; cites this paper)
Measuring Interdisciplinarity in Geology: A Semantic Analysis Approach (2025; cites this paper)
CATER: A Cluster-Based Alternative-Term Recommendation Framework for Large-Scale Web Search at NAVER (2025; cites this paper)
Koopman Spectral Dynamics for Interpretable and Predictive Semantic Drift in Language Models (2025; cites this paper)
Temporal-Aware Soft Prompt Tuning for Automatic Text Dating (2025; cites this paper)
GrASP: A Generalizable Address-based Semantic Prefetcher for Scalable Transactional and Analytical Workloads (2025; cites this paper)
Towards Retrieval-Augmented Large Language Models: Data Management and System Design (2025; cites this paper)
Transcriptome Transformer: improving patient survival prediction via multitask learning of transcriptomic and clinical features (2025; cites this paper)
Interpretable domain-informed and domain-agnostic features for supervised and unsupervised learning on building energy demand data (2024; cites this paper)
Discovering emergent connections in quantum physics research via dynamic word embeddings (2024; cites this paper)
Syntactic Language Change in English and German: Metrics, Parsers, and Convergences (2024; cites this paper)
Insider Threat Detection Based on Personalized User Modeling (2024; cites this paper)
TempoFormer: A Transformer for Temporally-aware Representations in Change Detection (2024; cites this paper)
Building Brownian Bridges to Learn Dynamic Author Representations from Texts (2024; cites this paper)
Towards a Complete Solution to Lexical Semantic Change: an Extension to Multiple Time Periods and Diachronic Word Sense Induction (2024; cites this paper)
RiskSEA: A Scalable Graph Embedding for Detecting On-chain Fraudulent Activities on the Ethereum Blockchain (2024; cites this paper)
A computational analysis of crosslinguistic regularity in semantic change (2023; cites this paper)
Neural Dynamic Focused Topic Model (2023; cites this paper)
Temporal Domain Adaptation for Historical Irish (2023; cites this paper)
Time is Encoded in the Weights of Finetuned Language Models (2023; cites this paper)
Sequential Path Signature Networks for Personalised Longitudinal Language Modeling (2023; influential citation)
Once Upon a Time in Graph: Relative-Time Pretraining for Complex Temporal Reasoning (2023; cites this paper)
A survey on narrative extraction from textual data (2023; cites this paper)
Object embedding using an information geometrical perspective (2023; cites this paper)
Sig-Networks Toolkit: Signature Networks for Longitudinal Language Modelling (2023; cites this paper)
Dynamic Bayesian Contrastive Predictive Coding Model for Personalized Product Search (2023; cites this paper)
Temporal word embedding with predictive capability (2023; influential citation)
The big data analysis in cultural psychology (2023; cites this paper)
Sen2Pro: A Probabilistic Perspective to Sentence Embedding from Pre-trained Language Model (2023; cites this paper)
Literary Intertextual Semantic Change Detection: Application and Motivation for Evaluating Models on Small Corpora (2023; cites this paper)
adSformers: Personalization from Short-Term Sequences and Diversity of Representations in Etsy Ads (2023; cites this paper)
DynaMiTE: Discovering Explosive Topic Evolutions with User Guidance (2023; cites this paper)
Semantic Shift Detection in Vatican Publications: a Case Study from Leo XIII to Francis (2022; cites this paper)
Temporal Analysis on Topics Using Word2Vec (2022; cites this paper)
Dynamic Co-Embedding Model for Temporal Attributed Networks (2022; cites this paper)
Markovian Gaussian Process Variational Autoencoders (2022; cites this paper)
Graph-based Dynamic Word Embeddings (2022; cites this paper)
Letters From the Past: Modeling Historical Sound Change Through Diachronic Character Embeddings (2022; cites this paper)
Evaluating the timing and magnitude of semantic change in diachronic word embedding models (2022; cites this paper)
Entity Cloze By Date: What LMs Know About Unseen Entities (2022; cites this paper)
MEDT: Using Multimodal Encoding-Decoding Network as in Transformer for Multimodal Sentiment Analysis (2022; cites this paper)
Probabilistic Embeddings with Laplacian Graph Priors (2022; cites this paper)
Trust in Motion: Capturing Trust Ascendancy in Open-Source Projects using Hybrid AI (2022; cites this paper)
Dynamic Gaussian Embedding of Authors (2022; cites this paper)
HistBERT: A Pre-trained Language Model for Diachronic Lexical Semantic Analysis (2022; cites this paper)
Anomalous diffusion analysis of semantic evolution in major Indo-European languages (2022; cites this paper)
Detecting and Adapting to Irregular Distribution Shifts in Bayesian Online Learning: Supplementary Materials (2022; cites this paper)
“Vaderland”, “Volk” and “Natie”: Semantic Change Related to Nationalism in Dutch Literature Between 1700 and 1880 Captured with Dynamic Bernoulli Word Embeddings (2022; cites this paper)
The Changing Concepts of the Constitution (2022; cites this paper)
Understanding Entropy Coding With Asymmetric Numeral Systems (ANS): a Statistician's Perspective (2022; cites this paper)
Infinite SCAN: An Infinite Model of Diachronic Semantic Change (2022; cites this paper)
Metadata Might Make Language Models Better (2022; cites this paper)
Temporal Attention for Language Models (2022; cites this paper)
Extracting information and inferences from a large text corpus (2022; cites this paper)
Domain-Specific Word Embeddings with Structure Prediction (2022; cites this paper)
Temporal Word Embeddings for Early Detection of Signs of Depression (2022; cites this paper)
Temporal Knowledge Graph Completion with Approximated Gaussian Process Embedding (2022; cites this paper)
Improving Temporal Generalization of Pre-trained Language Models with Lexical Semantic Change (2022; cites this paper)
A Survey on Embedding Dynamic Graphs (2021; cites this paper)
A data-driven approach to studying changing vocabularies in historical newspaper collections (2021; cites this paper)
Adjusting Scope: A Computational Approach to Case-Driven Research on Semantic Change (2021; cites this paper)
Word2Fun: Modelling Words as Functions for Diachronic Word Representation (2021; influential citation)
Semantic Change Detection With Gaussian Word Embeddings (2021; cites this paper)
Time Masking for Temporal Language Models (2021; cites this paper)
Exploring Word Usage Change with Continuously Evolving Embeddings (2021; cites this paper)
Dissecting word embeddings and language models in natural language processing (2021; cites this paper)
Theoretical Foundations and Limits of Word Embeddings: What Types of Meaning can They Capture? (2021; cites this paper)
Modeling the Evolution of Word Senses with Force-Directed Layouts of Co-occurrence Networks (2021; cites this paper)
Sentiment analysis with covariate-assisted word embeddings (2021; cites this paper)
On the Universality of Deep Contextual Language Models (2021; cites this paper)
A Statistical Model of Word Rank Evolution (2021; cites this paper)
Transitivity of transformation matrices to bridge word vector spaces over 1000 years (2021; cites this paper)
Measuring diachronic sense change: New models and Monte Carlo methods for Bayesian inference (2021; influential citation)
SFE-GACN: A novel unknown attack detection under insufficient data via intra categories generation in embedding space (2021; cites this paper)
Study Concept Drift in 150-year English Literature (2021; cites this paper)
Time-Aware Language Models as Temporal Knowledge Bases (2021; cites this paper)
Zipfian regularities in "non-point" word representations (2021; cites this paper)
A combined syntactic-semantic embedding model based on lexicalized tree-adjoining grammar (2021; cites this paper)
Measure and Evaluation of Semantic Divergence across Two Languages (2021; cites this paper)
Probabilistic and Dynamic Molecule-Disease Interaction Modeling for Drug Discovery (2021; cites this paper)
Learning Dynamic Embeddings for Temporal Knowledge Graphs (2021; cites this paper)
Diffusion-based Temporal Word Embeddings (2021; cites this paper)
Variational Beam Search for Novelty Detection (2021; influential citation)
A diachronic evaluation of gender asymmetry in euphemism (2021; cites this paper)
The challenges of temporal alignment on Twitter during crises (2021; cites this paper)
Fake it Till You Make it: Self-Supervised Semantic Shifts for Monolingual Word Embedding Tasks (2021; influential citation)
On Disentanglement in Gaussian Process Variational Autoencoders (2021; cites this paper)
Challenges for Computational Lexical Semantic Change (2021; cites this paper)
Using Deep Learning for Emotion Analysis of 18th and 19th Century German Plays (2021; cites this paper)
Historical changes in semantic weights of sub-word units (2021; cites this paper)
Robust Visualisation of Dynamic Text Collections: Measuring and Comparing Dimensionality Reduction Algorithms (2021; cites this paper)
Deep dynamic neural networks for temporal language modeling in author communities (2021; cites this paper)
Dynamic Language Models for Continuously Evolving Content (2021; cites this paper)
DCC-Uchile at SemEval-2020 Task 1: Temporal Referencing Word Embeddings (2020; cites this paper)