Semantic Similarity Estimation Using Vector Symbolic Architectures

Job Isaias Quiroz-Mercado,R. Barrón-Fernández,M. Ramirez-Salinas

Published 2020 in IEEE Access

ABSTRACT

For many natural language processing applications, estimating similarity and relatedness between words are key tasks that serve as the basis for classification and generalization. Currently, vector semantic models (VSM) have become a fundamental language modeling tool. VSMs represent words as points in a high-dimensional space and follow the distributional hypothesis of meaning, which assumes that semantic similarity is related to the context. In this paper, we propose a model whose representations are based on the semantic features associated with a concept within the ConceptNet knowledge graph. The proposed model is based on a vector symbolic architecture framework, which defines a set of arithmetic operations to encode the semantic features within a single high-dimensional vector. In addition to word distribution, these vector representations consider several types of information. Moreover, owing to the properties of high-dimensional spaces, they have the additional advantage of being interpretable. We analyze the model’s performance on the SimLex-999 dataset, a dataset where commonly used distributional models (e.g., word2vec or GloVe) perform poorly. Our results are similar to those of other hybrid models, and they surpass several state-of-the-art distributional and knowledge-based models.

PUBLICATION RECORD

Publication year
2020
Venue
IEEE Access
Publication date
2020-06-11
Fields of study
Computer Science
Identifiers
DOI 10.1109/ACCESS.2020.3001765
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

A comprehensive analysis of the parameters in the creation and comparison of feature vectors in distributional semantic models for multiple languages
2020cited by this paper
Exploring Storing Capacity of Hyperdimensional Binary Vectors
2019cited by this paper
Vector Symbolic Architectures and their applications: Computing with random vectors in a hyperdimensional space
2018cited by this paper
Classification and Recall With Binary Hyperdimensional Computing: Tradeoffs in Choice of Density and Mapping Characteristics
2018cited by this paper
Antonyms are similar: Towards paradigmatic association approach to rating similarity in SimLex-999 and WordSim-353
2018cited by this paper
Semantic Specialisation of Distributional Word Vector Spaces using Monolingual and Cross-Lingual Constraints
2017cited by this paper
Sequence Prediction with Hyperdimensional Computing
2017cited by this paper
AutoExtend: Combining Word Embeddings with Semantic Resources
2017cited by this paper
Concepts as Semantic Pointers: A Framework and Computational Model
2016cited by this paper
ConceptNet 5.5: An Open Multilingual Graph of General Knowledge
2016cited by this paper
Enhancing the LexVec Distributed Word Representation Model Using Positional Contexts and External Memory
2016cited by this paper
Learning Word Meta-Embeddings
2016cited by this paper
Charagram: Embedding Words and Sentences via Character n-grams
2016cited by this paper
Measuring Semantic Similarity of Words Using Concept Networks
2016cited by this paper
Lemon and Tea Are Not Similar: Measuring Word-to-Word Similarity by Combining Different Methods
2015cited by this paper
Non-distributional Word Vector Representations
2015cited by this paper
SimLex-999: Evaluating Semantic Models With (Genuine) Similarity Estimation
2014influential reference
Computing with 10,000-bit words
2014cited by this paper
An Introduction to Language
2014cited by this paper
How to build a brain
2014cited by this paper
GloVe: Global Vectors for Word Representation
2014cited by this paper
Modular Composite Representation
2014cited by this paper
Analogical Mapping with Sparse Distributed Memory: A Simple Model that Learns to Generalize from Examples
2014cited by this paper
Distributed Representations of Words and Phrases and their Compositionality
2013cited by this paper
An Introduction to Information Retrieval
2013cited by this paper
Efficient Estimation of Word Representations in Vector Space
2013cited by this paper
Multimodal Distributional Semantics
2013cited by this paper
Representing Objects, Relations, and Sequences
2013cited by this paper
The Centre for Speech, Language and the Brain (CSLB) concept property norms
2013cited by this paper
Semantic and associative relations in adolescents and young adults: Examining a tenuous dichotomy.
2012cited by this paper
Using Information Content to Evaluate Semantic Similarity on HowNet
2012cited by this paper
Learning Word Vectors for Sentiment Analysis
2011cited by this paper
Strudel: A Corpus-Based Semantic Model Based on Properties and Types
2010cited by this paper
Strudel: A distributional semantic model based on properties and types
2010cited by this paper
From Frequency to Meaning: Vector Space Models of Semantics
2010cited by this paper
Hyperdimensional Computing: An Introduction to Computing in Distributed Representation with High-Dimensional Random Vectors
2009influential reference
Wikipedia-based Semantic Interpretation for Natural Language Processing
2009cited by this paper
Improved Statistical Machine Translation Using Monolingually-Derived Paraphrases
2009cited by this paper
A Study on Similarity and Relatedness Using Distributional and WordNet-based Approaches
2009influential reference
The Distributional Hypothesis
2008cited by this paper
An effective, low-cost measure of semantic relatedness obtained from Wikipedia links
2008cited by this paper
Information-theoretic and Set-theoretic Similarity
2006cited by this paper
Information Retrieval by Semantic Similarity
2006cited by this paper
Similarity of Semantic Relations
2006cited by this paper
Evaluating WordNet-based Measures of Lexical Semantic Relatedness
2006cited by this paper
Learning Concept Hierarchies from Text Corpora using Formal Concept Analysis
2005cited by this paper
Semantic feature production norms for a large set of living and nonliving things
2005cited by this paper
WordNet::Similarity - Measuring the Relatedness of Concepts
2004cited by this paper
Extended Gloss Overlaps as a Measure of Semantic Relatedness
2003cited by this paper
Placing search in context: the concept revisited
2002cited by this paper
Improvements in Automatic Thesaurus Extraction
2002cited by this paper
Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language
1999cited by this paper
Combining local context and wordnet similarity for word sense identification
1998cited by this paper
An Information-Theoretic Definition of Similarity
1998cited by this paper
Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy
1997cited by this paper
Binary Spatter-Coding of Ordered K-Tuples
1996cited by this paper
Using Information Content to Evaluate Semantic Similarity in a Taxonomy
1995cited by this paper
Semantic and Associative Priming in a Distributed Attractor Network
1995cited by this paper
Verb Semantics and Lexical Selection
1994cited by this paper
Contextual correlates of semantic similarity
1991cited by this paper
Sparse Distributed Memory
1988cited by this paper
Features of Similarity
1977influential reference
Contextual correlates of synonymy
1965influential reference
The analysis of proximities: Multidimensional scaling with an unknown distance function. I.
1962cited by this paper
The analysis of proximities: Multidimensional scaling with an unknown distance function. II
1962cited by this paper
Distributional Structure
1954cited by this paper

CITED BY

A novel Vector-Symbolic Architecture for graph encoding and its application to viral pangenome-based species classification
2025cites this paper
Hyperdimensional Quantum Factorization
2024cites this paper
The blessing of dimensionality
2024cites this paper
HDQMF: Holographic Feature Decomposition using Quantum Algorithms
2024cites this paper
Capacity Analysis of Vector Symbolic Architectures
2023influential citation
hdlib: A Python library for designing Vector-Symbolic Architectures
2023cites this paper
Symbolic Hyperdimensional Vectors with Sparse Graph Convolutional Neural Networks
2022cites this paper
A Survey on Hyperdimensional Computing aka Vector Symbolic Architectures, Part II: Applications, Cognitive Models, and Challenges
2021cites this paper