A Corpus-Based Approach for Building Semantic Lexicons

Published 1997 in Conference on Empirical Methods in Natural Language Processing

ABSTRACT

Semantic knowledge can be a great asset to natural language processing systems, but it is usually hand-coded for each application. Although some semantic information is available in general-purpose knowledge bases such as WordNet and Cyc, many applications require domain-specific lexicons that represent words and categories for a particular topic. In this paper, we present a corpus-based method that can be used to build semantic lexicons for specific categories. The input to the system is a small set of seed words for a category and a representative text corpus. The output is a ranked list of words that are associated with the category. A user then reviews the top-ranked words and decides which ones should be entered in the semantic lexicon. In experiments with five categories, users typically found about 60 words per category in 10-15 minutes to build a core semantic lexicon.
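The workflow in the abstract (seed words and a corpus in, a ranked candidate list out, then human review) can be sketched roughly as follows. This is a minimal illustration, not the paper's algorithm: the abstract does not state how words are scored, so the scoring rule here (the share of a word's occurrences that fall within a small window of a seed word) is an assumption, and the names rank_candidates, window, and the toy corpus are hypothetical.

```python
from collections import Counter


def rank_candidates(sentences, seeds, window=1):
    """Rank non-seed words by how often they occur next to a seed word.

    Assumed scoring rule (not taken from the paper):
    score(w) = occurrences of w within `window` tokens of a seed
               / total occurrences of w in the corpus.
    """
    seeds = set(seeds)
    total = Counter()      # corpus frequency of every word
    near_seed = Counter()  # occurrences that have a seed word nearby
    for tokens in sentences:
        for i, word in enumerate(tokens):
            total[word] += 1
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            neighbours = tokens[lo:i] + tokens[i + 1:hi]
            if any(n in seeds for n in neighbours):
                near_seed[word] += 1
    scores = {w: near_seed[w] / total[w] for w in near_seed if w not in seeds}
    # Highest score first; ties are broken alphabetically for a stable order.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


# Toy, pre-tokenized corpus (hypothetical data for illustration only).
corpus = [
    ["guerrillas", "carried", "rifles", "grenades", "pistols"],
    ["police", "found", "rifles", "dynamite"],
    ["police", "found", "documents"],
]

ranked = rank_candidates(corpus, seeds={"rifles"}, window=1)
for word, score in ranked:
    print(f"{word}\t{score:.2f}")
```

In this sketch the ranked list is only a set of suggestions. As in the abstract, a person would still review the top-ranked words and decide which ones belong in the lexicon; on the toy corpus, context words such as "carried" rank alongside true category members, which is exactly why the review step matters.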

PUBLICATION RECORD

Publication year: 1997
Venue: Conference on Empirical Methods in Natural Language Processing
Publication date: 1997-06-10
Fields of study: Computer Science
Identifiers: arXiv cmp-lg/9706013
Source metadata: Semantic Scholar

REFERENCES

The Ups and Downs of Lexical Acquisition (1994)
A Case-Based Approach to Knowledge Acquisition for Domain-Specific Sentence Analysis (1993)
Coping with Ambiguity and Unknown Words through Probabilistic Models (1993)
University of Massachusetts: Description of the CIRCUS System as Used for MUC-4 (1992)
Word-Sense Disambiguation Using Statistical Models of Roget's Categories Trained on Large Corpora (1992)
Introduction to WordNet: An On-line Lexical Database (1990)
Acquiring Lexical Knowledge from Text: A Case Study (1988)
A Stochastic Parts Program and Noun Phrase Parser for Unrestricted Text (1988)
CYC: Using Common Sense Knowledge to Overcome Brittleness and Knowledge Acquisition Bottlenecks (1986)
Learning Word Meanings From Examples (1983)
Towards a Self-Extending Parser (1979)
FOUL-UP: A Program that Figures Out Meanings of Words from Context (1977)

CITED BY

Towards an Ontology of Human Explanations of Robotic Behavior (2025)
Assessment of terminological density in scientific publications on physical culture (2025)
Short text sentiment analysis combining sentiment lexicon and graph convolutional networks (2024)
Beyond Model Performance: Can Link Prediction Enrich French Lexical Graphs? (2024)
for Sentiment, Affect, and Connotation (2023)
Automatic development of requirement linking matrix based on semantic similarity for robust software development (2022)
Analysis of public sentiment tendency in COVID-19 period based on GA-BiLSTM (2022)
Aspect-Level Semantic and Syntactic Reinforcement for Aspect-Based Sentiment Analysis (2022)
Unsupervised Grammatical Pattern Discovery from Arabic Extra Large Corpora (2021)
Multi-Media Content Clustering and Computer Intelligent Analysis by Text Mining (2021)
A Brief Survey of Sentiment Analysis (2020)
Asymmetric Attributional Word Similarity Measures to Detect the Relations of Textual Generality (2020)
End-to-End Bootstrapping Neural Network for Entity Set Expansion (2020)
Global Bootstrapping Neural Network for Entity Set Expansion (2020)
Lexifield: a system for the automatic building of lexicons by semantic expansion of short word lists (2020)
Building an Arabic Semantic Lexicon for Hajj (2019)
Chinese Micro-Blog Sentiment Analysis Based on Multiple Sentiment Dictionaries and Semantic Rule Sets (2019)
LexScore: A Semantic Approach to Scoring Domain Specific Sentiment Lexicons (2019)
Data-Efficient Image Recognition with Contrastive Predictive Coding (2019)
Exploiting Cross-Lingual Representations For Natural Language Processing (2019)
Tibetan Sentiment Classification Method Based on Semi-Supervised Recursive Autoencoders (2019)
Exploring Crosslingual Word Embeddings for Semantic Classification in Text and Dialogue (2019)
Unsupervised Approaches for Textual Semantic Annotation, A Survey (2019)
Tree-Based Sentiment Dictionary for Affective Computing: A New Approach (2018)
Applying Distributional Compositional Categorical Models of Meaning to Language Translation (2018)
The impact of summarisation on textual entailment - a case study on global warming arguments (2018)
A Retrospective on Mutual Bootstrapping (2018)
A Study on the Convergence and Analysis of Public Opinion in Cross-linguistic Network on Mongolian and Chinese (2017)
A Socio-mathematical and Structure-Based Approach to Model Sentiment Dynamics in Event-Based Text (2017)
Using convolution control block for Chinese sentiment analysis (2017)
A domain-specific sentiment lexicon construction method for stock index directionality (2017)
Improving Information Extraction from Clinical Notes with Multiple Domain Models and Clustering-Based Instance Selection (2017)
Unsupervised Concept Categorization and Extraction from Scientific Document Titles (2017)
Unsupervised Extraction of Representative Concepts from Scientific Literature (2017)
Identifying genotype-phenotype relationships in biomedical text (2017)
Inducing Domain-Specific Sentiment Lexicons from Unlabeled Corpora (2016)
Acquiring Knowledge for Affective State Recognition in Social Media (2016)
Symbiotic Cognitive Computing through Iteratively Supervised Lexicon Induction (2016)
A method for the development of disease-specific reference standards vocabularies from textual biomedical literature resources (2016)
Bootstrapping a Semantic Lexicon on Verb Similarities (2016, influential citation)
A corpus-based lexicon building in Indonesian political context through Indonesian online news media (2016, influential citation)
Combining Pattern-Based and Distributional Similarity for Graph-Based Noun Categorization (2015)
Reflections on Sentiment/Opinion Analysis (2015)
The potential relationship discovery model based on result fusion for biomedical medicine research (2015)
Microblog Orientation Analysis Based on Connectives (2015)
A computational model for task-adapted knowledge organisation: improving learning through concept maps extracted from lecture slides (2015)
Tracking Emotions in Hot Topics: Exploiting Event-specific Emotional Words (2015)
Learning to Mine Chinese Coordinate Terms Using the Web (2015)
Recognition of Patient-Related Named Entities in Noisy Tele-Health Texts (2015)
Semantic Lexicon Induction from Twitter with Pattern Relatedness and Flexible Term Length (2015)
Opinion Holder and Target Extraction based on the Induction of Verbal Categories (2015)
Sentiment Analysis of Data from Online Forums on the Newborn Genome Sequencing (2015)
A Study on Sentiment Computing and Classification of Sina Weibo with Word2vec (2014)
Minimally Supervised Classification to Semantic Categories using Automatically Acquired Symmetric Patterns (2014)
Extracting Aspects and Polarity from Patents (2014)
Simple, Fast and Accurate Taxonomy Learning (2014)
An Approach for Sentiment Tendency Analysis on Comment Text (2014)
Unsupervised Aspect Discovery from Online Consumer Reviews (2014)
Structured Learning for Taxonomy Induction with Belief Propagation (2014)
Constructing an Arabic Opinion Mining Model: With Special Reference to Telecommunication Companies and Hotel Reviews (2014)
Automatic Food Categorization from Large Unlabeled Corpora and Its Impact on Relation Extraction (2014)
SADAATL 2014 COLING Workshop on Synchronic and Diachronic Approaches to Analyzing Technical Language Proceedings of the Workshop (2014)
Automated Arabic Antonym Extraction Using a Corpus Analysis Tool (2014)
A new method for updating word senses in Hindi WordNet (2014)
Separating Brands from Types: an Investigation of Different Features for the Food Domain (2014)
A method for automatic extraction of multiword units representing business aspects from user reviews (2014)
A Hybrid Method of Sentiment Key Sentence Identification Using Lexical Semantics and Syntactic Dependencies (2014)
Bootstrapping Semantic Lexicons for Technical Domains (2013)
Asymmetric Distributional Similarity Measures to Recognize Textual Entailment by Generality. (Mesures de similarité distributionnelle asymétrique pour la détection de l'implication textuelle par généralité) (2013)
Onto.PT: Towards the Automatic Construction of a Lexical Ontology for Portuguese (2013)
Discovery of noun semantic relations based on sentential context analysis (2013)
A Rule Based Answer Extraction System with Stemming & Anaphora Resolution (2013)
Surface Web Semantics for Structured Natural Language Processing (2013)
A Combined Pattern-based and Distributional Approach for Automatic Hypernym Detection in Dutch (2013)
Event representation across genre (2013)
Domain adaptive extraction of topical hierarchies for Expertise Mining (2013)
Concept-based analysis of scientific literature (2013)
Patient information extraction in noisy tele-health texts (2013)
Tailoring the automated construction of large-scale taxonomies using the web (2013)
Workshop on Events: Definition, Detection, Coreference, and Representation, EVENTS@NAACL-HLT 2013, Atlanta, Georgia, USA, June 14, 2013 (2013)
Automatic construction of lexicons, taxonomies, ontologies, and other knowledge structures (2013)
Une approche linguistique de l'évaluation des ressources extraites par analyse distributionnelle automatique (2013)
Inducing Context Gazetteers from Encyclopedic Databases for Named Entity Recognition (2013)
Computational modeling of lexical ambiguity (2012)
A semi-supervised approach to extracting multiword entity names from user reviews (2012, influential citation)
Semi-Supervised Technical Term Tagging With Minimal User Feedback (2012, influential citation)
Bootstrapping via Graph Propagation (2012)
A Semi-Supervised Approach to the Construction of Semantic Lexicons (2012, influential citation)
Ontology Enrichment from Free-text Clinical Documents: A Comparison of Alternative Approaches (2012)
Combining Syntax & Ontologies for Information Extraction (2012)
PONTIFÍCIA UNIVERSIDADE CATÓLICA DO RIO GRANDE DO SUL FACULDADE DE INFORMÁTICA PROGRAMA DE PÓS-GRADUAÇÃO EM CIÊNCIA DA COMPUTAÇÃO MINERAÇÃO DE OPINIÕES APLICADA A MÍDIAS SOCIAIS (2012)
Ensemble-based Semantic Lexicon Induction for Semantic Tagging (2012)
Personalized concept hierarchy construction (2012)
Corpus-Driven Hyponym Acquisition for Turkish Language (2012)
Onto.PT: Towards the Automatic Construction of a Lexical Ontology for Portuguese (2012)
Extracting Semantic Lexicons from Discharge Summaries using Machine Learning and the C-Value Method (2012)
Insights from Network Structure for Text Mining (2011)
Butcher, baker, or candlestick maker? Predicting occupations using predicate-argument relations (2011, influential citation)
Automatic Extraction and Validation of Lexical Ontologies from Text (2011)