Learning to Resolve Natural Language Ambiguities: A Unified Approach

Published 1998 in AAAI/IAAI

ABSTRACT

We analyze a few of the commonly used statistics based and machine learning algorithms for natural language disambiguation tasks and observe that they can be recast as learning linear separators in the feature space. Each of the methods makes a priori assumptions which it employs, given the data, when searching for its hypothesis. Nevertheless, as we show, it searches a space that is as rich as the space of all linear separators. We use this to build an argument for a data driven approach which merely searches for a good linear separator in the feature space, without further assumptions on the domain or a specific problem.We present such an approach - a sparse network of linear separators, utilizing the Winnow learning algorithm - and show how to use it in a variety of ambiguity resolution problems. The learning approach presented is attribute-efficient and, therefore, appropriate for domains having very large number of attributes.In particular, we present an extensive experimental comparison of our approach with other methods on several well studied lexical disambiguation tasks such as context-sensitive spelling correction, prepositional phrase attachment and part of speech tagging. In all cases we show that our approach either outperforms other methods tried for these tasks or performs comparably to the best.

PUBLICATION RECORD

Publication year
1998
Venue
AAAI/IAAI
Publication date
1998-07-01
Fields of study
Linguistics, Computer Science
Identifiers
arXiv cs/9811010
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

The Nature of Statistical Learning Theory
2000cited by this paper
Part of Speech Tagging Using a Network of Linear Separators
1998influential reference
Learning To Parse?
1998cited by this paper
Projection Learning
1998cited by this paper
Statistical Language Learning
1997cited by this paper
Resolving PP attachment Ambiguities with Memory-Based Learning
1997cited by this paper
Automatic Rule Acquisition for Spelling Correction
1997cited by this paper
Exponentiated Gradient Versus Gradient Descent for Linear Predictors
1997cited by this paper
A Bayesian Hybrid Method for Context-sensitive Spelling Correction
1996cited by this paper
Handling Sparse Data by Successive Abstraction
1996cited by this paper
Combining Trigram-Based and Feature-Based Methods for Context-Sensitive Spelling Correction
1996cited by this paper
Learning to Parse Database Queries Using Inductive Logic Programming
1996cited by this paper
An Empirical Study of Smoothing Techniques for Language Modeling
1996cited by this paper
Robust Trainability of Single Neurons
1995cited by this paper
Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part-of-Speech Tagging
1995influential reference
Embedded machine learning systems for natural language processing: a general framework
1995cited by this paper
The Nature of Statistical Learning
1995influential reference
Statistical language learning
1995cited by this paper
Additive versus exponentiated gradient updates for linear prediction
1995cited by this paper
Prepositional Phrase Attachment through a Backed-off Model
1995cited by this paper
Unsupervised Word Sense Disambiguation Rivaling Supervised Methods
1995cited by this paper
Empirical Support for Winnow and Weighted-Majority Based Algorithms: Results on a Calendar Scheduling Domain
1995cited by this paper
DECISION LISTS FOR LEXICAL AMBIGUITY RESOLUTION: Application to Accent Restoration in Spanish and French
1994cited by this paper
A Rule-Based Approach to Prepositional Phrase Attachment Disambiguation
1994cited by this paper
A Maximum Entropy Model for Prepositional Phrase Attachment
1994cited by this paper
Circuits of the mind
1994cited by this paper
An Introduction to Computational Learning Theory
1994cited by this paper
A method for disambiguating word senses in a large corpus
1992cited by this paper
Robust trainability of single neurons
1992cited by this paper
Redundant noisy attributes, attribute errors, and linear-threshold learning using winnow
1991cited by this paper
Efficient distribution-free learning of probabilistic concepts
1990cited by this paper
Estimation of probabilities from sparse data for the language model component of a speech recognizer
1987cited by this paper
Learning Quickly When Irrelevant Attributes Abound: A New Linear-Threshold Algorithm
1987cited by this paper
A theory of the learnable
1984cited by this paper
A Part of Speech
1977cited by this paper
Pattern classification and scene analysis
1974cited by this paper
Pattern classification and scene analysis
1974cited by this paper

CITED BY

Multiple Face Detection Algorithm for Complex Background Images Based on Edge Structure
2024cites this paper
Noor-Ghateh: A Benchmark Dataset for Evaluating Arabic Word Segmenters in Hadith Domain
2023cites this paper
Benchmarking Scalable Predictive Uncertainty in Text Classification
2022cites this paper
Natural language processing for word sense disambiguation and information extraction
2020cites this paper
Versatile Multiple Choice Learning and Its Application to Vision Computing
2019cites this paper
Exploiting Cross-Lingual Representations For Natural Language Processing
2019cites this paper
Mapping Enterprise Governance of IT Models using Text Analyses
2019cites this paper
The spy saw a cop with a telescope: Who has the telescope?
2018influential citation
Consensus-based modeling using distributed feature construction with ILP
2018cites this paper
Conversational Bootstrapping and Other Tricks of a Concierge Robot
2017cites this paper
Adapting to Learner Errors with Minimal Supervision
2017influential citation
Statistical Unigram Analysis for Source Code Repository
2017cites this paper
Connectionist Symbol Processing : Dead or Alive ?
2017influential citation
Sequence Labeling using Conditional Random Fields
2017cites this paper
A Forward-Selection Algorithm for SVM-Based Question Classification in Cognitive Systems
2016cites this paper
Mobile Context-Aware Systems: Technologies, Resources and Applications
2016cites this paper
A systematic literature review on natural language processing in business process identification and modeling
2016cites this paper
Extracting Compact Sets of Features for Question Classification in Cognitive Systems: A Comparative Study
2015cites this paper
Syntactic Ambiguity in Legal, Political, Media, and Academic Registers of Thai: Patterns and Avoidance
2015cites this paper
Social and Semantic Contexts in Tourist Mobile Applications
2015cites this paper
Identifying the Correct Root of an Ambiguous Hebrew Word
2014cites this paper
Command Recognition through keyword analysis
2014cites this paper
When Errors Become the Rule
2014cites this paper
De l'étiquetage syntaxique pour les grammaires catégorielles de dépendances à l'analyse par transition dans le domaine de l'analyse en dépendances non-projective. (From syntactic tagging for categorial dependency grammars to transition-based parsing in the domain of non-projective dependency parsing
2014cites this paper
Towards a face recognition system : face detection, face registration, and head pose estimation
2014cites this paper
Morphological Processing of Semitic Languages
2014cites this paper
Context and NLP
2014cites this paper
Consensus-Based Modelling using Distributed Feature Construction
2014cites this paper
Propositionalization of Relational Learning : An Information Extraction Case Study a
2013cites this paper
The semantics of role labeling
2013cites this paper
Learning and Inference in Entity and Relation Identification
2012cites this paper
Automatic construction of a domain-independent knowledge base from heterogeneous data sources
2012cites this paper
UNIVERSITY OF ALGARVE FACULTY OF SOCIAL AND HUMAN SCIENCE THE UNIVERSITY OF WOLVERHAMPTON SCHOOL OF LAW, SOCIAL SCIENCES AND COMMUNICATIONS
2012cites this paper
Exploiting knowledge in NLP
2012cites this paper
Semantic Role Labeling for Portuguese - A Preliminary Approach -
2012cites this paper
Morphological disambiguation of Hebrew: a case study in classifier combination
2012influential citation
AUTOMATIC QUESTION GENERATION: A SYNTACTICAL APPROACH TO THE SENTENCE-TO-QUESTION GENERATION CASE
2012cites this paper
Robust recognition of facial expressions on noise degraded facial images
2011cites this paper
Enhanced Question Classification with Optimal Combination of Features
2011cites this paper
When errors become the rule: A survey of Transformation-Based Learning
2011cites this paper
Algorithm Selection and Model Adaptation for ESL Correction Tasks
2011cites this paper
A Survey of State-of-the-Art Methods on Question Classification
2011cites this paper
Tense Sense Disambiguation: A New Syntactic Polysemy Task
2010cites this paper
A Comparative Study of Verbal Discourse Practices in Traditional and Inquiry-Based Undergraduate Biology Labs for Non-Science Majors.
2010cites this paper
Generating Confusion Sets for Context-Sensitive Error Correction
2010influential citation
EFFECTIVE PASSAGE RETRIEVAL IN QUESTION ANSWERING SYSTEMS
2010cites this paper
Robust Dialog Management Through A Context-centric Architecture
2010cites this paper
Large-scale semi-supervised learning for natural language processing
2010influential citation
Mouth open or closed decision for frontal face images with given eye locations
2010cites this paper
Investigating Automatic Alignment Methods for Slide Generation from Academic Papers
2009cites this paper
Constraint-driven transliteration discovery
2009cites this paper
Chinese Function Tag Labeling
2009cites this paper
Context-Sensitive Spelling Correction and Rich Morphology
2009cites this paper
Evaluation de la Performance de la Classification d'un Système Question/Réponse
2009cites this paper
Semantic role labeling using lexicalized tree adjoining grammars
2009cites this paper
Automated creation of Wikipedia articles
2009cites this paper
Winning the KDD Cup Orange Challenge with Ensemble Selection
2009cites this paper
Web-Scale N-gram Models for Lexical Disambiguation
2009cites this paper
Appariement Robuste de Formes Visuelles Complexes, Application à la Détection d'Objets. (Robust matching of complex visual forms, Application to object detection)
2009influential citation
Object Class Recognition Using SNoW with a Part Vocabulary
2009cites this paper
Constraint Driven Transliteration Discovery 1
2009cites this paper
A Supervised Algorithm for Verb Disambiguation into VerbNet Classes
2008cites this paper
Identifying Semitic Roots: Machine Learning with Linguistic Constraints
2008influential citation
Transliteration as Constrained Optimization
2008cites this paper
The Importance of Syntactic Parsing and Inference in Semantic Role Labeling
2008cites this paper
A Cluster-Based Classification Approach to Semantic Role Labeling
2008cites this paper
Multiview Face Detection Using Gabor Filter and Support Vector Machines
2008influential citation
Active Sample Selection for Named Entity Transliteration
2008cites this paper
Global inference for sentence compression : an integer linear programming approach
2008cites this paper
Multilingual dependency parsing: A pipeline approach
2007cites this paper
Applying System Combination to Base Noun Phrase Identi cationErik
2007cites this paper
Face Detection Benchmark Database Data Set Location Description
2007cites this paper
1 Global Inference for Entity and Relation Identification via a Linear Programming Formulation
2007cites this paper
Question Classification in Question Answering Systems
2007cites this paper
Morphological Disambiguation of Hebrew
2007influential citation
The Extraction of Trajectories from Real Texts Based on Linear Classification
2007cites this paper
In Proceedings of NIPS ’ 01 Efficiency versus Convergence of Boolean Kernels for On-Line Learning Algorithms
2007cites this paper
Text Categorisation Using Do ument Pro ling
2007cites this paper
Learning to Identify Semitic Roots
2007cites this paper
Morphological Disambiguation of Hebrew: A Case Study in Classifier Combination
2007cites this paper
Audio-Visual Affect Recognition
2007influential citation
IR / 0 11 00 53 v 1 2 6 O ct 2 00 1 Machine Learning in Automated Text Categorization
2006cites this paper
" 01 # " ' $ 2 Laboratory in Natural Language Processing ( 203 . 4650 )
2006cites this paper
The Extraction of Spatial Relationships from Text Based on Hybrid Method
2006cites this paper
Named Entity Discovery in Multilingual Comparable Corpora
2006influential citation
Named Entity Transliteration and Discovery from Multilingual Comparable Corpora
2006influential citation
Learning to Find Context Based Spelling Errors
2006cites this paper
Adaptive information extraction
2006cites this paper
The Text Mining Handbook: Advanced Approaches in Analyzing Unstructured Data
2006cites this paper
Distributional Thesaurus vs. WordNet: A Comparison of Backoff Techniques for Unsupervised PP Attachment1
2006cites this paper
A Pipeline Model for Bottom-Up Dependency Parsing
2006cites this paper
Named Entity Transliteration and Discovery in Multilingual Corpora
2006cites this paper
A Pipeline Framework for Dependency Parsing
2006influential citation
Weakly Supervised Named Entity Transliteration and Discovery from Multilingual Comparable Corpora
2006cites this paper
Knowledge representation and reasoning based on entity and relation propagation diagram/tree
2006cites this paper
Algorithms and Analysis for Multi-Category Classification
2006cites this paper
Effectiveness of Combined Features for Machine Learning Based Question Classification
2005cites this paper
Distributional Thesaurus Versus WordNet: A Comparison of Backoff Techniques for Unsupervised PP Attachment
2005cites this paper
Toward Concept-Based Text Understanding and Mining
2005cites this paper
Generalized Inference with Multiple Semantic Role Labeling Systems
2005influential citation