Predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature

Published 2011 in Journal of Biomedical Semantics

ABSTRACT

BackgroundThis paper presents a novel approach to the problem of hedge detection, which involves identifying so-called hedge cues for labeling sentences as certain or uncertain. This is the classification problem for Task 1 of the CoNLL-2010 Shared Task, which focuses on hedging in the biomedical domain. We here propose to view hedge detection as a simple disambiguation problem, restricted to words that have previously been observed as hedge cues. As the feature space for the classifier is still very large, we also perform experiments with dimensionality reduction using the method of random indexing.ResultsThe SVM-based classifiers developed in this paper achieves the best published results so far for sentence-level uncertainty prediction on the CoNLL-2010 Shared Task test data. We also show that the technique of random indexing can be successfully applied for reducing the dimensionality of the original feature space by several orders of magnitude, without sacrificing classifier performance.ConclusionsThis paper introduces a simplified approach to detecting speculation or uncertainty in text, focusing on the biomedical domain. Evaluated at the sentence-level, our SVM-based classifiers achieve the best published results so far. We also show that the feature space can be aggressively compressed using random indexing while still maintaining comparable classifier performance.

PUBLICATION RECORD

Publication year
2011
Venue
Journal of Biomedical Semantics
Publication date
2011-10-06
Fields of study
Medicine, Computer Science
Identifiers
DOI 10.1186/2041-1480-2-S5-S7 PMID 22166306 PMCID 3239307
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar, PubMed

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

A Cascade Method for Detecting Hedges and their Scope in Natural Language Text
2010influential reference
Bayesian Learning in Sparse Graphical Factor Models via Variational Mean-Field Annealing
2010cited by this paper
Proceedings of the 32nd Annual Conference of the Cognitive Science Society
2010cited by this paper
Resolving Speculation: MaxEnt Cue Classification and Dependency-Based Scope Rules
2010influential reference
The CoNLL-2010 Shared Task: Learning to Detect Hedges and their Scope in Natural Language Text
2010influential reference
Detecting Speculative Language Using Syntactic Dependencies and Logistic Regression
2010cited by this paper
Syntactic Scope Resolution in Uncertainty Analysis
2010cited by this paper
Cross-framework parser stacking for data-driven dependency parsing
2009cited by this paper
Proceedings of the BioNLP 2009 Workshop
2009cited by this paper
Learning the Scope of Hedge Cues in Biomedical Texts
2009cited by this paper
The BioScope corpus: annotation for negation, uncertainty and their scope in biomedical texts
2008cited by this paper
MaltParser: A Data-Driven Parser-Generator for Dependency Parsing
2006cited by this paper
Very sparse random projections
2006cited by this paper
KDD-2006 : proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 20-23, 2006, Philadelphia, PA, USA
2006cited by this paper
Unsupervised Learning of Multiple Aspects of Moving Objects from Video
2005cited by this paper
An Introduction to Random Indexing
2005cited by this paper
Automatic bilingual lexicon acquisition using random indexing of parallel corpora
2005cited by this paper
Developing a Robust Part-of-Speech Tagger for Biomedical Text
2005cited by this paper
An Introduction to Variable and Feature Selection
2003cited by this paper
Proceedings of the sixth conference on Applied natural language processing
2000cited by this paper
On building a more effcient grammar by exploiting types
2000cited by this paper
The Nature of Statistical Learning Theory
2000cited by this paper
Random indexing of text samples for latent semantic analysis
2000cited by this paper
Proceedings of the 22nd Annual Conference of the Cognitive Science Society
2000cited by this paper
TnT – A Statistical Part-of-Speech Tagger
2000cited by this paper
Advances in kernel methods: support vector learning
1999cited by this paper
Making large scale SVM learning practical
1998cited by this paper
The Nature of Statistical Learning
1995cited by this paper
Computational Intelligence: Imitating Life
1994cited by this paper
Building a Large Annotated Corpus of English: The Penn Treebank
1993cited by this paper
Extensions of Lipschitz mappings into Hilbert space
1984cited by this paper

CITED BY

An uncertainty and conviction-aware attention model for automatically estimating reviewer confidence from peer review texts
2026cites this paper
"You might think about slightly revising the title”: Identifying Hedges in Peer-tutoring Interactions
2023cites this paper
Creating an Ignorance-Base: Exploring Known Unknowns in the Scientific Literature
2023cites this paper
HedgePeer: A Dataset for Uncertainty Detection in Peer Reviews
2022cites this paper
Resolving the Scope of Speculation and Negation using Transformer-Based Architectures
2020cites this paper
Multitask Learning of Negation and Speculation using Transformers
2020cites this paper
Using Structured Representation and Data: A Hybrid Model for Negation and Sentiment in Customer Service Conversations
2019cites this paper
Crowdsourced Hedge Term Disambiguation
2019cites this paper
Negation and Speculation Detection
2019cites this paper
Sentiment and Stance Visualization of Textual Data for Social Media
2019cites this paper
Using Hedge Detection to Improve Committed Belief Tagging
2018cites this paper
Understanding the Semantics of Narratives of Interpersonal Violence through Reader Annotations and Physiological Reactions
2017cites this paper
An open-source tool for negation detection: a maximum-margin approach
2017cites this paper
Random indexing of multidimensional data
2016cites this paper
Speculation detection for Chinese clinical notes: Impacts of word segmentation and embedding models
2016cites this paper
A portable toolkit for detecting negation
2016cites this paper
Visual analysis of online social media to open up the investigation of stance phenomena
2015cites this paper
Detecting Semantic Uncertainty by Learning Hedge Cues in Sentences Using an HMM
2014cites this paper
Ubiquitous Cognitive Computing: A Vector Symbolic Approach
2014cites this paper
ContextD: an algorithm to identify contextual properties of medical terms in a Dutch clinical corpus
2014cites this paper
UiO1: Constituent-Based Discriminative Ranking for Negation Resolution
2012cites this paper
Hedging their Mets: The Use of Uncertainty Terms in Clinical Documents and its Potential Implications when Sharing the Documents with Patients
2012cites this paper
Modality and Negation: An Introduction to the Special Issue
2012cites this paper
UiO 2: Sequence-labeling Negation Using Dependency Features
2012cites this paper
Speculation and Negation: Rules, Rankers, and the Role of Syntax
2012influential citation
Resolving Speculation and Negation Scope in Biomedical Articles with a Syntactic Constituent Ranker
2011cites this paper
Towards mature use of semantic resources for biomedical analyses
2011cites this paper