Revisiting Embedding Features for Simple Semi-supervised Learning

Jiang Guo,Wanxiang Che,Haifeng Wang,Ting Liu

Published 2014 in Conference on Empirical Methods in Natural Language Processing

ABSTRACT

Recent work has shown success in using continuous word embeddings learned from unlabeled data as features to improve supervised NLP systems, which is regarded as a simple semi-supervised learning mechanism. However, fundamental problems on effectively incorporating the word embedding features within the framework of linear models remain. In this study, we investigate and analyze three different approaches, including a new proposed distributional prototype approach, for utilizing the embedding features. The presented approaches can be integrated into most of the classical linear models in NLP. Experiments on the task of named entity recognition show that each of the proposed approaches can better utilize the word embedding features, among which the distributional prototype approach performs the best. Moreover, the combination of the approaches provides additive improvements, outperforming the dense and continuous embedding features by nearly 2 points of F1 score.

PUBLICATION RECORD

Publication year
2014
Venue
Conference on Empirical Methods in Natural Language Processing
Publication date
2014-10-01
Fields of study
Computer Science
Identifiers
DOI 10.3115/v1/D14-1012
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

word2vec Explained: deriving Mikolov et al.'s negative-sampling word-embedding method
2014cited by this paper
Learning Sense-specific Word Embeddings By Exploiting Bilingual Resources
2014cited by this paper
Distributed Representations of Words and Phrases and their Compositionality
2013cited by this paper
Improved Part-of-Speech Tagging for Online Conversational Text with Word Clusters
2013cited by this paper
Effect of Non-linear Deep Architecture in Sequence Labeling
2013cited by this paper
Generalization of Words for Chinese Dependency Parsing
2013cited by this paper
Compound Embedding Features for Semi-supervised Learning
2013influential reference
Efficient Estimation of Word Representations in Vector Space
2013cited by this paper
Statistical Language Models Based on Neural Networks
2012cited by this paper
Representation Learning: A Review and New Perspectives
2012cited by this paper
Improving Word Representations via Global Context and Multiple Word Prototypes
2012cited by this paper
Natural Language Processing (Almost) from Scratch
2011influential reference
Combined regression and ranking
2010cited by this paper
Word Representations: A Simple and General Method for Semi-Supervised Learning
2010influential reference
Distributional Representations for Handling Sparsity in Supervised Sequence-Labeling
2009cited by this paper
Normalized (pointwise) mutual information in collocation extraction
2009influential reference
Design Challenges and Misconceptions in Named Entity Recognition
2009cited by this paper
Simple Semi-supervised Dependency Parsing
2008influential reference
LIBLINEAR: A Library for Large Linear Classification
2008cited by this paper
A Scalable Hierarchical Distributed Language Model
2008cited by this paper
An Effective Two-Stage Model for Exploiting Non-Local Dependencies in Named Entity Recognition
2006influential reference
Prototype-Driven Learning for Sequence Models
2006cited by this paper
Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling
2005influential reference
Semi-Supervised Learning for Natural Language
2005influential reference
A High-Performance Semi-Supervised Learning Method for Text Chunking
2005cited by this paper
Name Tagging with Word Clusters and Discriminative Training
2004cited by this paper
Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition
2003cited by this paper
A Neural Probabilistic Language Model
2003cited by this paper
A neural probabilistic language model
2003cited by this paper
Class-Based n-gram Models of Natural Language
1992cited by this paper

CITED BY

Knowledge Graph for Chinese Shipbuilding Standards: Construction Methodology and Practical Application
2025cites this paper
Application of machine and deep learning techniques on sport non-fungible token tweets: exploration of perceived values and risks
2025cites this paper
Interpretable Neural Embeddings with Sparse Self-Representation
2023cites this paper
Automated Travel History Extraction From Clinical Notes for Informing the Detection of Emergent Infectious Disease Events: Algorithm Development and Validation
2021cites this paper
Generation of Cross-Lingual Word Vectors for Low-Resourced Languages Using Deep Learning and Topological Metrics in a Data-Efficient Way
2021cites this paper
Training Cross-Lingual embeddings for Setswana and Sepedi
2021cites this paper
Bayesian estimation-based sentiment word embedding model for sentiment analysis
2021cites this paper
The Early Modern Dutch Mediascape. Detecting Media Mentions in Chronicles Using Word Embeddings and CRF
2021cites this paper
SEMIE: SEMantically Infused Embeddings with Enhanced Interpretability for Domain-specific Small Corpus
2021cites this paper
A constrained optimization algorithm for learning GloVe embeddings with semantic lexicons
2020cites this paper
BUCC2020: Bilingual Dictionary Induction using Cross-lingual Embedding
2020cites this paper
Evaluating Sparse Interpretable Word Embeddings for Biomedical Domain
2020cites this paper
Tibetan-Chinese cross-lingual word embeddings based on MUSE
2020cites this paper
Contextualized French Language Models for Biomedical Named Entity Recognition
2020cites this paper
When BERT meets Bilbo: a learning curve analysis of pretrained language model on disease classification
2020cites this paper
Incorporating Lexicon for Named Entity Recognition of Traditional Chinese Medicine Books
2020cites this paper
Named Entity Recognition in Chemical Patents using Ensemble of Contextual Language Models
2020cites this paper
Detecting Adverse Drug Events with Rapidly Trained Classification Models
2019cites this paper
Transformation of Dense and Sparse Text Representations
2019cites this paper
Recognizing software names in biomedical literature using machine learning
2019cites this paper
Research and Applications 2018 n2c2 shared task on adverse drug events and medication extraction in electronic health records
2019cites this paper
A Word Similarity Feature-based Semi-supervised Approach for Named Entity Recognition
2019cites this paper
Selecting a text similarity measure for a content-based recommender system
2019cites this paper
Improving Chemical Named Entity Recognition in Patents with Contextualized Word Embeddings
2019cites this paper
Visualising and evaluating the effects of combining active learning with word embedding features
2019cites this paper
Learning Tibetan-Chinese Cross-Lingual Word Embeddings
2019cites this paper
Improving word embeddings projection for Turkish hypernym extraction
2019cites this paper
448. Can Electronic Clinical Notes Identify Travelers with Zika?
2018cites this paper
Towards Effective Extraction and Linking of Software Mentions from User-Generated Support Tickets
2018cites this paper
Exploiting Syntactic and Semantic Kernels for Target-Polarity Word Collocation Extraction
2018cites this paper
xSense: Learning Sense-Separated Sparse Representations and Textual Definitions for Explainable Word Sense Networks
2018cites this paper
Arabic Named Entity Recognition Using Topic Modeling
2018cites this paper
Concept-Based Embeddings for Natural Language Processing
2018cites this paper
A CRF-Based Stacking Model with Meta-features for Named Entity Recognition
2018cites this paper
What’s in Your Embedding, And How It Predicts Task Performance
2018cites this paper
Leveraging external information in topic modelling
2018cites this paper
Arabic Named Entity Recognition using Word Representations
2018influential citation
Comprehensive study on Opinion Mining for User Reviews for Mobile App Comparisons
2018cites this paper
APIReal: an API recognition and linking approach for online developer forums
2018influential citation
Design and Empirical Evaluation of Interactive and Interpretable Machine Learning
2018cites this paper
A COMPARATIVE STUDY OF WORD REPRESENTATION METHODS WITH CONDITIONAL RANDOM FIELDS AND MAXIMUM ENTROPY MARKOV FOR BIO-NAMED ENTITY RECOGNITION
2018influential citation
Word embeddings for negation detection in health records written in Spanish
2018cites this paper
Resurgence of Deep Learning: Genesis of Word Embedding
2018cites this paper
Using Word Embeddings with Linear Models for Short Text Classification
2018cites this paper
Knowledgebase construction of genetic variants in literature
2018cites this paper
Learning Word Embeddings for Low-Resource Languages by PU Learning
2018cites this paper
Learning Turkish Hypernymy Using Word Embeddings
2018cites this paper
Hybrid system for adverse drug event detection
2018cites this paper
Cross-lingual Name Tagging and Linking for 282 Languages
2017cites this paper
Interweaving Domain Knowledge and Unsupervised Learning for Psychiatric Stressor Extraction from Clinical Notes
2017cites this paper
Fine-Grained Opinion Mining from Mobile App Reviews with Word Embedding Features
2017cites this paper
MetaLDA: A Topic Model that Efficiently Incorporates Meta Information
2017cites this paper
Synapse at CAp 2017 NER challenge: Fasttext CRF
2017cites this paper
Beyond Bilingual: Multi-sense Word Embeddings using Multilingual Context
2017influential citation
What is the Essence of a Claim? Cross-Domain Claim Identification
2017cites this paper
Spoken Language Understanding for a Nutrition Dialogue System
2017cites this paper
Named Entity Recognition with Word Embeddings and Wikipedia Categories for a Low-Resource Language
2017cites this paper
SPINE: SParse Interpretable Neural Embeddings
2017cites this paper
Semantic Specialisation of Distributional Word Vector Spaces using Monolingual and Cross-Lingual Constraints
2017cites this paper
On the Effectiveness of Feature Set Augmentation Using Clusters of Word Embeddings
2017cites this paper
Psychiatric symptom recognition without labeled data using distributional representations of phrases and on-line knowledge.
2017cites this paper
Clinical Event Detection with Hybrid Neural Architecture
2017cites this paper
Word Embeddings as Features for Supervised Coreference Resolution
2017cites this paper
A Hierarchical Book Representation of Word Embeddings for Effective Semantic Clustering and Search
2017cites this paper
A hybrid approach to automatic de-identification of psychiatric notes.
2017cites this paper
A Hierarchical Playscript Representation of Distributed Words for Effective Semantic Clustering and Search
2017cites this paper
PROTOTYPICAL FOR BIOMEDICAL NAMED ENTITY RECOGNITION
2017influential citation
Extraction of Clinical Timeline from Discharge Summaries using Neural Networks
2017cites this paper
VerbNet/OntoNotes-Based Sense Annotation
2017cites this paper
Extracting References from Political Speech Auto-Transcripts
2017cites this paper
Agents and Artificial Intelligence
2017influential citation
Semantic Specialization of Distributional Word Vector Spaces using Monolingual and Cross-Lingual Constraints
2017cites this paper
Intrinsic Evaluations of Word Embeddings: What Can We Do Better?
2016cites this paper
Background and Related Work
2016cites this paper
The 54th Annual Meeting of the Association for Computational Linguistics
2016cites this paper
Assessing the Corpus Size vs. Similarity Trade-off for Word Embeddings in Clinical NLP
2016cites this paper
Exploring Segment Representations for Neural Segmentation Models
2016cites this paper
Learning Compact Neural Word Embeddings by Parameter Space Sharing
2016cites this paper
Word Embeddings with Limited Memory
2016influential citation
A Distributed Representation-Based Framework for Cross-Lingual Transfer Parsing
2016cites this paper
Feature-enriched word embeddings for named entity recognition in open-domain conversations
2016influential citation
Learning to Extract API Mentions from Informal Natural Language Discussions
2016influential citation
Distributional semantics for understanding spoken meal descriptions
2016cites this paper
Correlation-based Intrinsic Evaluation of Word Vector Representations
2016cites this paper
Polyglot Neural Language Models: A Case Study in Cross-Lingual Phonetic Representation Learning
2016cites this paper
Label Embedding for Zero-shot Fine-grained Named Entity Typing
2016cites this paper
Argumentation Mining in User-Generated Web Discourse
2016cites this paper
Spanish NER with Word Representations and Conditional Random Fields
2016influential citation
Automatic generation of tunable analogy benchmarks for word representations
2016cites this paper
EVALUATING DISTRIBUTED WORD REPRESENTATIONS FOR PREDICTING MISSING WORDS IN SENTENCES
2016cites this paper
Exploring Unsupervised Features in Conditional Random Fields for Spanish Named Entity Recognition
2016influential citation
Conditional Random Fields for Spanish Named Entity Recognition Using Unsupervised Features
2016influential citation
Unsupervised Word and Dependency Path Embeddings for Aspect Term Extraction
2016cites this paper
Multi-prototype Chinese Character Embedding
2016cites this paper
Combining Discrete and Neural Features for Sequence Labeling
2016cites this paper
Term Ranker: A Graph-Based Re-Ranking Approach
2016influential citation
Chemical named entity recognition in patents by domain knowledge and unsupervised feature learning
2016cites this paper
The ijk System for EAL at TAC KBP 2016 Event Track
2016cites this paper
Arguments for Semantic Folding and Hierarchical Temporal Memory Theory Copyright
2016cites this paper
A Study of Neural Word Embeddings for Named Entity Recognition in Clinical Text
2015cites this paper