Weighting Finite-State Transductions With Neural Context

Pushpendre Rastogi, Ryan Cotterell, Jason Eisner

Published 2016 in North American Chapter of the Association for Computational Linguistics

ABSTRACT

How should one apply deep learning to tasks such as morphological reinflection, which stochastically edit one string to get another? A recent approach to such sequence-to-sequence tasks is to compress the input string into a vector that is then used to generate the output string, using recurrent neural networks. In contrast, we propose to keep the traditional architecture, which uses a finite-state transducer to score all possible output strings, but to augment the scoring function with the help of recurrent networks. A stack of bidirectional LSTMs reads the input string from left to right and right to left, in order to summarize the input context in which a transducer arc is applied. We combine these learned features with the transducer to define a probability distribution over aligned output strings, in the form of a weighted finite-state automaton. This reduces hand-engineering of features, allows learned features to examine unbounded context in the input string, and still permits exact inference through dynamic programming. We illustrate our method on the tasks of morphological reinflection and lemmatization.
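
The exact-inference claim in the abstract can be illustrated with a small sketch. The Python below is not the authors' implementation: it assumes a simple three-operation edit transducer (substitute or copy, insert, delete), and `arc_score` is a hypothetical hand-written stand-in for the arc weights that the paper derives from bidirectional LSTMs. The forward recursion in `log_alignment_sum` (also a hypothetical name) sums, in log space, the scores of all monotone alignments of an input `x` with one candidate output `y`.

```python
import math


def logsumexp(values):
    """Numerically stable log(sum(exp(v))) over a list of floats."""
    finite = [v for v in values if v != -math.inf]
    if not finite:
        return -math.inf
    top = max(finite)
    return top + math.log(sum(math.exp(v - top) for v in finite))


def arc_score(x, i, op, out_char):
    """Hypothetical log-weight of one edit arc applied at input position i.

    Assumption: a toy rule replaces the learned context summary. It only
    looks at the input character to the left of position i.
    """
    left = x[i - 1] if i > 0 else "^"  # "^" marks the start of the input
    if op == "sub":
        base = 1.0 if x[i] == out_char else -1.0  # copying is rewarded
    else:  # "ins" or "del"
        base = -2.0
    bonus = 0.25 if left in "aeiou" else 0.0  # toy context feature
    return base + bonus


def log_alignment_sum(x, y):
    """Log of the total score of all edit alignments that turn x into y."""
    n, m = len(x), len(y)
    # alpha[i][j]: log-sum over alignments of x[:i] with y[:j]
    alpha = [[-math.inf] * (m + 1) for _ in range(n + 1)]
    alpha[0][0] = 0.0
    for i in range(n + 1):
        for j in range(m + 1):
            if i == 0 and j == 0:
                continue
            incoming = []
            if i > 0 and j > 0:  # substitute or copy x[i-1] -> y[j-1]
                incoming.append(alpha[i - 1][j - 1] + arc_score(x, i - 1, "sub", y[j - 1]))
            if i > 0:  # delete x[i-1]
                incoming.append(alpha[i - 1][j] + arc_score(x, i - 1, "del", ""))
            if j > 0:  # insert y[j-1] before input position i
                incoming.append(alpha[i][j - 1] + arc_score(x, i, "ins", y[j - 1]))
            alpha[i][j] = logsumexp(incoming)
    return alpha[n][m]
```

Because each arc score depends only on the input string and the current position, the table has `(len(x) + 1) * (len(y) + 1)` cells no matter how much input context the scoring function reads. Turning this quantity into a probability would also require a normalizer over all output strings, which the sketch omits.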

PUBLICATION RECORD

Publication year: 2016
Venue: North American Chapter of the Association for Computational Linguistics
Publication date: 2016-06-01
Fields of study: Computer Science
Identifiers: DOI 10.18653/v1/N16-1076
Source metadata: Semantic Scholar
