On Multilingual Training of Neural Dependency Parsers

Michal Zapotoczny,Paweł Rychlikowski,J. Chorowski

Published 2017 in International Conference on Text, Speech and Dialogue

ABSTRACT

We show that a recently proposed neural dependency parser can be improved by joint training on multiple languages from the same family. The parser is implemented as a deep neural network whose only input is orthographic representations of words. In order to successfully parse, the network has to discover how linguistically relevant concepts can be inferred from word spellings. We analyze the representations of characters and words that are learned by the network to establish which properties of languages were accounted for. In particular we show that the parser has approximately learned to associate Latin characters with their Cyrillic counterparts and that it can group Polish and Russian words that have a similar grammatical function. Finally, we evaluate the parser on selected languages from the Universal Dependencies dataset and show that it is competitive with other recently proposed state-of-the art methods, while having a simple structure.

PUBLICATION RECORD

Publication year
2017
Venue
International Conference on Text, Speech and Dialogue
Publication date
2017-05-29
Fields of study
Linguistics, Computer Science
Identifiers
DOI 10.1007/978-3-319-64206-2_37 arXiv 1705.10209
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

SyntaxNet Models for the CoNLL 2017 Shared Task
2017cited by this paper
Many Languages, One Parser
2016influential reference
Dependency Parsing as Head Selection
2016cited by this paper
Read, Tag, and Parse All at Once, or Fully-neural Dependency Parsing
2016influential reference
Simple and Accurate Dependency Parsing Using Bidirectional LSTM Feature Representations
2016cited by this paper
Deep Biaffine Attention for Neural Dependency Parsing
2016cited by this paper
Globally Normalized Transition-Based Neural Networks
2016influential reference
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
2016cited by this paper
Exploring the Limits of Language Modeling
2016cited by this paper
Transition-Based Dependency Parsing with Stack Long Short-Term Memory
2015cited by this paper
Highway Networks
2015cited by this paper
Cross-lingual Dependency Parsing Based on Distributed Representations
2015cited by this paper
Universal Dependencies 1.4
2015influential reference
Character-Aware Neural Language Models
2015cited by this paper
A Neural Network Model for Low-Resource Universal Dependency Parsing
2015cited by this paper
Improved Transition-based Parsing by Modeling Characters instead of Words with LSTMs
2015cited by this paper
Blocks and Fuel: Frameworks for deep learning
2015cited by this paper
Sequence to Sequence Learning with Neural Networks
2014cited by this paper
End-to-end Continuous Speech Recognition using Attention-based Recurrent NN: First Results
2014cited by this paper
Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation
2014cited by this paper
Dropout: a simple way to prevent neural networks from overfitting
2014cited by this paper
Grammar as a Foreign Language
2014cited by this paper
Neural Machine Translation by Jointly Learning to Align and Translate
2014cited by this paper
A Fast and Accurate Dependency Parser using Neural Networks
2014cited by this paper
Maxout Networks
2013cited by this paper
Distributed Representations of Words and Phrases and their Compositionality
2013cited by this paper
ADADELTA: An Adaptive Learning Rate Method
2012cited by this paper
Extensions of recurrent neural network language model
2011cited by this paper
Theano: A CPU and GPU Math Compiler in Python
2010cited by this paper
Recurrent neural network based language model
2010cited by this paper
Algorithms for Deterministic Incremental Dependency Parsing
2008cited by this paper
MaltParser: A language-independent system for data-driven dependency parsing
2007cited by this paper
A Latent Variable Model for Generative Dependency Parsing
2007cited by this paper
Multitask Learning
1997cited by this paper
Bidirectional recurrent neural networks
1997cited by this paper
Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations
1986cited by this paper
Linguistic I Ssues in L Anguage Technology Lilt on Achieving and Evaluating Language-independence in Nlp on Achieving and Evaluating Language-independence in Nlp
year unknowncited by this paper

CITED BY

Detecting Troll Tweets in a Bilingual Corpus
2020cites this paper
Statistical Language and Speech Processing: 8th International Conference, SLSP 2020, Cardiff, UK, October 14–16, 2020, Proceedings
2020cites this paper
Exploring Parameter Sharing Techniques for Cross-Lingual and Cross-Task Supervision
2020cites this paper