Otedama: Fast Rule-Based Pre-Ordering for Machine Translation

Julian Hitschler,Laura Jehl,Sariya Karimova,Mayumi Ohta,Benjamin Körner,S. Riezler

Published 2016 in Prague Bulletin of Mathematical Linguistics

ABSTRACT

Abstract We present Otedama, a fast, open-source tool for rule-based syntactic pre-ordering, a well established technique in statistical machine translation. Otedama implements both a learner for pre-ordering rules, as well as a component for applying these rules to parsed sentences. Our system is compatible with several external parsers and capable of accommodating many source and all target languages in any machine translation paradigm which uses parallel training data. We demonstrate improvements on a patent translation task over a state-of-the-art English-Japanese hierarchical phrase-based machine translation system. We compare Otedama with an existing syntax-based pre-ordering system, showing comparable translation performance at a runtime speedup of a factor of 4.5-10.

PUBLICATION RECORD

  • Publication year

    2016

  • Venue

    Prague Bulletin of Mathematical Linguistics

  • Publication date

    2016-10-01

  • Fields of study

    Linguistics, Computer Science

  • Identifiers
  • External record

    Open on Semantic Scholar

  • Source metadata

    Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

  • No claims are published for this paper.

CONCEPTS

  • No concepts are published for this paper.

REFERENCES

Showing 1-100 of 169 references · Page 1 of 2