Improving Sampling-based Alignment by Investigating the Distribution of N-grams in Phrase Translation Tables

Juan Luo, Adrien Lardilleux, Y. Lepage

Published 2011 in Pacific Asia Conference on Language, Information and Computation

ABSTRACT

This paper describes an approach to improve the performance of sampling-based multilingual alignment on translation tasks by investigating the distribution of n-grams in the translation tables. This approach consists in enforcing the alignment of n-grams. The quality of phrase translation tables output by this approach and that of MGIZA++ is compared in statistical machine translation tasks. Significant improvements for this approach are reported. In addition, merging translation tables is shown to outperform state-of-the-art techniques.

PUBLICATION RECORD

  • Publication year

    2011

  • Venue

    Pacific Asia Conference on Language, Information and Computation

  • Publication date

    2011-12-01

  • Fields of study

    Computer Science

  • Source metadata

    Semantic Scholar
