This paper describes an approach to improve the performance of sampling-based multilingual alignment on translation tasks by investigating the distribution of n-grams in the translation tables. This approach consists in enforcing the alignment of n-grams. The quality of phrase translation tables output by this approach and that of MGIZA++ is compared in statistical machine translation tasks. Significant improvements for this approach are reported. In addition, merging translation tables is shown to outperform state-of-the-art techniques.
Improving Sampling-based Alignment by Investigating the Distribution of N-grams in Phrase Translation Tables
Juan Luo,Adrien Lardilleux,Y. Lepage
Published 2011 in Pacific Asia Conference on Language, Information and Computation
ABSTRACT
PUBLICATION RECORD
- Publication year
2011
- Venue
Pacific Asia Conference on Language, Information and Computation
- Publication date
2011-12-01
- Fields of study
Computer Science
- Identifiers
No identifiers available.
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-26 of 26 references · Page 1 of 1
CITED BY
Showing 1-2 of 2 citing papers · Page 1 of 1