We describe Joshua, an open source toolkit for statistical machine translation. Joshua implements all of the algorithms required for synchronous context free grammars (SCFGs): chart-parsing, n-gram language model integration, beam-and cube-pruning, and k-best extraction. The toolkit also implements suffix-array grammar extraction and minimum error rate training. It uses parallel and distributed computing techniques for scalability. We demonstrate that the toolkit achieves state of the art translation performance on the WMT09 French-English translation task.
Demonstration of Joshua: An Open Source Toolkit for Parsing-based Machine Translation
Zhifei Li,Chris Callison-Burch,Chris Dyer,Juri Ganitkevitch,S. Khudanpur,Lane Schwartz,Wren N. G. Thornton,Jonathan Weese,Omar Zaidan
Published 2009 in Annual Meeting of the Association for Computational Linguistics
ABSTRACT
PUBLICATION RECORD
- Publication year
2009
- Venue
Annual Meeting of the Association for Computational Linguistics
- Publication date
2009-03-01
- Fields of study
Linguistics, Computer Science
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-18 of 18 references · Page 1 of 1
CITED BY
Showing 1-17 of 17 citing papers · Page 1 of 1