This paper proposes a new method for automatic acquisition of Chinese bracketing knowledge from English-Chinese sentence-aligned bilingual corpora. Bilingual sentence pairs are first aligned in syntactic structure by combining English parse trees with a statistical bilingual language model. Chinese bracketing knowledge is then extracted automatically. The preliminary experiments show automatically learned knowledge accords well with manually annotated brackets. The proposed method is particularly useful to acquire bracketing knowledge for a less studied language that lacks tools and resources found in a second language more studied. Although this paper discusses experiments with Chinese and English, the method is also applicable to other language pairs.
Learning Chinese Bracketing Knowledge Based on a Bilingual Language Model
Yajuan Lü,Sheng Li,T. Zhao,Muyun Yang
Published 2002 in International Conference on Computational Linguistics
ABSTRACT
PUBLICATION RECORD
- Publication year
2002
- Venue
International Conference on Computational Linguistics
- Publication date
2002-08-24
- Fields of study
Linguistics, Computer Science
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-8 of 8 references · Page 1 of 1
CITED BY
Showing 1-14 of 14 citing papers · Page 1 of 1