This paper presents a successful approach for domain adaptation of a dependency parser via self-training. We improve parsing accuracy for out-of-domain texts with a self-training approach that uses confidence-based methods to select additional training samples. We compare two confidence-based methods: The first method uses the parse score of the employed parser to measure the confidence into a parse tree. The second method calculates the score differences between the best tree and alternative trees. With these methods, we were able to improve the labeled accuracy score by 1.6 percentage points on texts from a chemical domain and by 0.6 on average on texts of three web domains. Our improvements on the chemical texts of 1.5% UAS is substantially higher than improvements reported in previous work of 0.5% UAS. For the three web domains, no positive results for self-training have been reported before.
Domain Adaptation for Dependency Parsing via Self-Training
Juntao Yu,Mohab Elkaref,Bernd Bohnet
Published 2015 in International Workshop/Conference on Parsing Technologies
ABSTRACT
PUBLICATION RECORD
- Publication year
2015
- Venue
International Workshop/Conference on Parsing Technologies
- Publication date
2015-07-01
- Fields of study
Chemistry, Computer Science
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-33 of 33 references · Page 1 of 1
CITED BY
Showing 1-37 of 37 citing papers · Page 1 of 1