This paper describes the latest Speech-to-Text system developed for the Global Autonomous Language Exploitation (“GALE”) domain by Carnegie Mellon University (CMU). This systems uses discriminative training, bottle-neck features and other techniques that were not used in previous versions of our system, and is trained on 1150 hours of data from a variety of Arabic speech sources. In this paper, we show how different lexica, pre-processing, and system combination techniques can be used to improve the final output, and provide analysis of the improvements achieved by the individual techniques. Index Terms: speech recognition, discriminative training, bottle-neck features
The 2010 CMU GALE speech-to-text system
Florian Metze,Roger Hsiao,Qin Jin,Udhyakumar Nallasamy,Tanja Schultz
Published 2010 in Interspeech
ABSTRACT
PUBLICATION RECORD
- Publication year
2010
- Venue
Interspeech
- Publication date
Unknown publication date
- Fields of study
Linguistics, Computer Science
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-19 of 19 references · Page 1 of 1
CITED BY
Showing 1-20 of 20 citing papers · Page 1 of 1