Code-switched English pronunciation modeling for Swahili spoken term detection
Loading...
Files
Date
Supervisors
Journal Title
Journal ISSN
Volume Title
Publisher
Procedia Computer Science: Spoken Language Technology for Under-resourced Languages
Record Identifier
Abstract
We investigate modeling strategies for English code-switched words as found in a Swahili spoken term detection system. Code switching, where speakers switch language in a conversation, occurs frequently in multilingual environments, and typically deteriorates STD performance. Analysis is performed in the context of the IARPA Babel program which focuses on rapid STD system development for under-resourced languages. Our results show that approaches that specifically target the modeling of code-switched words, significantly improve the detection performance of these words.
Sustainable Development Goals
Description
Citation
Neil Kleynhans, William Hartman, Daniel van Niekerk, Charl van Heerden, Rich Schwartz, Stavros Tsakalidis and Marelie Davel, “Code-switched English pronunciation modeling for Swahili spoken term detection”, Procedia Computer Science: Spoken Language Technology for Under-resourced Languages, pp 128-135, Yogyakarta, Indonesia, May 2016.
