NWU Institutional Repository

Code-switched English pronunciation modeling for Swahili spoken term detection

Loading...
Thumbnail Image

Supervisors

Journal Title

Journal ISSN

Volume Title

Publisher

Procedia Computer Science: Spoken Language Technology for Under-resourced Languages

Record Identifier

Abstract

We investigate modeling strategies for English code-switched words as found in a Swahili spoken term detection system. Code switching, where speakers switch language in a conversation, occurs frequently in multilingual environments, and typically deteriorates STD performance. Analysis is performed in the context of the IARPA Babel program which focuses on rapid STD system development for under-resourced languages. Our results show that approaches that specifically target the modeling of code-switched words, significantly improve the detection performance of these words.

Sustainable Development Goals

Description

Citation

Neil Kleynhans, William Hartman, Daniel van Niekerk, Charl van Heerden, Rich Schwartz, Stavros Tsakalidis and Marelie Davel, “Code-switched English pronunciation modeling for Swahili spoken term detection”, Procedia Computer Science: Spoken Language Technology for Under-resourced Languages, pp 128-135, Yogyakarta, Indonesia, May 2016.

Endorsement

Review

Supplemented By

Referenced By