A Southern African corpus for multilingual name pronunciation
Loading...
Date
Authors
Giwa, Oluwapelumi
Davel, Marelie H.
Barnard, Etienne
Researcher ID
Supervisors
Journal Title
Journal ISSN
Volume Title
Publisher
Pattern Recognition Association of South Africa and Mechatronics International Conference
Record Identifier
Abstract
We describe the challenges that arise in predicting
the pronunciations of proper names in a multilingual society.
In order to improve our understanding of this issue – which
is of significant practical importance for applications of speech
technology – we have designed and collected a multilingual
corpus of proper names. Both the names and the speakers
are drawn from four South African languages, namely isiZulu,
Sesotho, English and Afrikaans. We describe how the corpus was
designed in order to probe the interaction between the speaker’s
language and the origin of the name, and discuss the practical
steps that were taken in collecting the spoken utterances. A
statistical investigation of the prompt material reveals some of
the systematic differences between the languages.
Sustainable Development Goals
Description
Citation
Oluwapelumi Giwa, Marelie H Davel and Etienne Barnard, “A Southern African corpus for multilingual name pronunciation”, in Proc. Annual Symp. Pattern Recognition Association of South Africa (PRASA), pp 49-53, Vanderbijlpark, South Africa, 2011. [http://engineering.nwu.ac.za/multilingual-speech-technologies-must/publications]
