A Southern African corpus for multilingual name pronunciation
| dc.contributor.author | Giwa, Oluwapelumi | |
| dc.contributor.author | Davel, Marelie H. | |
| dc.contributor.author | Barnard, Etienne | |
| dc.date.accessioned | 2018-03-07T07:55:56Z | |
| dc.date.available | 2018-03-07T07:55:56Z | |
| dc.date.issued | 2011 | |
| dc.description.abstract | We describe the challenges that arise in predicting the pronunciations of proper names in a multilingual society. In order to improve our understanding of this issue – which is of significant practical importance for applications of speech technology – we have designed and collected a multilingual corpus of proper names. Both the names and the speakers are drawn from four South African languages, namely isiZulu, Sesotho, English and Afrikaans. We describe how the corpus was designed in order to probe the interaction between the speaker’s language and the origin of the name, and discuss the practical steps that were taken in collecting the spoken utterances. A statistical investigation of the prompt material reveals some of the systematic differences between the languages. | en_US |
| dc.description.sponsorship | This corpus is being developed in collaboration with Jean- Pierre Martens from the University of Ghent (Belgium) and Derik Thirion from Molo Innovations (South Africa). Corpus development is being sponsored by the Department of Arts and Culture of the government of the Republic of South Africa; their support is gratefully acknowledged. | en_US |
| dc.identifier.citation | Oluwapelumi Giwa, Marelie H Davel and Etienne Barnard, “A Southern African corpus for multilingual name pronunciation”, in Proc. Annual Symp. Pattern Recognition Association of South Africa (PRASA), pp 49-53, Vanderbijlpark, South Africa, 2011. [http://engineering.nwu.ac.za/multilingual-speech-technologies-must/publications] | en_US |
| dc.identifier.uri | https://www.researchgate.net/publication/235425724_A_Southern_African_corpus_for_multilingual_name_pronunciation | |
| dc.identifier.uri | http://hdl.handle.net/10394/26543 | |
| dc.language.iso | en | en_US |
| dc.publisher | Pattern Recognition Association of South Africa and Mechatronics International Conference | en_US |
| dc.subject | Multilingual name pronunciation | en_US |
| dc.subject | Predicting the pronunciations of proper names | en_US |
| dc.subject | Applications of speech technology | en_US |
| dc.title | A Southern African corpus for multilingual name pronunciation | en_US |
| dc.type | Presentation | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- giwa-2011-southern-african-corpus.pdf
- Size:
- 74.03 KB
- Format:
- Adobe Portable Document Format
- Description:
- giwa-2011-southern-african-corpus
License bundle
1 - 1 of 1
Loading...
- Name:
- license.txt
- Size:
- 1.61 KB
- Format:
- Item-specific license agreed upon to submission
- Description:
