Multilingual pronunciations of proper names in a Southern African corpus
(Pattern Recognition Association of South Africa (PRASA), 2012)
We present our process for the development and analysis of a multilingual names corpus, called Multipron-split. It is derived from Multipron, a corpus collected in previous work [1], where names and speakers were drawn ...
Improved transition models for cepstral trajectories
(Pattern Recognition Association of South Africa (PRASA), 2012)
We improve on a piece-wise linear model of the trajectories of Mel Frequency Cepstral Coefficients, which are commonly used as features in Automatic Speech Recognition. For this purpose, we have created a very clean ...
Comparing grapheme-based and phoneme-based speech recognition for Afrikaans
(PRASA, 2012)
This paper compares the recognition accuracy of a phoneme-based automatic speech recognition system with that of a grapheme-based system, using Afrikaans as a case study. The first system is developed using a conventional ...
N-gram based language identification of individual words
(PRASA, 2013)
Various factors influence the accuracy with which the language of individual words can be classified using n-grams. We consider a South African text-based language identification (LID) task and experiment with two different ...
Automatic alignment of audiobooks in Afrikaans
(Pattern Recognition Association of South Africa (PRASA), 2012)
This paper reports on the automatic alignment of audiobooks in Afrikaans. An existing Afrikaans pronunciation dictionary and corpus of Afrikaans speech data are used to generate baseline acoustic models. The baseline system ...
The semi-automated creation of stratified speech corpora
(Pattern Recognition Association of South Africa (PRASA), 2013)
Smartphones provide an efficient means for the collection of speech data; however, the quality of the corpora created in this fashion is not predictable. We describe an approach that allows us to post-process and rank ...
Context-dependent modelling of English vowels in Sepedi code-switched speech
(Pattern Recognition Association of South Africa (PRASA), 2012)
When modelling code-switched speech (utterances that contain a mixture of languages), the embedded language often contains phones not found in the matrix language. These are typically dealt with by either extending the ...