Comparing grapheme-based and phoneme-based speech recognition for Afrikaans
Abstract
This paper compares the recognition accuracy of a phoneme-based automatic speech recognition system with that of a grapheme-based system, using Afrikaans as case study. The first system is developed using a conventional pronunciation dictionary, while the latter system uses the letters of each word directly as the acoustic units to be modelled. We ensure that the pronunciation dictionary we use is highly accurate and then investigate the extent to which ASR performance degrades when the dictionary is removed. We analyse this effect at different data set sizes and classify the causes of performance degradation. With grapheme-based ASR outperforming phoneme-based ASR in certain word categories, we find that relative error rates are highly dependent on word category, which points towards strategies for compensating for grapheme-based inaccuracies
URI
http://hdl.handle.net/10394/12122https://www.researchgate.net/publication/235425731_Comparing_grapheme-based_and_phoneme-based_speech_recognition_for_Afrikaans?channel=doi&linkId=0a85e537dcdbf2cb85000000&showFulltext=true