Localising the Mozilla Common Voice platform for South Africa’s official languages
DOI:
https://doi.org/10.55492/dhasa.v4i01.4437
Keywords:
under-resourced languages, speech resources, Mozilla Common Voice
Abstract
Despite many attempts to address the situation, South Africa's official languages remain under-resourced in terms of the text and speech data required to implement state-of-the-art language technology. To ensure that no language is left behind, resource development should remain a priority until a strong digital presence has been established for all indigenous languages. This paper provides an overview of previous projects that were specifically aimed at speech resource development and introduces an ongoing initiative to launch South Africa's languages on the Mozilla Common Voice platform.
Published
2023-01-25
Issue
Section
Articles
License
Copyright (c) 2023 Febe de Wet, Andiswa Bukula, Willem Karsten, Martin Puttkammer, Erwin Schillack, Roné Wierenga, Roald Eiselen
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
How to Cite
Localising the Mozilla Common Voice platform for South Africa’s official languages. (2023). Journal of the Digital Humanities Association of Southern Africa, 4(01). https://doi.org/10.55492/dhasa.v4i01.4437