Morphology-based investigation of differences between spoken and written isiZulu

Authors

  • Marais Laurette CSIR
  • Ilana Wilken CSIR

DOI:

https://doi.org/10.55492/dhasa.v3i01.3860

Keywords:

spoken language, written language, voice computing, isiZulu

Abstract

Research attempting to describe and quantify the differences between spoken and written language has been done for languages such as English, but not for isiZulu. In this paper, we present a quantitative investigation into such differences by considering the morphology of tokens in a transcribed spoken isiZulu corpus and a written isiZulu corpus. We use morpheme tags as a proxy for features that typically differ between spoken and written language, and calculate relative differences of the occurrence of specific morpheme tags from analyses produced by ZulMorph, a finite-state morphological analyser for isiZulu. This analysis presents information that could inform the development of voice-enabled computer applications for isiZulu.

Downloads

Published

2022-02-25

How to Cite

Laurette, M., & Wilken, I. (2022). Morphology-based investigation of differences between spoken and written isiZulu. Journal of the Digital Humanities Association of Southern Africa, 3(01). https://doi.org/10.55492/dhasa.v3i01.3860