An Enhancement of African Low-Resource Corpora with NLP IgboT5

Authors

  • Jacinta Chioma Odirichukwu Department of Computer Science, Federal University of Technology, Owerri, Imo State, Nigeria
  • Reginald Nnadozie Nnamdi Department of Philosophy, Veritas University, Abuja, FCT, Nigeria
  • Simon Peter Chimaobi Odirichukwu Department of Health, Primary Health Development Agency, Owerri, Imo State, Nigeria

Keywords:

T5, Igbo Dataset, NLP, Transformer Model, IgboT5

Abstract

This paper adopts the Text-to-Text Transfer Transformer (T5) for Natural Language Processing (NLP) tasks in the Igbo language. The resulting model, IgboT5, enhances the previous digital Igbo Thesaurus through the creation of a high-quality Igbo dataset. The paper fine-tunes a multilingual T5 model and evaluates it on four tasks: definition generation, paraphrasing, translation, and context completion. The work contributes to the advancement of low-resource African languages and opens the door to future NLP applications.
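
As a rough illustration of the text-to-text framing that T5 relies on, the sketch below casts examples from the four tasks named in the abstract into prefixed input and target strings. The prefix strings, the helper name to_text_pair, and the sample record are assumptions made for illustration; the abstract does not state the exact format used for IgboT5.

```python
# Hypothetical task prefixes; the abstract does not specify the strings used.
TASK_PREFIXES = {
    "definition": "define: ",
    "paraphrase": "paraphrase: ",
    "translation": "translate English to Igbo: ",
    "completion": "complete: ",
}


def to_text_pair(task, source, target):
    """Cast one labelled example into a (model input, model target) string pair."""
    if task not in TASK_PREFIXES:
        raise ValueError(f"unknown task: {task}")
    # Every task becomes plain text in, plain text out; only the prefix differs.
    return TASK_PREFIXES[task] + source.strip(), target.strip()


# Illustrative record only, not taken from the paper's dataset.
model_input, model_target = to_text_pair("translation", "water", "mmiri")
print(model_input)
print(model_target)
```

Because every task shares the same string-to-string shape, a single multilingual T5 checkpoint could in principle be fine-tuned on a mixture of such pairs rather than on one task at a time.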

Published

2026-06-10