<< Back

Development of a neural machine translation model optimized with BERT for translation from Quechua to Spanish (#1636)

Read Article

Date of Conference

July 17-19, 2024

Published In

"Sustainable Engineering for a Diverse, Equitable, and Inclusive Future at the Service of Education, Research, and Industry for a Society 5.0."

Location of Conference

Costa Rica

Authors

Sulla Torres, José Alfredo

Cueva Medina, Beatrice

Tuco Casquino, Gabriel Fabrizio

Abstract

Quechua, a Native American language spoken by over 3 million people in Peru, plays a significant cultural role but is at risk of decline due to limited resources and the dominance of Spanish. This paper proposes a Quechua-to-Spanish neural machine translation (NMT) model using a Transformer-based architecture and a semi-supervised approach known as LMfusion. The model is trained on parallel datasets, and PRPE morphological segmentation is employed during preprocessing. Initial results show promise, and integrating the QuBERT language model is expected to enhance translation quality. Additionally, a user-friendly web interface has been developed to facilitate Quechua-Spanish translation. This research aims to address the challenges of translating a low-resource language like Quechua and contribute to improved communication between Quechua and Spanish speakers, preserving cultural heritage and facilitating equitable access to information and services.

Read Article