Multi-Lingual Contextual Audio Transcription and Translation to Selected Native Languages.

  • Olubusola Olufunke Nuga
  • , Kamoli Akinwale Amusa
  • , Ayorinde Joseph Olanipekun
  • , Sikiru O. Ismail
  • , Samuel Blessing Owusu
  • , Joshua Oluwatimileyin Salako
  • , Sikirulahi Opeyemi Abdulkareem
  • , Ahmed Oluwafemi Lawal
  • , Peter Olamide Onabanjo

Research output: Contribution to conferencePaperpeer-review

Abstract

This paper presents a contextual audio transcription and multilingual translation system designed for native languages. Utilizing advanced technologies such as AssemblyAI for accurate audio-to-text conversion, BERTopic for contextual topic modeling, and OpenAI’s API for indigenous language translation, the system demonstrates exceptional performance. AssemblyAI achieves low word error rates (WER), outperforming other transcription models. BERTopic effectively extracts meaningful topics, surpassing traditional models like Latent Dirichlet Allocation (LDA) in coherence and interpretability. The translation component, powered by GPT-4, produces accurate and contextually appropriate translations with low perplexity scores. This integrated approach bridges communication gaps in multilingual and multicultural contexts, offering a valuable tool for educators, professionals, and content creators to promote inclusivity and digital accessibility. While challenges with noisy data and language diversity remain, this system highlights the effectiveness of combining transcription, topic modeling, and translation tasks.
Original languageEnglish
Pages1-6
Number of pages6
Publication statusPublished - 20 Sept 2025
Event2025 2nd Asia Pacific Conference on Innovation in Technology (APCIT) - Mysore, India
Duration: 19 Sept 202520 Sept 2025

Conference

Conference2025 2nd Asia Pacific Conference on Innovation in Technology (APCIT)
Country/TerritoryIndia
Period19/09/2520/09/25

Fingerprint

Dive into the research topics of 'Multi-Lingual Contextual Audio Transcription and Translation to Selected Native Languages.'. Together they form a unique fingerprint.

Cite this