Lelapa NLP - Lesotho ASR Task

End-to-end ASR evaluation for code-switched Sesotho-English speech with benchmarking, error analysis, and reproducible reporting.

This is a Product Research Scientist task project focused on automatic speech recognition (ASR) for code-switched Sesotho-English data.

Highlights:

  • Benchmarking across Whisper and Wav2Vec2 pipelines.
  • Error slicing and analysis to identify model failure patterns.
  • Reproducible notebook and script workflow for rapid evaluation.
  • Practical recommendations for data, decoding, and deployment improvements.
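
The benchmarking and error-slicing highlights above can be illustrated with a minimal sketch of word error rate (WER) computed per slice. This is not the repository's actual code: the function names (edit_distance, wer, wer_by_slice), the slice labels, and the toy transcripts are all assumptions chosen for illustration, and real evaluation would normally also apply text normalization before scoring.

```python
from collections import defaultdict


def edit_distance(ref, hyp):
    """Word-level Levenshtein distance between two token lists."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def wer(pairs):
    """Corpus-level WER: total word edits divided by total reference words."""
    edits = sum(edit_distance(r.split(), h.split()) for r, h in pairs)
    words = sum(len(r.split()) for r, _ in pairs)
    return edits / words if words else 0.0


def wer_by_slice(records):
    """Group (slice_label, reference, hypothesis) records and score each slice."""
    groups = defaultdict(list)
    for label, ref, hyp in records:
        groups[label].append((ref, hyp))
    return {label: wer(pairs) for label, pairs in groups.items()}


# Hypothetical placeholder transcripts (not real data); labels are assumptions.
records = [
    ("code_switched", "a b c d", "a x c"),
    ("monolingual", "a b c", "a b c"),
]
scores = wer_by_slice(records)
print(scores)
```

Scoring each slice separately, rather than reporting a single corpus-wide number, is one way to see whether errors concentrate in code-switched utterances. The same grouping function could be reused with other hypothetical labels such as speaker or audio duration bucket.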

Repository: Lelapa NLP - Lesotho ASR Task