csikasote / BembaSpeech
This is an ASR corpus for the Bemba language. It contains read speech from diverse publicly available Bemba sources: literature books, radio/TV show transcripts, YouTube video transcripts, and online sources. The corpus has 14,438 utterances, amounting to over 24 hours of speech.
☆37Jul 31, 2025Updated 6 months ago
Alternatives and similar repositories for BembaSpeech
Users that are interested in BembaSpeech are comparing it to the libraries listed below
- Repository for multilingual speech data resources for native languages of Zambia.☆20Oct 9, 2024Updated last year
- Scripts to create speech corpora from open.bible☆13Jan 3, 2022Updated 4 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆41Dec 18, 2020Updated 5 years ago
- phone inventory library☆17May 15, 2023Updated 2 years ago
- A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.☆18Apr 29, 2024Updated last year
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆113Apr 26, 2024Updated last year
- Creating super-parallel corpora of more than 1500 unique languages for NLP research☆34Dec 8, 2022Updated 3 years ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github…☆14Dec 19, 2022Updated 3 years ago
- SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects☆23Jan 26, 2025Updated last year
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)☆23Nov 12, 2025Updated 3 months ago
- Curate online Wolof text resources that can be used to build models☆27Jan 15, 2026Updated last month
- VoxAngeles Corpus☆13Aug 23, 2025Updated 5 months ago
- An R package for implementing and evaluating Maximum Entropy Optimality Theory models☆10Jan 28, 2026Updated 2 weeks ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- ☆10Mar 20, 2021Updated 4 years ago
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆12Mar 15, 2025Updated 11 months ago
- Grapheme to phoneme converter for Estonian☆14May 27, 2021Updated 4 years ago
- Neural based model for automatic diacritics restoration.☆25Nov 13, 2018Updated 7 years ago
- Repo & Project for the Imminent Research Grant code & tasks☆12May 20, 2024Updated last year
- Evaluation of STT models for the German language☆15Jan 22, 2022Updated 4 years ago
- Simple Kaldi recipe for forced alignment☆11Jul 16, 2023Updated 2 years ago
- This repo contains the baseline model recipes and pre-trained model for the GramVanni Hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- ☆14May 24, 2022Updated 3 years ago
- Speech recognition module for Python, supporting several engines and APIs, online and offline.☆13Mar 9, 2022Updated 3 years ago
- Thai Grapheme to Phoneme (G2P) Wiktionary Corpus☆13Jul 25, 2022Updated 3 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 9 months ago
- A repository containing links to useful phonological software☆12Feb 16, 2023Updated 3 years ago
- Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994)☆13Oct 8, 2020Updated 5 years ago
- Sequence algorithms for use in Flashlight.☆14Jan 12, 2026Updated last month
- Whisper Speech Quality Assessment (WhiSQA)☆16Oct 14, 2025Updated 4 months ago
- A repository for trainable multi-speaker TTS☆14Nov 28, 2021Updated 4 years ago
- ☆56Dec 19, 2022Updated 3 years ago
- Deepspeech ASR Model for the Catalan Language☆17Feb 15, 2021Updated 5 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …☆16Sep 5, 2017Updated 8 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- ☆19Sep 20, 2024Updated last year
- ☆15Aug 25, 2022Updated 3 years ago
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆21Nov 1, 2024Updated last year
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆17Mar 11, 2022Updated 3 years ago