This is an ASR corpus for the Bemba language. It contains read speech from diverse publicly available Bemba sources: literature books, radio/TV show transcripts, YouTube video transcripts, and online sources. The corpus has 14,438 utterances totaling over 24 hours of speech.
☆38 · Jul 31, 2025 · Updated 8 months ago
Alternatives and similar repositories for BembaSpeech
Users that are interested in BembaSpeech are comparing it to the libraries listed below.
- Repository for multilingual speech data resources for native languages of Zambia. ☆20 · Oct 9, 2024 · Updated last year
- Scripts to create speech corpora from open.bible ☆13 · Jan 3, 2022 · Updated 4 years ago
- SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects ☆23 · Jan 26, 2025 · Updated last year
- A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund. ☆19 · Mar 26, 2026 · Updated 3 weeks ago
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition) ☆23 · Nov 12, 2025 · Updated 5 months ago
- ☆15 · May 24, 2022 · Updated 3 years ago
- An R package for implementing and evaluating Maximum Entropy Optimality Theory models ☆10 · Feb 24, 2026 · Updated last month
- A repository containing links to useful phonological software ☆12 · Feb 16, 2023 · Updated 3 years ago
- MasakhaNEWS: News Topic Classification for African Languages ☆26 · May 12, 2024 · Updated last year
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech ☆11 · May 14, 2025 · Updated 11 months ago
- Read in a 'Praat' 'TextGrid' File ☆17 · Oct 28, 2025 · Updated 5 months ago
- Curate online Wolof text resources that can be used to build models ☆28 · Mar 7, 2026 · Updated last month
- Scripts for working with open.bible data ☆26 · Jan 24, 2022 · Updated 4 years ago
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database. ☆12 · Mar 15, 2025 · Updated last year
- phone inventory library ☆17 · May 15, 2023 · Updated 2 years ago
- MAFAND-MT ☆62 · Jul 9, 2024 · Updated last year
- Repo & Project for the Imminent Research Grant code & tasks ☆12 · May 20, 2024 · Updated last year
- Whisper Speech Quality Assessment (WhiSQA) ☆16 · Apr 3, 2026 · Updated 2 weeks ago
- Working towards a free acoustic model for the automatic recognition of New Zealand English ☆19 · Aug 17, 2012 · Updated 13 years ago
- How to detect language and translate text data into the language of your choice when working on an NLP project ☆11 · Jan 13, 2021 · Updated 5 years ago
- CMU multilingual speech repository ☆30 · Apr 15, 2022 · Updated 4 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד) ☆11 · May 4, 2022 · Updated 3 years ago
- This repo contains the baseline model recipes and pre-trained model for the GramVanni Hindi ASR challenge ☆15 · Mar 26, 2022 · Updated 4 years ago
- [TASLP 2024] Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation ☆31 · Sep 6, 2024 · Updated last year
- ☆11 · Jul 12, 2021 · Updated 4 years ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa. ☆43 · Oct 13, 2022 · Updated 3 years ago
- A Simple Flask App to interact with your Machine Translation Model ☆13 · Feb 26, 2020 · Updated 6 years ago
- SyPhon: Constraint-based Learning of Phonological Rules ☆11 · Mar 5, 2025 · Updated last year
- A streamlit app that creates a web demo of the project: https://github.com/bryandlee/animegan2-pytorch ☆12 · Apr 6, 2022 · Updated 4 years ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github… ☆15 · Dec 19, 2022 · Updated 3 years ago
- Hausa-NMT: Empirical Study of Neural Machine translation for English-Hausa-English ☆17 · Oct 20, 2020 · Updated 5 years ago
- Introduction to the Random Forest algorithm for classification problems and how to select important features in your dataset. ☆12 · Aug 1, 2020 · Updated 5 years ago
- Grapheme to phoneme converter for Estonian ☆14 · May 27, 2021 · Updated 4 years ago
- asr2k ☆52 · Jun 2, 2024 · Updated last year
- A Python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based … ☆16 · Sep 5, 2017 · Updated 8 years ago
- ☆57 · Dec 19, 2022 · Updated 3 years ago
- COMET for African languages ☆11 · Jan 24, 2025 · Updated last year
- Use this package to compute language entropy from language history data. ☆10 · Jul 27, 2020 · Updated 5 years ago
- ☆10 · Mar 20, 2021 · Updated 5 years ago