neulab/AfricanVoices

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/neulab/AfricanVoices)

neulab / AfricanVoices

Hosts text-to-speech corpus and speech synthesizers for African languages.

☆19

Alternatives and similar repositories for AfricanVoices

Users that are interested in AfricanVoices are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

unza-speech-lab / zambezi-voice
View on GitHub
Repository for multilingual speech data resources for native languages of Zambia.
☆22Oct 9, 2024Updated last year
Ashesi-Org / Financial-Inclusion-Speech-Dataset
View on GitHub
A speech dataset to support financial inclusion created by Ashesi University and Nokwary Technologies with funding from Lacuna Fund.
☆15Updated this week
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
chdh / klatt-syn-app
View on GitHub
GUI applikation for the Klatt formant synthesizer package
☆13Jun 26, 2026Updated last month
milkymap / pdf2gpt-index
View on GitHub
build gpt-index using chatgpt and sentence-transformers
☆14Apr 8, 2023Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
gauthelo / kallaama-speech-dataset
View on GitHub
A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.
☆20Mar 26, 2026Updated 4 months ago
coqui-ai / open-bible-scripts
View on GitHub
scipts for working with open.bible data
☆26Jan 24, 2022Updated 4 years ago
masakhane-io / masakhane-pos
View on GitHub
POS for African languages
☆21Jun 25, 2025Updated last year
uds-lsv / afro-maft
View on GitHub
☆17Jan 12, 2023Updated 3 years ago
idiap / IdiapTTS
View on GitHub
A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis
☆23Dec 31, 2021Updated 4 years ago
Mister-iks / ai_suggest_deployment
View on GitHub
AI SUGGEST is a powerful command-line assistant that leverages AI to provide accurate Linux commands based on natural language queries. S…
☆11Aug 22, 2024Updated last year
Flux9665 / ArticulatoryTextFrontend
View on GitHub
This is a text-processing frontend that converts graphemes to phonemes and then further converts those phonemes into articulatory feature…
☆14Sep 23, 2024Updated last year
alirezamshi / small100
View on GitHub
Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E…
☆30Feb 8, 2023Updated 3 years ago
rnd2110 / MorphAGram
View on GitHub
A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars
☆17Jun 14, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
neonbjb / tts-scores
View on GitHub
Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models
☆175Dec 18, 2023Updated 2 years ago
masakhane-io / masakhane-ner
View on GitHub
☆122Oct 15, 2025Updated 9 months ago
Speech-Lab-IITM / Hindi-ASR-Challenge
View on GitHub
🎯 Speech Recognition Challenge by Speech Lab - IIT Madras
☆10Nov 5, 2020Updated 5 years ago
HuPER29 / HuPER
View on GitHub
☆16Mar 19, 2026Updated 4 months ago
neulab / newlang-tech
View on GitHub
A guide to building language technology in new languages.
☆59Feb 1, 2022Updated 4 years ago
masakhane-io / lafand-mt
View on GitHub
MAFAND-MT
☆63Jul 9, 2024Updated 2 years ago
uvsq-info / l1-python
View on GitHub
Un template pour un projet Python
☆38Nov 17, 2024Updated last year
EveryVoiceTTS / EveryVoice
View on GitHub
The EveryVoice TTS Toolkit - Text To Speech for your language
☆43Updated this week
typotheque / syllabics-knowledge
View on GitHub
open source knowledge for Syllabics font design and development
☆10Nov 13, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
DerXter / NumMenu-Bot
View on GitHub
An example of a chatbot with a number-based menu that can be used as a starting point for a project.
☆29Apr 24, 2024Updated 2 years ago
ReadAlongs / SoundSwallower
View on GitHub
An even smaller speech recognizer / force aligner
☆36May 5, 2026Updated 2 months ago
CoEDL / vad-sli-asr
View on GitHub
A pipeline to isolate and transcribe one language in mixed-language speech
☆20Oct 25, 2022Updated 3 years ago
bitextor / warc2text
View on GitHub
Extracts plain text, language identification and more metadata from WARC records
☆23Apr 16, 2026Updated 3 months ago
mzboito / IWSLT2022_Tamasheq_data
View on GitHub
Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…
☆18Nov 30, 2022Updated 3 years ago
roedoejet / convertextract
View on GitHub
Extract and find/replace text based on arbitrary correspondences while preserving original file formatting. This library is a fork from t…
☆11Sep 8, 2023Updated 2 years ago
MicrosoftTranslator / NTREX
View on GitHub
NTREX -- News Test References for MT Evaluation
☆87Jun 5, 2024Updated 2 years ago
Patil-Onkar / Remove-silence-from-an-audio
View on GitHub
☆10Jun 30, 2022Updated 4 years ago
rsprouse / klsyn
View on GitHub
Dennis Klatt's speech synthesis system, updated with a Python interface.
☆31Jun 23, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
tabahi / contexless-phonemes-CUPE
View on GitHub
pytorch model for contexless-phoneme prediction from speech audio
☆32Oct 30, 2025Updated 8 months ago
Bartelds / acoustic-distance-measure
View on GitHub
Acoustic distance measure for comparing pronunciations
☆17Aug 2, 2022Updated 3 years ago
AI-Lab-Makerere / Data4Good
View on GitHub
This repository contains publicly available speech and text data in Luganda.
☆12Sep 4, 2020Updated 5 years ago
mhulden / pyfoma
View on GitHub
Python Finite-State Toolkit
☆68Updated this week
marytts / pavoque-data
View on GitHub
PAVOQUE Corpus of Expressive Speech
☆12Aug 2, 2016Updated 9 years ago
ftyers / commonvoice-utils
View on GitHub
Linguistic processing for Common Voice
☆59Jan 18, 2024Updated 2 years ago
turinaf / Sagalee
View on GitHub
Automatic Speech Recognition Dataset for Oromo Language
☆30Mar 2, 2026Updated 4 months ago