Waxal-Multilingual/speech-data

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Waxal-Multilingual/speech-data)

Waxal-Multilingual / speech-data

This repository contains multi-modal speech data for African languages that can be used to train ASR and NLP models

☆17

Alternatives and similar repositories for speech-data

Users that are interested in speech-data are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

milkymap / pdf2gpt-index
View on GitHub
build gpt-index using chatgpt and sentence-transformers
☆14Apr 8, 2023Updated 3 years ago
lodjim / wolof-subtitle-generator
View on GitHub
wolof-subtiles-generator permet de générer des sous-titres en wolof pour des fichiers audio et de créer des vidéos avec les sous-titres i…
☆31Aug 27, 2023Updated 2 years ago
lodjim / naboo-email
View on GitHub
☆12Nov 9, 2025Updated 8 months ago
Galsenaicommunity / Wolof-TTS
View on GitHub
Hub des projets initiés par GalsenAI
☆18Jun 15, 2025Updated last year
PapiHack / gai-demo
View on GitHub
Demo project of my talk for Galsen AI Dakar community on How to Deploy & Scale AI (ML or DL) Models With Kubernetes
☆10Jul 15, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
neulab / AfricanVoices
View on GitHub
Hosts text-to-speech corpus and speech synthesizers for African languages.
☆19May 31, 2023Updated 3 years ago
GalsenDev221 / made.sn.ai
View on GitHub
This is a collection of AI tools built by Senegalese developers that can be used by anyone all over the world 🤖🌍
☆19Apr 12, 2023Updated 3 years ago
uds-lsv / afro-maft
View on GitHub
☆17Jan 12, 2023Updated 3 years ago
DannyNemer / aang
View on GitHub
Superior, precise, scalable NLU for creating natural language interfaces.
☆20Oct 17, 2022Updated 3 years ago
Mister-iks / ai_suggest_deployment
View on GitHub
AI SUGGEST is a powerful command-line assistant that leverages AI to provide accurate Linux commands based on natural language queries. S…
☆11Aug 22, 2024Updated last year
alirezamshi / small100
View on GitHub
Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E…
☆30Feb 8, 2023Updated 3 years ago
orbitturner / orbitnextframework
View on GitHub
This is a Funny Easy Simple Lighweight Senegalese PHP Framework that have been made to help Nebies and Pro devs to code in a different wa…
☆12Aug 2, 2020Updated 5 years ago
abdouaziz / wolof
View on GitHub
Wolof is a library that you can use to do specific tasks in NLP with the Wolof language e.g. text classification in Wolof , NMT , ASR
☆32Nov 28, 2023Updated 2 years ago
masakhane-io / masakhane-pos
View on GitHub
POS for African languages
☆21Jun 25, 2025Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
masakhane-io / masakhane-ner
View on GitHub
☆122Oct 15, 2025Updated 9 months ago
DerXter / NumMenu-Bot
View on GitHub
An example of a chatbot with a number-based menu that can be used as a starting point for a project.
☆29Apr 24, 2024Updated 2 years ago
AIAnytime / SLIM-Models-by-LLMWare
View on GitHub
SLIM Models by LLMWare. A streamlit app showing the capabilities for AI Agents and Function Calls.
☆21Feb 11, 2024Updated 2 years ago
peter-kimanzi / xmen
View on GitHub
☆11Aug 28, 2024Updated last year
DerXter / Repertoire-d-institutions-data-science-au-S-n-gal
View on GitHub
Ceci est une liste d'institutions (Établissements de formations, Entreprises et ONG) présentes au Sénégal où il est possible de se former…
☆53Mar 5, 2026Updated 4 months ago
Patil-Onkar / Remove-silence-from-an-audio
View on GitHub
☆10Jun 30, 2022Updated 4 years ago
may- / joeys2t
View on GitHub
Minimalist Speech-to-Text toolkit for educational purposes
☆13Feb 1, 2024Updated 2 years ago
ksowah / TAXIO-CLIENT
View on GitHub
☆16Oct 28, 2022Updated 3 years ago
bonaventuredossou / MLM_AL
View on GitHub
☆24May 12, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
GalsenDev221 / made.in.senegal
View on GitHub
A platform showcasing tools and solutions crafted by Senegalese developers for a global audience 🌍
☆104Updated this week
Ashesi-Org / Financial-Inclusion-Speech-Dataset
View on GitHub
A speech dataset to support financial inclusion created by Ashesi University and Nokwary Technologies with funding from Lacuna Fund.
☆15Updated this week
radi-cho / RSTOD
View on GitHub
Auxiliary tasks for task-oriented dialogue systems. Published in ICNLSP'22 and indexed in the ACL Anthology.
☆17Feb 27, 2023Updated 3 years ago
getalp / ALFFA_PUBLIC
View on GitHub
☆53Dec 3, 2021Updated 4 years ago
bambadiagne / github-stats
View on GitHub
This application allows you to list GitHub users located in Senegal by ranking and view individual statistics for each user. With this to…
☆14May 30, 2026Updated last month
dialoguetoolkit / chattool
View on GitHub
Dialogue Experimental Toolkit (DiET)
☆19May 26, 2026Updated last month
cambridgeltl / ACL2022_tutorial_multilingual_dialogue
View on GitHub
Materials for "Natural Language Processing for Multilingual Task-Oriented Dialogue" Tutorial at ACL 2022
☆14May 21, 2022Updated 4 years ago
GSNCodes / Image-Classification-Streamlit-TensorFlow
View on GitHub
A basic web-app for image classification using Streamlit and Tensorflow
☆14Dec 11, 2022Updated 3 years ago
swiss-ai / parity-aware-bpe
View on GitHub
Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization [ACL 2026]
☆19Apr 18, 2026Updated 3 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
IdrissaN / Zindi-Hackathons
View on GitHub
The Goal of this repo is to provide the solutions of Zindi Hackathons
☆15Feb 4, 2022Updated 4 years ago
catherinearnett / morphscore
View on GitHub
This is the repository for MorphScore, a tokenizer evaluation framework for morphological alignment.
☆17Jul 10, 2025Updated last year
bonaventuredossou / ffr-v1
View on GitHub
Towards developing a Robust Translation Model for African languages: Pilot Project FFR v1.0.
☆47May 12, 2024Updated 2 years ago
ymoslem / CTranslate-NMT-Web-Interface
View on GitHub
Machine Translation (MT) Web Interface for OpenNMT and FairSeq models using CTranslate and Streamlit
☆15Dec 24, 2021Updated 4 years ago
xieyuankun / ALLM-ADD-FT-GRPO
View on GitHub
☆18Feb 6, 2026Updated 5 months ago
daoodaba975 / sn.youtuber.dev.list
View on GitHub
📺 A curated collection of Senegalese YouTube channels dedicated to development and technology.
☆21Feb 6, 2026Updated 5 months ago
Sidibedev / expo-splashscreen-generator
View on GitHub
This allows you to generate a splashscreen compatible to Expo
☆25May 8, 2022Updated 4 years ago