gauthelo / kallaama-speech-dataset
A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.
☆13Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for kallaama-speech-dataset
- phone inventory library☆15Updated last year
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆13Updated last year
- ☆12Updated 8 months ago
- ☆16Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 9 months ago
- Repository for multilingual speech data resources for native languages of Zambia.☆14Updated last month
- ☆10Updated last year
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 3 years ago
- African accented clinical and general domain TTS☆9Updated 5 months ago
- Survey on speech generation work.☆12Updated 11 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- ☆56Updated last year
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆16Updated 8 months ago
- ☆40Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆18Updated 8 months ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆14Updated 2 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆23Updated last year
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆37Updated last year
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/…☆32Updated 5 months ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆15Updated last year
- A merged version of multiple open-source German speech datasets.☆30Updated 6 months ago
- ☆16Updated 2 years ago
- Repository containing the open source code of works published at the FBK MT unit.☆42Updated 4 months ago
- ☆17Updated last year
- Scripts to create speech corpora from open.bible☆12Updated 2 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆12Updated 3 years ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆19Updated 2 months ago
- ☆11Updated 3 years ago
- The project for speech translation☆11Updated last year