ogunlao / yoruba_speech_project
This repo contains 3 hours of audio speech recordings in the Yoruba language, collected for research purposes.
☆18 · Updated 5 years ago
Alternatives and similar repositories for yoruba_speech_project
Users that are interested in yoruba_speech_project are comparing it to the libraries listed below
- ☆45 · Updated 3 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras ☆11 · Updated 5 years ago
- ☆49 · Updated 3 years ago
- Scripts for working with open.bible data ☆26 · Updated 3 years ago
- NPTEL2020: Speech2Text dataset for Indian-English Accent ☆79 · Updated 4 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages. ☆17 · Updated 2 years ago
- phone inventory library ☆17 · Updated 2 years ago
- Scripts to create speech corpora from open.bible ☆13 · Updated 3 years ago
- ☆67 · Updated 6 months ago
- Universal multilingual automatic speech transcription into IPA ☆72 · Updated 10 months ago
- A python package for whisper normalizer ☆71 · Updated 2 months ago
- ☆46 · Updated 8 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models. ☆87 · Updated 3 years ago
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/… ☆36 · Updated 5 months ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆80 · Updated 2 years ago
- ☆56 · Updated 3 years ago
- Text to Speech for Indic languages ☆52 · Updated 3 years ago
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data. ☆18 · Updated 4 years ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level. ☆25 · Updated 3 years ago
- Zero-shot Audio Classification using Whisper ☆79 · Updated 3 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances ☆50 · Updated last year
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages. ☆112 · Updated last year
- Dataset Release for Intent Classification from Speech ☆48 · Updated 10 months ago
- Datasets for turn-taking research ☆17 · Updated 2 years ago
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation ☆151 · Updated last year
- A guide to building language technology in new languages. ☆59 · Updated 3 years ago
- A python package for deep multilingual punctuation prediction. ☆152 · Updated last year
- Yorùbá language training text for NLP, ASR and TTS tasks ☆81 · Updated 2 years ago
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2 ☆101 · Updated 4 months ago
- Behavioral probing of language acquisition models at the lexical and syntactic level ☆17 · Updated 2 years ago