ogunlao / yoruba_speech_project
This repo contains 3 hours of audio speech recordings in Yoruba language collected for research purposes.
β16Updated 4 years ago
Alternatives and similar repositories for yoruba_speech_project:
Users that are interested in yoruba_speech_project are comparing it to the libraries listed below
- β43Updated 2 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.β86Updated 2 years ago
- π― Speech Recognition Challenge by Speech Lab - IIT Madrasβ11Updated 4 years ago
- scipts for working with open.bible dataβ24Updated 3 years ago
- β46Updated 2 years ago
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.β17Updated 3 years ago
- Scripts to create speech corpora from open.bibleβ13Updated 3 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.β13Updated last year
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipelineβ32Updated 2 years ago
- β17Updated 3 years ago
- Repo & Project for the Imminent Research Grant code & tasksβ10Updated 11 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.β27Updated last year
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2β85Updated last year
- Text to Speech for Indic languagesβ50Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.β82Updated last year
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.β104Updated last year
- ASRecognition: just an easy-to-use library for Automatic Speech Recognition.β51Updated 2 years ago
- Word Error Rate Estimationβ13Updated 4 years ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languagesβ73Updated 2 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IWβ¦β17Updated 2 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecodeβ111Updated 2 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.β12Updated 2 years ago
- YorΓΉbΓ‘ language training text for NLP, ASR and TTS tasksβ76Updated 2 years ago
- A merged version of multiple open-source German speech datasets.β31Updated 11 months ago
- β42Updated 7 years ago
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) togetherβ47Updated 2 years ago
- phone inventory libraryβ16Updated last year
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downsβ¦β31Updated 4 years ago
- Zero-shot Audio Classification using Whisperβ80Updated 2 years ago
- Various speech datasets made available to the publicβ116Updated 4 months ago