ogunlao / yoruba_speech_projectLinks
This repo contains 3 hours of audio speech recordings in Yoruba language collected for research purposes.
β18Updated 4 years ago
Alternatives and similar repositories for yoruba_speech_project
Users that are interested in yoruba_speech_project are comparing it to the libraries listed below
Sorting:
- π― Speech Recognition Challenge by Speech Lab - IIT Madrasβ11Updated 4 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.β17Updated 2 years ago
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.β17Updated 3 years ago
- NPTEL2020: Speech2Text dataset for Indian-English Accentβ78Updated 3 years ago
- Scripts to create speech corpora from open.bibleβ13Updated 3 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.β87Updated 2 years ago
- β47Updated 2 years ago
- scipts for working with open.bible dataβ24Updated 3 years ago
- β43Updated 2 years ago
- β43Updated 7 years ago
- Repo & Project for the Imminent Research Grant code & tasksβ12Updated last year
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.β82Updated 2 years ago
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2β86Updated last year
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.β106Updated last year
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecodeβ111Updated 2 years ago
- A merged version of multiple open-source German speech datasets.β31Updated last year
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/β¦β35Updated 6 months ago
- Linguistic processing for Common Voiceβ55Updated last year
- IPA tokeniserβ18Updated last year
- phone inventory libraryβ16Updated 2 years ago
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipelineβ32Updated 2 years ago
- πAn easy-to-use package to restore punctuation of the text.β116Updated 2 years ago
- Rescoring methods for end-to-end Automatic Speech Recognitionβ27Updated 4 years ago
- Various speech datasets made available to the publicβ123Updated 7 months ago
- Text to Speech for Indic languagesβ51Updated 3 years ago
- asr2kβ51Updated last year
- β56Updated 2 years ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>β19Updated 3 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downsβ¦β32Updated 4 years ago
- Country-level Arabic dialect identification (17 Arabic countries)β47Updated 5 years ago