aismlv / zindi-ai4d-wolof
4th place solution to Zindi's low-resource automatic speech recognition competition
☆8 · Updated 3 years ago
Alternatives and similar repositories for zindi-ai4d-wolof:
Users who are interested in zindi-ai4d-wolof are comparing it to the repositories listed below
- Audio preprocessing and fine-tuning of the wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF data. ☆17 · Updated 3 years ago
- Knowledge distillation of wav2vec2 (from huggingface) ☆9 · Updated 3 years ago
- Repository containing an experimentation platform for training and running inference on wav2vec2 models. ☆86 · Updated 2 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs… ☆31 · Updated 3 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode ☆111 · Updated 2 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition ☆27 · Updated 4 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding ☆72 · Updated 3 years ago
- A merged version of multiple open-source German speech datasets. ☆31 · Updated 8 months ago
- Example code for a neural transducer model. ☆61 · Updated 11 months ago
- Speech to Text with self-supervised learning based on wav2vec 2.0 framework using Hugging Face's Transformer ☆30 · Updated 3 years ago
- Scripts to create speech corpora from open.bible ☆12 · Updated 3 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages. ☆13 · Updated last year
- Wav2Vec for speech recognition, classification, and audio classification ☆253 · Updated 2 years ago
- [DEPRECATED] A knowledge distillation toolkit based on PyTorch and PyTorch Lightning. ☆139 · Updated 10 months ago
- Complementary code for our paper Automatic punctuation restoration with BERT models ☆48 · Updated last year
- TensorFlow Audio Classification Models ☆12 · Updated last year
- A PyPI package for fast word/character error rate (WER/CER) calculation ☆69 · Updated last year
- [NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition ☆246 · Updated last year
- Curate online Wolof text resources that can be used to build models ☆23 · Updated 6 months ago
- ASRecognition: just an easy-to-use library for Automatic Speech Recognition. ☆51 · Updated last year
- PyTorch implementation of Sequence Transduction with Recurrent Neural Networks (RNN-T) speech recognition paper ☆11 · Updated 2 years ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation ☆37 · Updated last year
- Various speech datasets made available to the public ☆107 · Updated last month
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆81 · Updated last year
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages. ☆98 · Updated 8 months ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus ☆170 · Updated last month