aismlv / zindi-ai4d-wolofLinks

4th place solution to Zindi's low-resource automatic speech recognition competition

☆8

Alternatives and similar repositories for zindi-ai4d-wolof

Users that are interested in zindi-ai4d-wolof are comparing it to the libraries listed below

Sorting:

kingabzpro / WOLOF-ASR-Wav2Vec2
Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.
☆17Updated 3 years ago
kssteven418 / Squeezeformer
[NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
☆252Updated 2 years ago
Open-Speech-EkStep / vakyansh-wav2vec2-experimentation
Repository containing experimentation platform on how to train, infer on wav2vec2 models.
☆87Updated 2 years ago
Open-Speech-EkStep / indic-punct
☆43Updated 2 years ago
patrickvonplaten / Wav2Vec2_PyCTCDecode
Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode
☆111Updated 2 years ago
revdotcom / speech-datasets
Various speech datasets made available to the public
☆122Updated 6 months ago
upskyy / Squeezeformer
PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)
☆142Updated 2 years ago
thevasudevgupta / gsoc-wav2vec2
GSoC'2021 | TensorFlow implementation of Wav2Vec2
☆90Updated 3 years ago
chuachinhon / wav2vec2_transformers
Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…
☆32Updated 4 years ago
farisalasmary / wav2vec2-kenlm
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
☆75Updated 3 years ago
german-asr / megs
A merged version of multiple open-source German speech datasets.
☆31Updated last year
masakhane-io / masakhane-pos
POS for African languages
☆17Updated this week
kensho-technologies / pyctcdecode
A fast and lightweight python-based CTC beam search decoder for speech recognition.
☆446Updated last year
Open-Speech-EkStep / ULCA-asr-dataset-corpus
☆46Updated 2 years ago
lorenlugosch / transducer-tutorial
Example code for a neural transducer model.
☆62Updated last year
burchim / EfficientConformer
[ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition
☆216Updated 2 years ago
navana-tech / baseline_recipe_is21s_indic_asr_challenge
Multilingual and code-switching ASR challenges for low resource Indian languages.
☆20Updated 3 years ago
Edresson / Wav2Vec-Wrapper
An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.
☆82Updated 2 years ago
besacier / AMMIcourse
☆42Updated 3 years ago
BUTSpeechFIT / BrnoLM
A neural language modeling toolkit built on PyTorch
☆18Updated 2 years ago
harvard-edge / multilingual_kws
Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus
☆174Updated 6 months ago
anton-l / wav2vec-toolkit
A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models
☆31Updated 4 years ago
ccoreilly / wav2vec2-service
☆38Updated 3 years ago
ymoslem / Arabisc
Context-Sensitive Neural Spelling Checker
☆20Updated 9 months ago
awsaf49 / audio_classification_models
Tensorflow Audio Classification Models
☆12Updated last year
ducanhdt / openai_whisper_finetuning
☆49Updated 2 years ago
alpoktem / bible2speechDB
Scripts to create speech corpora from open.bible
☆13Updated 3 years ago
attilanagy234 / neural-punctuator
Complimentary code for our paper Automatic punctuation restoration with BERT models
☆50Updated last year
m3hrdadfi / soxan
Wav2Vec for speech recognition, classification, and audio classification
☆263Updated 3 years ago
msalhab96 / RNN-Transducer
PyTorch implementation of Sequence Transduction with Recurrent Neural Networks (RNN-T) speech recognition paper
☆12Updated 3 years ago