Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode
☆111 · Aug 31, 2022 · Updated 3 years ago
Alternatives and similar repositories for Wav2Vec2_PyCTCDecode
Users that are interested in Wav2Vec2_PyCTCDecode are comparing it to the libraries listed below.
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding ☆75 · Oct 11, 2021 · Updated 4 years ago
- ☆11 · Nov 5, 2021 · Updated 4 years ago
- A fast and lightweight python-based CTC beam search decoder for speech recognition. ☆468 · Jul 13, 2023 · Updated 2 years ago
- Repository for speech paper reading ☆33 · Aug 19, 2021 · Updated 4 years ago
- Segment an audio file and obtain utterance alignments. (Python package) ☆347 · May 15, 2024 · Updated 2 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆80 · May 20, 2023 · Updated 3 years ago
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra ☆16 · Dec 10, 2024 · Updated last year
- Speech to text with self-supervised learning based on wav2vec 2.0 framework ☆380 · Nov 22, 2021 · Updated 4 years ago
- In this repository, I try to combine k2 with speechbrain to decode accurately and quickly. ☆16 · Jun 17, 2022 · Updated 3 years ago
- Live speech recognition using Facebook's wav2vec 2.0 model. ☆379 · Feb 4, 2024 · Updated 2 years ago
- ☆357 · Mar 17, 2024 · Updated 2 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2 ☆116 · May 20, 2019 · Updated 7 years ago
- ☆10 · Sep 19, 2022 · Updated 3 years ago
- Adnabod lleferydd Cymraeg i'r Gymraeg gyda HuggingFace // Speech Recognition for Welsh with HuggingFace ☆13 · Nov 29, 2022 · Updated 3 years ago
- ☆205 · Feb 22, 2022 · Updated 4 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch ☆81 · Dec 4, 2022 · Updated 3 years ago
- ☆41 · Jan 14, 2022 · Updated 4 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models ☆30 · Apr 21, 2021 · Updated 5 years ago
- ASR text preprocessing utility ☆21 · Aug 5, 2024 · Updated last year
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax. ☆39 · Feb 23, 2023 · Updated 3 years ago
- Wav2Vec2 finetune and inference code for IITP AI Grand Challenge ☆36 · Feb 22, 2022 · Updated 4 years ago
- PyTorch implementation of RNN-Transducer(RNN-T). ☆81 · May 6, 2021 · Updated 5 years ago
- ☆12 · Jan 15, 2015 · Updated 11 years ago
- The History of Speech Recognition to the Year 2030 ☆13 · Aug 14, 2021 · Updated 4 years ago
- PyTorch CTC Decoder bindings ☆858 · Apr 4, 2024 · Updated 2 years ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra. ☆50 · May 19, 2021 · Updated 5 years ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation ☆573 · Apr 2, 2023 · Updated 3 years ago
- Taiwanese Translation with BERT based model and RNN. Collection of Taiwanese text corpus ☆13 · Oct 15, 2022 · Updated 3 years ago
- ☆184 · May 26, 2023 · Updated 3 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion. ☆371 · Oct 12, 2021 · Updated 4 years ago
- HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools