Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode
☆111Aug 31, 2022Updated 3 years ago
Alternatives and similar repositories for Wav2Vec2_PyCTCDecode
Users that are interested in Wav2Vec2_PyCTCDecode are comparing it to the libraries listed below
Sorting:
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Oct 11, 2021Updated 4 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- A fast and lightweight python-based CTC beam search decoder for speech recognition.☆468Jul 13, 2023Updated 2 years ago
- Repository for speech paper reading☆33Aug 19, 2021Updated 4 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- ☆10Sep 19, 2022Updated 3 years ago
- speech to text with self-supervised learning based on wav2vec 2.0 framework☆379Nov 22, 2021Updated 4 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆345May 15, 2024Updated last year
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra☆16Dec 10, 2024Updated last year
- A live speech recognition using Facebooks wav2vec 2.0 model.☆377Feb 4, 2024Updated 2 years ago
- PyTorch implementation of RNN-Transducer(RNN-T).☆81May 6, 2021Updated 4 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80May 20, 2023Updated 2 years ago
- ☆357Mar 17, 2024Updated last year
- Wav2Vec2 finetune and inference code for IITP AI Grand Challenge☆36Feb 22, 2022Updated 4 years ago
- PyTorch implementation of automatic speech recognition models.☆38Jan 10, 2021Updated 5 years ago
- ☆40Jan 14, 2022Updated 4 years ago
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆21Aug 9, 2023Updated 2 years ago
- PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…☆112Feb 27, 2022Updated 4 years ago
- Paper Review about Speech Recognition · NLP☆10Mar 25, 2021Updated 4 years ago
- ☆13Sep 25, 2024Updated last year
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 2 years ago
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆106Mar 25, 2023Updated 2 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Oct 8, 2021Updated 4 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆30Apr 21, 2021Updated 4 years ago
- Grapheme to phoneme conversion with deep learning.☆420Dec 8, 2023Updated 2 years ago
- ☆184May 26, 2023Updated 2 years ago
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆18Jun 24, 2022Updated 3 years ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆50May 19, 2021Updated 4 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆115May 20, 2019Updated 6 years ago
- HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools☆469Sep 20, 2023Updated 2 years ago
- The official repository for Audio ALBERT☆67Jan 21, 2022Updated 4 years ago
- Deploy KoGPT with Triton Inference Server☆14Nov 18, 2022Updated 3 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- Transformer based ASR Engine.☆13Aug 23, 2021Updated 4 years ago
- End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.☆10Jan 21, 2022Updated 4 years ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆38Feb 23, 2023Updated 3 years ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆565Apr 2, 2023Updated 2 years ago