patrickvonplaten/Wav2Vec2_PyCTCDecode

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/patrickvonplaten/Wav2Vec2_PyCTCDecode)

patrickvonplaten / Wav2Vec2_PyCTCDecode

Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode

☆110

Alternatives and similar repositories for Wav2Vec2_PyCTCDecode

Users that are interested in Wav2Vec2_PyCTCDecode are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

farisalasmary / wav2vec2-kenlm
View on GitHub
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
☆74Oct 11, 2021Updated 4 years ago
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
View on GitHub
☆11Nov 5, 2021Updated 4 years ago
kensho-technologies / pyctcdecode
View on GitHub
A fast and lightweight python-based CTC beam search decoder for speech recognition.
☆469Jul 13, 2023Updated 3 years ago
speech-paper-reading / speech-paper-reading
View on GitHub
Repository for speech paper reading
☆33Aug 19, 2021Updated 4 years ago
lumaku / ctc-segmentation
View on GitHub
Segment an audio file and obtain utterance alignments. (Python package)
☆348May 15, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Edresson / Wav2Vec-Wrapper
View on GitHub
An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.
☆80May 20, 2023Updated 3 years ago
voidful / MMLM
View on GitHub
Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra
☆16Dec 10, 2024Updated last year
mailong25 / self-supervised-speech-recognition
View on GitHub
speech to text with self-supervised learning based on wav2vec 2.0 framework
☆380Nov 22, 2021Updated 4 years ago
techiaith / docker-huggingface-stt-cy
View on GitHub
Adnabod lleferydd Cymraeg i'r Gymraeg gyda HuggingFace // Speech Recognition for Welsh with HuggingFace
☆13Nov 29, 2022Updated 3 years ago
oliverguhr / wav2vec2-live
View on GitHub
A live speech recognition using Facebooks wav2vec 2.0 model.
☆378Feb 4, 2024Updated 2 years ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
kmario23 / KenLM-training
View on GitHub
Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2
☆116May 20, 2019Updated 7 years ago
jonatasgrosman / wav2vec2-sprint
View on GitHub
☆206Feb 22, 2022Updated 4 years ago
Chung-I / youtube-asr-crawler
View on GitHub
☆10Sep 19, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
anton-l / wav2vec-toolkit
View on GitHub
A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models
☆30Apr 21, 2021Updated 5 years ago
lucidrains / n-grammer-pytorch
View on GitHub
Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch
☆81Dec 4, 2022Updated 3 years ago
ccoreilly / wav2vec2-service
View on GitHub
☆41Jan 14, 2022Updated 4 years ago
voidful / asrp
View on GitHub
ASR text preprocessing utility
☆21Aug 5, 2024Updated last year
upskyy / Paper-Review
View on GitHub
Paper Review about Speech Recognition · NLP
☆10Mar 25, 2021Updated 5 years ago
sanchit-gandhi / seq2seq-speech
View on GitHub
Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.
☆39Feb 23, 2023Updated 3 years ago
voithru / wav2vec2_finetune
View on GitHub
Wav2Vec2 finetune and inference code for IITP AI Grand Challenge
☆36Feb 22, 2022Updated 4 years ago
sooftware / RNN-Transducer
View on GitHub
PyTorch implementation of RNN-Transducer(RNN-T).
☆81May 6, 2021Updated 5 years ago
smatthewenglish / trst
View on GitHub
☆12Jan 15, 2015Updated 11 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
awni / future_speech
View on GitHub
The History of Speech Recognition to the Year 2030
☆13Aug 14, 2021Updated 4 years ago
parlance / ctcdecode
View on GitHub
PyTorch CTC Decoder bindings
☆860Apr 4, 2024Updated 2 years ago
sooftware / lightning-asr
View on GitHub
Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.
☆50May 19, 2021Updated 5 years ago
ga642381 / Taiwanese-Translation
View on GitHub
Taiwanese Translation with BERT based model and RNN. Collection of Taiwanese text corpus
☆13Oct 15, 2022Updated 3 years ago
m-wiesner / nnet_pytorch
View on GitHub
Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.
☆26Jul 25, 2024Updated last year
facebookresearch / voxpopuli
View on GitHub
A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation
☆574Apr 2, 2023Updated 3 years ago
google-research / longt5
View on GitHub
☆183May 26, 2023Updated 3 years ago
jonatasgrosman / huggingsound
View on GitHub
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
☆470Sep 20, 2023Updated 2 years ago
axelspringer / DeepPhonemizer
View on GitHub
Grapheme to phoneme conversion with deep learning.
☆432Dec 8, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
facebookresearch / CPC_audio
View on GitHub
An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
☆374Oct 12, 2021Updated 4 years ago
pohanchi / AALBERT
View on GitHub
The official repository for Audio ALBERT
☆68Jan 21, 2022Updated 4 years ago
jasonppy / word-discovery
View on GitHub
Word Discovery in Visually Grounded, Self-Supervised Speech Models
☆27Dec 4, 2023Updated 2 years ago
sooftware / End-to-End-Speech-Recognition-Models
View on GitHub
PyTorch implementation of automatic speech recognition models.
☆38Jan 10, 2021Updated 5 years ago
voidful / nlp2go
View on GitHub
🏃 hosting nlp models in one line
☆20May 8, 2024Updated 2 years ago
upskyy / Transformer-Transducer
View on GitHub
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…
☆114Feb 27, 2022Updated 4 years ago
SpeechColab / GigaSpeech
View on GitHub
Large, modern dataset for speech recognition
☆731Feb 26, 2024Updated 2 years ago