jonatasgrosman/wav2vec2-sprint

jonatasgrosman / wav2vec2-sprint

☆206

Alternatives and similar repositories for wav2vec2-sprint

Users who are interested in wav2vec2-sprint are comparing it to the libraries listed below.

jonatasgrosman / huggingsound
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
☆470 · Sep 20, 2023 · Updated 2 years ago
patrickvonplaten / Wav2Vec2_PyCTCDecode
Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode
☆110 · Aug 31, 2022 · Updated 3 years ago
farisalasmary / wav2vec2-kenlm
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
☆74 · Oct 11, 2021 · Updated 4 years ago
jqueguiner / wav2vec2-sprint
docker for HF wav2vec2-sprint
☆13 · Mar 26, 2021 · Updated 5 years ago
jonatasgrosman / asrecognition
ASRecognition: just an easy-to-use library for Automatic Speech Recognition.
☆51 · Mar 6, 2023 · Updated 3 years ago
voidful / wav2vec2-xlsr-multilingual-56
56 languages, 1 model: multilingual ASR
☆25 · Jul 25, 2021 · Updated 4 years ago
m3hrdadfi / soxan
Wav2Vec for speech recognition, classification, and audio classification
☆276 · Apr 2, 2022 · Updated 4 years ago
kensho-technologies / pyctcdecode
A fast and lightweight python-based CTC beam search decoder for speech recognition.
☆469 · Jul 13, 2023 · Updated 3 years ago
chuachinhon / wav2vec2_transformers
Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…
☆32 · Mar 20, 2021 · Updated 5 years ago
qinyuenlp / wav2vec_finetune
ASR: fine-tune wav2vec 2.0 with transformers
☆21 · Sep 13, 2021 · Updated 4 years ago
anton-l / wav2vec-toolkit
A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models
☆30 · Apr 21, 2021 · Updated 5 years ago
oliverguhr / wav2vec2-live
Live speech recognition using Facebook's wav2vec 2.0 model.
☆378 · Feb 4, 2024 · Updated 2 years ago
facebookresearch / voxpopuli
A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation
☆574 · Apr 2, 2023 · Updated 3 years ago
parambharat / whisper-finetuning
Repository containing code to fine-tune the Whisper ASR model
☆23 · Dec 16, 2022 · Updated 3 years ago
MiuLab / SpokenCSE
Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding
☆11 · May 19, 2023 · Updated 3 years ago
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
☆11 · Nov 5, 2021 · Updated 4 years ago
lumaku / ctc-segmentation
Segment an audio file and obtain utterance alignments. (Python package)
☆348 · May 15, 2024 · Updated 2 years ago
ramizasr21 / comptia-network-n10-009-dumps
Skillcertpro dumps Priced reasonably at around $20, these resources offer excellent value considering the quality, lifetime access, and u…
☆10 · Jul 18, 2024 · Updated 2 years ago
ccoreilly / wav2vec2-service
☆41 · Jan 14, 2022 · Updated 4 years ago
LCF2764 / autoKWS2021_1st_solution
Auto-KWS 2021 Challenge 1st place solution.
☆11 · Jul 20, 2021 · Updated 5 years ago
CODEJIN / PWGAN_for_HiFiSinger
☆11 · Mar 20, 2021 · Updated 5 years ago
RF5 / transfusion-asr
Transcribing Speech with Multinomial Diffusion, training code and models.
☆80 · Sep 27, 2023 · Updated 2 years ago
JazminVidal / gop-ft
Transfer learning approach to pronunciation scoring
☆12 · Jan 17, 2024 · Updated 2 years ago
mailong25 / self-supervised-speech-recognition
speech to text with self-supervised learning based on wav2vec 2.0 framework
☆380 · Nov 22, 2021 · Updated 4 years ago
ccoreilly / wav2vec2-catala
Wav2Vec 2.0 Catalan training scripts and models
☆12 · Jun 18, 2021 · Updated 5 years ago
nhattruongpham / mmser
SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings
☆15 · Jan 23, 2024 · Updated 2 years ago
s3prl / s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
☆2,558 · Mar 12, 2026 · Updated 4 months ago
stnava / sccan
sparse canonical correlation analysis for neuroimaging
☆17 · Oct 16, 2013 · Updated 12 years ago
nkrao220 / accent-classification
Accent Classification in Speech
☆25 · Jul 24, 2019 · Updated 7 years ago
vlomme / AGAIN-MelGan-Voice-Cloning
Russian-English GAN-based vocoder
☆17 · Jun 15, 2021 · Updated 5 years ago
philschmid / transformers-pytorch-text-classification
☆14 · Jan 24, 2022 · Updated 4 years ago
pyannote / DEPRECATED-pyannote-audio-hub
[deprecated] Pretrained models for pyannote-audio 1.x
☆71 · Jul 20, 2022 · Updated 4 years ago
pigzach / MagicSpeechASR
magicspeech competition recipe
☆18 · Jun 29, 2020 · Updated 6 years ago
Slyne / ctc_decoder
A CTC decoder for both online and offline ASR models
☆66 · Nov 18, 2023 · Updated 2 years ago
b04901014 / FT-w2v2-ser
Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition
☆152 · Oct 26, 2021 · Updated 4 years ago
nilc-nlp / DNLT-BP
Datasets of Neuropsychological Language Tests in Brazilian Portuguese
☆14 · Oct 14, 2025 · Updated 9 months ago
sarulab-speech / jtubespeech
☆233 · Nov 13, 2023 · Updated 2 years ago
khanld / ASR-Wav2vec-Finetune
Finetune Wav2vec 2.0 For Speech Recognition
☆149 · Feb 6, 2025 · Updated last year
iver56 / torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
☆1,161 · Nov 24, 2025 · Updated 8 months ago