oliverguhr/wav2vec2-live

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/oliverguhr/wav2vec2-live)

oliverguhr / wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

☆378

Alternatives and similar repositories for wav2vec2-live

Users that are interested in wav2vec2-live are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Edresson / Wav2Vec-Wrapper
View on GitHub
An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.
☆80May 20, 2023Updated 3 years ago
chuachinhon / wav2vec2_transformers
View on GitHub
Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…
☆32Mar 20, 2021Updated 5 years ago
ccoreilly / wav2vec2-service
View on GitHub
☆41Jan 14, 2022Updated 4 years ago
patrickvonplaten / Wav2Vec2_PyCTCDecode
View on GitHub
Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode
☆110Aug 31, 2022Updated 3 years ago
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
View on GitHub
☆11Nov 5, 2021Updated 4 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
kensho-technologies / pyctcdecode
View on GitHub
A fast and lightweight python-based CTC beam search decoder for speech recognition.
☆469Jul 13, 2023Updated 3 years ago
farisalasmary / wav2vec2-kenlm
View on GitHub
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
☆74Oct 11, 2021Updated 4 years ago
tarun-bisht / wav2vec2-asr
View on GitHub
wav2vec2 asr with transformers
☆16Oct 26, 2021Updated 4 years ago
jonatasgrosman / huggingsound
View on GitHub
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
☆470Sep 20, 2023Updated 2 years ago
lumaku / ctc-segmentation
View on GitHub
Segment an audio file and obtain utterance alignments. (Python package)
☆348May 15, 2024Updated 2 years ago
anton-l / wav2vec-toolkit
View on GitHub
A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models
☆30Apr 21, 2021Updated 5 years ago
mailong25 / self-supervised-speech-recognition
View on GitHub
speech to text with self-supervised learning based on wav2vec 2.0 framework
☆380Nov 22, 2021Updated 4 years ago
m3hrdadfi / soxan
View on GitHub
Wav2Vec for speech recognition, classification, and audio classification
☆276Apr 2, 2022Updated 4 years ago
Speech-Lab-IITM / CCC-wav2vec-2.0
View on GitHub
Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…
☆23Mar 18, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
View on GitHub
Using YouTube to prepare a speech recognition dataset for any language
☆10Mar 30, 2021Updated 5 years ago
voidful / SpeechMix
View on GitHub
Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together
☆46Jul 3, 2025Updated last year
vietai / ASR
View on GitHub
End-to-End Vietnamese Speech Recognition using wav2vec 2.0
☆106Sep 3, 2021Updated 4 years ago
asappresearch / wav2seq
View on GitHub
Official code for Wav2Seq
☆97Jul 19, 2022Updated 4 years ago
sinhat98 / adapter-wavlm
View on GitHub
☆46Feb 16, 2023Updated 3 years ago
vectominist / MiniASR
View on GitHub
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
☆53Dec 6, 2022Updated 3 years ago
georgian-io / Knowledge-Distillation-Toolkit
View on GitHub
[DEPRECATED] A knowledge distillation toolkit based on PyTorch and PyTorch Lightning.
☆138Feb 20, 2024Updated 2 years ago
asappresearch / sew
View on GitHub
☆77Oct 25, 2021Updated 4 years ago
jonatasgrosman / wav2vec2-sprint
View on GitHub
☆206Feb 22, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Chung-I / youtube-asr-crawler
View on GitHub
☆10Sep 19, 2022Updated 3 years ago
jefflai108 / Semi-Supervsied-Spoken-Language-Understanding-PyTorch
View on GitHub
Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining
☆12Mar 23, 2021Updated 5 years ago
jonatasgrosman / asrecognition
View on GitHub
ASRecognition: just an easy-to-use library for Automatic Speech Recognition.
☆51Mar 6, 2023Updated 3 years ago
krylm / whisper-event-tuning
View on GitHub
Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.
☆12Dec 24, 2022Updated 3 years ago
bhattbhavesh91 / wav2vec2-huggingface-demo
View on GitHub
Speech to Text with self-supervised learning based on wav2vec 2.0 framework using Hugging Face's Transformer
☆29Jun 1, 2021Updated 5 years ago
sooftware / lightning-asr
View on GitHub
Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.
☆50May 19, 2021Updated 5 years ago
AlanBaade / SyllableLM
View on GitHub
Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models
☆63Jul 1, 2025Updated last year
shirayu / whispering
View on GitHub
Streaming transcriber with whisper
☆696May 1, 2023Updated 3 years ago
ttop32 / wav2vec2-live-japanese-translator
View on GitHub
real time japanese speech recognition translator using wav2vec2
☆39Jul 25, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
wangyu09 / exkaldi-rt
View on GitHub
An online speech recognition extension toolkit of Kaldi
☆55Jun 23, 2021Updated 5 years ago
harvard-edge / multilingual_kws
View on GitHub
Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus
☆190Dec 6, 2024Updated last year
jumon / whisper-finetuning
View on GitHub
[WIP] Scripts for fine-tuning Whisper
☆221Jul 2, 2026Updated 3 weeks ago
facebookresearch / CPC_audio
View on GitHub
An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
☆374Oct 12, 2021Updated 4 years ago
facebookresearch / voxpopuli
View on GitHub
A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation
☆574Apr 2, 2023Updated 3 years ago
egorsmkv / asr-corpus-creator
View on GitHub
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27Feb 15, 2024Updated 2 years ago
Felflare / rpunct
View on GitHub
📝An easy-to-use package to restore punctuation of the text.
☆120Apr 5, 2023Updated 3 years ago