ccoreilly/wav2vec2-service

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ccoreilly/wav2vec2-service)

ccoreilly / wav2vec2-service

☆41

Alternatives and similar repositories for wav2vec2-service

Users that are interested in wav2vec2-service are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

talhanai / kaldi-diar-latte
View on GitHub
steps to perform text-based speaker diarization with kaldi toolkit
☆12Nov 2, 2018Updated 7 years ago
pkufool / simple-wer
View on GitHub
A simple command line tool to calculate WER for ASR.
☆14Oct 14, 2024Updated last year
bagustris / s3prl-ser
View on GitHub
S3PRL for Speech Emotion Recognition (see s3prl > downstream)
☆15Feb 28, 2026Updated 4 months ago
oliverguhr / wav2vec2-live
View on GitHub
A live speech recognition using Facebooks wav2vec 2.0 model.
☆378Feb 4, 2024Updated 2 years ago
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
View on GitHub
☆11Nov 5, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
jonatasgrosman / huggingsound
View on GitHub
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
☆470Sep 20, 2023Updated 2 years ago
qcri / e-wer
View on GitHub
Word Error Rate Estimation
☆16Aug 25, 2020Updated 5 years ago
vectominist / MiniASR
View on GitHub
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
☆53Dec 6, 2022Updated 3 years ago
NickRuiz / power-asr
View on GitHub
Phonetically-Oriented Word Error Rate
☆36May 4, 2019Updated 7 years ago
sarahjuan / iban
View on GitHub
☆14Jun 12, 2015Updated 11 years ago
lumaku / ctc-segmentation
View on GitHub
Segment an audio file and obtain utterance alignments. (Python package)
☆348May 15, 2024Updated 2 years ago
alumae / streaming-punctuator
View on GitHub
☆17Apr 14, 2023Updated 3 years ago
kensho-technologies / pyctcdecode
View on GitHub
A fast and lightweight python-based CTC beam search decoder for speech recognition.
☆469Jul 13, 2023Updated 3 years ago
patrickvonplaten / Wav2Vec2_PyCTCDecode
View on GitHub
Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode
☆110Aug 31, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
falabrasil / gitlab-resources
View on GitHub
This is a legacy repo. Dev occurs now on GitHub.
☆11Mar 28, 2021Updated 5 years ago
georgian-io / Knowledge-Distillation-Toolkit
View on GitHub
[DEPRECATED] A knowledge distillation toolkit based on PyTorch and PyTorch Lightning.
☆138Feb 20, 2024Updated 2 years ago
alumae / kiirkirjutaja
View on GitHub
☆58Jul 3, 2026Updated 3 weeks ago
cvqluu / simple_diarizer
View on GitHub
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
☆158May 2, 2024Updated 2 years ago
mailong25 / self-supervised-speech-recognition
View on GitHub
speech to text with self-supervised learning based on wav2vec 2.0 framework
☆380Nov 22, 2021Updated 4 years ago
egorsmkv / asr-corpus-creator
View on GitHub
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27Feb 15, 2024Updated 2 years ago
nilc-nlp / CORAA
View on GitHub
☆64Apr 11, 2023Updated 3 years ago
RemiRigal / snreval-python
View on GitHub
This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…
☆12Jun 22, 2022Updated 4 years ago
eatsleepraverepeat / reMUDE
View on GitHub
(re)Implementation of Learning Multi-level Dependencies for Robust Word Recognition
☆17Jul 25, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
falabrasil / kaldi-br
View on GitHub
☕🇧🇷 Scripts para o Kaldi em Português Brasileiro
☆60May 26, 2022Updated 4 years ago
nvidia-riva / riva-asrlib-decoder
View on GitHub
Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva
☆91Feb 18, 2025Updated last year
mzarvandi / SER-wav2vec
View on GitHub
Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP.
☆17Aug 8, 2021Updated 4 years ago
IndoNLP / nusa-catalogue
View on GitHub
Dataset Catalogue Homepage for Indonesian Languages
☆12Feb 19, 2024Updated 2 years ago
m-wiesner / nnet_pytorch
View on GitHub
Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.
☆26Jul 25, 2024Updated 2 years ago
kumarvc / Fastapi-docker-kubernetes-streamlit
View on GitHub
☆27Jan 27, 2021Updated 5 years ago
Nathan-Roll1 / PSST
View on GitHub
Prosodic Speech Segmentation with Transformers
☆28Feb 25, 2024Updated 2 years ago
tuanct1997 / Federated-Learning-ASR-based-on-wav2vec-2.0
View on GitHub
☆18Mar 13, 2024Updated 2 years ago
asappresearch / wav2seq
View on GitHub
Official code for Wav2Seq
☆97Jul 19, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
kingabzpro / WOLOF-ASR-Wav2Vec2
View on GitHub
Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.
☆18Nov 13, 2021Updated 4 years ago
csikasote / BembaSpeech
View on GitHub
This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/…
☆41Jul 31, 2025Updated 11 months ago
alxmamaev / ultimate_tts
View on GitHub
☆13Aug 7, 2021Updated 4 years ago
krylm / whisper-event-tuning
View on GitHub
Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.
☆12Dec 24, 2022Updated 3 years ago
Open-Speech-EkStep / data-acquisition-pipeline
View on GitHub
☆18Apr 28, 2021Updated 5 years ago
daanzu / wav2vec2_stt_python
View on GitHub
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…
☆23Aug 16, 2021Updated 4 years ago
Speech-Lab-IITM / Hindi-ASR-Challenge
View on GitHub
🎯 Speech Recognition Challenge by Speech Lab - IIT Madras
☆10Nov 5, 2020Updated 5 years ago