☆40Jan 14, 2022Updated 4 years ago
Alternatives and similar repositories for wav2vec2-service
Users that are interested in wav2vec2-service are comparing it to the libraries listed below
Sorting:
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…☆12Jun 22, 2022Updated 3 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- A fast and lightweight python-based CTC beam search decoder for speech recognition.☆469Jul 13, 2023Updated 2 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Aug 31, 2022Updated 3 years ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- This is a legacy repo. Dev occurs now on GitHub.☆11Mar 28, 2021Updated 4 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Updated this week
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆53Dec 6, 2022Updated 3 years ago
- ☆57Apr 18, 2023Updated 2 years ago
- ☕🇧🇷 Scripts para o Kaldi em Português Brasileiro☆57May 26, 2022Updated 3 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- A live speech recognition using Facebooks wav2vec 2.0 model.☆377Feb 4, 2024Updated 2 years ago
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/…☆38Jul 31, 2025Updated 7 months ago
- Word Error Rate Estimation☆16Aug 25, 2020Updated 5 years ago
- Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP.☆17Aug 8, 2021Updated 4 years ago
- ☆15Mar 25, 2024Updated last year
- HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools☆469Sep 20, 2023Updated 2 years ago
- speech to text with self-supervised learning based on wav2vec 2.0 framework☆379Nov 22, 2021Updated 4 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 2 years ago
- ☆18Mar 13, 2024Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆155May 2, 2024Updated last year
- Phonetically-Oriented Word Error Rate☆36May 4, 2019Updated 6 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆345May 15, 2024Updated last year
- ☆17Apr 14, 2023Updated 2 years ago
- (re)Implementation of Learning Multi-level Dependencies for Robust Word Recognition☆17Jul 25, 2024Updated last year
- Speech Emotion Recognition using PyTorch sponsored by AIS and VISTEC-DEPA AIResearch Institute Thailand.☆22Nov 6, 2021Updated 4 years ago
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.☆18Nov 13, 2021Updated 4 years ago
- Punctuation and casing restoration for the Russian Language (BERT-based)☆24Jan 7, 2022Updated 4 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆91Feb 18, 2025Updated last year
- Tool to make high quality text to speech (tts) corpus from audio + text books.☆28Jul 31, 2025Updated 7 months ago
- ☆59Apr 11, 2023Updated 2 years ago
- General tools for voice analysis.☆25Jul 30, 2025Updated 7 months ago
- Model for recasing and repunctuating ASR transcripts☆143Apr 10, 2024Updated last year
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆23Aug 16, 2021Updated 4 years ago
- ☆22Jul 8, 2021Updated 4 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated 2 years ago
- ☆56Dec 19, 2022Updated 3 years ago
- Official code for Wav2Seq☆97Jul 19, 2022Updated 3 years ago