☆41Jan 14, 2022Updated 4 years ago
Alternatives and similar repositories for wav2vec2-service
Users that are interested in wav2vec2-service are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- A live speech recognition using Facebooks wav2vec 2.0 model.☆378Feb 4, 2024Updated 2 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 28, 2026Updated 2 months ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- ☆11Nov 5, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools☆471Sep 20, 2023Updated 2 years ago
- Word Error Rate Estimation☆16Aug 25, 2020Updated 5 years ago
- Phonetically-Oriented Word Error Rate☆36May 4, 2019Updated 7 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆53Dec 6, 2022Updated 3 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆346May 15, 2024Updated last year
- ☆17Apr 14, 2023Updated 3 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Aug 31, 2022Updated 3 years ago
- A fast and lightweight python-based CTC beam search decoder for speech recognition.☆469Jul 13, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This is a legacy repo. Dev occurs now on GitHub.☆11Mar 28, 2021Updated 5 years ago
- [DEPRECATED] A knowledge distillation toolkit based on PyTorch and PyTorch Lightning.☆138Feb 20, 2024Updated 2 years ago
- ☆57Apr 18, 2023Updated 3 years ago
- ☆15Mar 25, 2024Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated 2 years ago
- speech to text with self-supervised learning based on wav2vec 2.0 framework☆380Nov 22, 2021Updated 4 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆155May 2, 2024Updated 2 years ago
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…☆12Jun 22, 2022Updated 3 years ago
- (re)Implementation of Learning Multi-level Dependencies for Robust Word Recognition☆17Jul 25, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☕🇧🇷 Scripts para o Kaldi em Português Brasileiro☆59May 26, 2022Updated 3 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆91Feb 18, 2025Updated last year
- Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP.☆17Aug 8, 2021Updated 4 years ago
- Dataset Catalogue Homepage for Indonesian Languages☆10Feb 19, 2024Updated 2 years ago
- Punctuation and casing restoration for the Russian Language (BERT-based)☆24Jan 7, 2022Updated 4 years ago
- ☆18Mar 13, 2024Updated 2 years ago
- Prosodic Speech Segmentation with Transformers☆27Feb 25, 2024Updated 2 years ago
- Python library for audio augmentation☆85Jul 6, 2023Updated 2 years ago
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.☆18Nov 13, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official code for Wav2Seq☆97Jul 19, 2022Updated 3 years ago
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/…☆39Jul 31, 2025Updated 9 months ago
- ☆18Apr 28, 2021Updated 5 years ago
- Baseline kaldi script for UA-SPEECH corpus☆32Oct 16, 2024Updated last year
- ☆27Jan 27, 2021Updated 5 years ago
- ☆13Aug 7, 2021Updated 4 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Dec 24, 2022Updated 3 years ago