Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition
☆23Aug 16, 2021Updated 4 years ago
Alternatives and similar repositories for wav2vec2_stt_python
Users that are interested in wav2vec2_stt_python are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆21Jan 24, 2022Updated 4 years ago
- Grapheme to phoneme model for PyTorch☆43Jul 21, 2022Updated 3 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆50May 19, 2021Updated 4 years ago
- Baseline convolutional ASR system in PyTorch☆21Nov 16, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Speech in Flax/JAX☆15Jul 11, 2022Updated 3 years ago
- Coqui Inference Engine☆41Aug 3, 2021Updated 4 years ago
- Deepspeech ASR Model for the Catalan Language☆17Feb 15, 2021Updated 5 years ago
- ☆13Aug 7, 2021Updated 4 years ago
- Repository for speech paper reading☆33Aug 19, 2021Updated 4 years ago
- An echo cancellation library for browsers using DTLN-aec☆26Oct 18, 2023Updated 2 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Dec 8, 2022Updated 3 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- Read-only unofficial mirror of the OpenGrm Thrax Grammar Development Tools☆16May 2, 2019Updated 6 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆11Oct 3, 2021Updated 4 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆91Feb 18, 2025Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 5 years ago
- ☆13Oct 27, 2021Updated 4 years ago
- This is project for korean auto spacing☆12Aug 3, 2020Updated 5 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch☆15Feb 13, 2022Updated 4 years ago
- Korean text data preprocess toolkit for NLP☆18Jun 11, 2019Updated 6 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- baikal.ai's pre-trained BERT models: descriptions and sample codes☆12Jun 24, 2021Updated 4 years ago
- AI model designed to test the effectiveness in handling external ethical attacks.☆11Feb 9, 2026Updated 2 months ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Python library for handling audio datasets.☆138Jul 6, 2023Updated 2 years ago
- neural network based speaker embedder☆25Jan 7, 2023Updated 3 years ago
- Deploy KoGPT with Triton Inference Server☆14Nov 18, 2022Updated 3 years ago
- AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech☆20Jun 24, 2022Updated 3 years ago
- Compute useful transcriptions metrics (CER, WER, SER, ...)☆27Nov 20, 2014Updated 11 years ago
- (re)Implementation of Learning Multi-level Dependencies for Robust Word Recognition☆17Jul 25, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Smart Language Model☆46Dec 21, 2022Updated 3 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- ☆10Mar 20, 2021Updated 5 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64May 30, 2023Updated 2 years ago
- Baidu's CTC Decoders, including Greedy, Beam Search and Beam Search with KenLM Language Model☆24Oct 28, 2023Updated 2 years ago
- Toolkit for training/adapting CMU Sphinx acoustic models☆17May 25, 2018Updated 7 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago