sskorol / vosk-api-gpu
Vosk ASR Docker images with GPU for Jetson boards, PCs, M1 laptops and GPC
☆41Updated 2 years ago
Alternatives and similar repositories for vosk-api-gpu:
Users that are interested in vosk-api-gpu are comparing it to the libraries listed below
- Model for recasing and repunctuating ASR transcripts☆133Updated 10 months ago
- ☆38Updated 3 years ago
- Text To Speech Synthesis with Vosk☆159Updated 3 weeks ago
- ☆22Updated 3 years ago
- Punctuation and casing restoration for the Russian Language (BERT-based)☆20Updated 3 years ago
- A simple Python wrapper for audio noise reduction RNNoise. Simplifies work with it, adds new trained models and detailed instructions for…☆153Updated 8 months ago
- 🇷🇺 Punctuation restoration production-ready model for Russian language 🇷🇺☆58Updated 3 years ago
- How to create your own model for vosk☆70Updated 3 years ago
- Modified version of RusStress (https://github.com/MashaPo/russtress) — python package for placing stress in Russian text using RNN (BiLST…☆31Updated 6 months ago
- ☆43Updated last week
- Normalize Text in Russian☆26Updated last year
- ☆30Updated 3 years ago
- Speech analytics package for call-center☆22Updated 4 years ago
- Tacotron2 + Waveglow Russian☆43Updated 5 years ago
- a repository for trainabale tts multi speaker☆14Updated 3 years ago
- ☆11Updated 3 years ago
- Проект для перевода чисел, записанных в текстовом виде на русском языке.☆11Updated 2 years ago
- Simple WFST for Ukrainian ITN based on NVIDIA NeMo and Pynini☆19Updated 2 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆81Updated last year
- Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks☆118Updated 3 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆145Updated 9 months ago
- ☆34Updated 5 months ago
- Accentor and transcriptor for Russian language☆122Updated 2 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆303Updated 3 months ago
- the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTT…☆50Updated 3 years ago
- ☆23Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- ☆39Updated last year
- A live speech recognition using Facebooks wav2vec 2.0 model.☆341Updated last year
- python wrapper for rnnoise library☆46Updated 2 years ago