alphacep / vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
β8,137Updated last week
Related projects β
Alternatives and complementary repositories for vosk-api
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi librariesβ929Updated 2 months ago
- πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.β2,284Updated 8 months ago
- A fast, local neural text to speech systemβ6,584Updated 3 weeks ago
- Silero VAD: pre-trained enterprise-grade Voice Activity Detectorβ4,383Updated last week
- VOSK Speech Recognition Toolkitβ383Updated 2 years ago
- Offline speech recognition for Android with Vosk library.β755Updated 11 months ago
- kaldi-asr/kaldi is the official location of the Kaldi project.β14,298Updated last month
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Rasβ¦β25,366Updated 2 months ago
- Speech recognition module for Python, supporting several engines and APIs, online and offline.β8,445Updated last week
- A small speech recognizerβ3,950Updated last month
- A fast local neural text to speech engine for Mycroftβ1,075Updated 11 months ago
- Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simpleβ4,983Updated last year
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)β12,524Updated 3 months ago
- Open-source offline translation library written in Pythonβ3,915Updated last month
- eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.β4,234Updated 3 weeks ago
- Port of OpenAI's Whisper model in C/C++β35,738Updated this week
- Noise supression using deep filteringβ2,538Updated last month
- Deezer source separation library including pretrained models.β25,941Updated 3 weeks ago
- A PyTorch-based Speech Toolkitβ8,938Updated last week
- Examples of how to use or integrate DeepSpeechβ821Updated last year
- Open-Source Large Vocabulary Continuous Speech Recognition Engineβ1,844Updated 6 months ago
- Faster Whisper transcription with CTranslate2β12,540Updated this week
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ4,962Updated 3 months ago
- πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionβ35,482Updated 3 months ago
- A python package to analyze and compare voices with deep learningβ2,784Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/β7,673Updated 9 months ago
- gentle forced alignerβ1,454Updated 6 months ago
- End-to-End Speech Processing Toolkitβ8,510Updated this week
- OpenAI Whisper ASR Webservice APIβ2,109Updated last month
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speakerβ¦β6,345Updated this week