alphacep / vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
☆13,778 · Updated last week
Alternatives and similar repositories for vosk-api
Users that are interested in vosk-api are comparing it to the libraries listed below
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries ☆1,206 · Updated 4 months ago
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector ☆7,573 · Updated last week
- 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy. ☆2,542 · Updated last year
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime… ☆9,221 · Updated this week
- eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents. ☆5,904 · Updated 3 weeks ago
- Silero Models: pre-trained text-to-speech models made embarrassingly simple ☆5,655 · Updated last week
- kaldi-asr/kaldi is the official location of the Kaldi project. ☆15,260 · Updated 2 months ago
- Offline speech recognition for Android with Vosk library. ☆978 · Updated last week
- A small speech recognizer ☆4,233 · Updated 2 weeks ago
- VOSK Speech Recognition Toolkit ☆483 · Updated 3 years ago
- A PyTorch-based Speech Toolkit ☆10,915 · Updated last week
- Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, … ☆1,576 · Updated last month
- Faster Whisper transcription with CTranslate2 ☆19,376 · Updated 3 weeks ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ☆43,837 · Updated last year
- Offline Text To Speech synthesis for python ☆2,452 · Updated 3 weeks ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) ☆19,123 · Updated last month
- On-device wake word detection powered by deep learning ☆4,541 · Updated this week
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ☆2,041 · Updated this week
- A fast, local neural text to speech system ☆10,334 · Updated 3 months ago
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker… ☆8,838 · Updated this week
- Open-Source Large Vocabulary Continuous Speech Recognition Engine ☆1,922 · Updated 6 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation ☆3,482 · Updated last month
- Python library and CLI tool to interface with Google Translate's text-to-speech API ☆2,560 · Updated 3 weeks ago
- https://hf.co/hexgrad/Kokoro-82M ☆5,054 · Updated 4 months ago
- Python interface to the WebRTC Voice Activity Detector ☆2,411 · Updated last year
- Examples of how to use or integrate DeepSpeech ☆856 · Updated 2 years ago
- An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity. ☆1,580 · Updated last month
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key ☆9,511 · Updated last week
- End-to-End Speech Processing Toolkit ☆9,632 · Updated last week
- Recurrent neural network for audio noise reduction ☆5,201 · Updated 9 months ago