Picovoice / leopard
On-device speech-to-text engine powered by deep learning
☆427Updated this week
Related projects: ⓘ
- On-device streaming speech-to-text engine powered by deep learning☆583Updated this week
- On-device Speech-to-Intent engine powered by deep learning☆616Updated 2 weeks ago
- An On-Premises, Streaming Speech Recognition System☆681Updated 2 years ago
- Command-line tools for speech and intent recognition on Linux☆1,088Updated 6 months ago
- Botium Speech Processing☆946Updated 5 months ago
- On-device voice activity detection (VAD) powered by deep learning☆165Updated 2 weeks ago
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time☆335Updated last year
- speech to text benchmark framework☆603Updated 8 months ago
- Pytorch based speech enhancement toolkit.☆329Updated 6 months ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆421Updated 10 months ago
- On-device voice assistant platform powered by deep learning☆565Updated this week
- 🐸STT integration examples☆117Updated last year
- Open tools and data for cloudless automatic speech recognition☆443Updated 3 years ago
- A speech recognition library running in the browser thanks to a WebAssembly build of Vosk☆361Updated 8 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆103Updated last year
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )☆288Updated 3 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆194Updated last month
- Examples of how to use or integrate DeepSpeech☆816Updated last year
- Streaming transcriber with whisper☆685Updated last year
- VOSK Speech Recognition Toolkit☆378Updated 2 years ago
- State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI to…☆407Updated 2 years ago
- End to end text to speech system using gruut and onnx☆823Updated last year
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice …☆496Updated last year
- Gecko - A Tool for Effective Annotation of Human Conversations☆274Updated last year
- A fast local neural text to speech engine for Mycroft☆1,033Updated 9 months ago
- A live speech recognition using Facebooks wav2vec 2.0 model.☆315Updated 7 months ago
- Voice models for Mimic 3 text to speech system☆121Updated 2 months ago
- A real-time transcription project using React and socketio☆144Updated last year
- A quick experiment to achieve almost realtime transcription using Whisper.☆185Updated last year
- A library for real-time voice processing in web browsers☆195Updated last week