Real-Time Whisper Voice Recognition with vosk model feedback.
☆120Jun 30, 2023Updated 2 years ago
Alternatives and similar repositories for vosper
Users that are interested in vosper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/☆833Sep 12, 2025Updated 8 months ago
- Streaming transcriber with whisper☆696May 1, 2023Updated 3 years ago
- A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.☆359Jul 20, 2025Updated 10 months ago
- A quick experiment to achieve almost realtime transcription using Whisper.☆185Sep 22, 2022Updated 3 years ago
- plugin manager for OpenVoiceOS , STT/TTS/Wakewords that can be used anywhere☆14May 19, 2026Updated last week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆18Apr 28, 2021Updated 5 years ago
- Implementation of vocoders empowered with pytorch lightning☆18Jan 27, 2024Updated 2 years ago
- React hook for OpenAI Whisper with speech recorder, real-time transcription, and silence removal built-in☆786Apr 30, 2024Updated 2 years ago
- Real time speech to text transcription app.☆439Jan 14, 2023Updated 3 years ago
- Real time transcription with OpenAI Whisper.☆2,938Apr 15, 2025Updated last year
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- Thin wrapper around OpenAI Whisper API with streaming support☆84Dec 5, 2025Updated 5 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆92Aug 28, 2023Updated 2 years ago
- Voice memos recorded from the microphone, transcribed offline to text and converted to Joplin notes☆29Mar 1, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A free & open tool for transcribing audio interviews with offline ASR support☆25Dec 21, 2023Updated 2 years ago
- A library for adding punctuation into a text from ASR.☆19May 8, 2023Updated 3 years ago
- Python library to write, read, and verify transparency metadata in audio files for AI transparency compliance.☆18Aug 17, 2025Updated 9 months ago
- A nearly-live implementation of OpenAI's Whisper.☆4,031May 15, 2026Updated last week
- Speaker diarization service☆27Feb 24, 2026Updated 3 months ago
- Keyword Spotting for detecting a word in an audio file☆17Jul 21, 2019Updated 6 years ago
- An interface for llama.cpp, ChatGPT, Gemini, Claude, and Kimi☆29Updated this week
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.☆289Dec 30, 2025Updated 4 months ago
- ☆15Aug 25, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- PolEval 2021 Task 1☆15Jun 28, 2022Updated 3 years ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆72Dec 30, 2023Updated 2 years ago
- Listen, transcribe, reply - Voice Assistant using OpenAI & ElevenLabs API's☆14Jun 24, 2023Updated 2 years ago
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆21Jun 6, 2025Updated 11 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆120Feb 4, 2023Updated 3 years ago
- A collection of all our phonemeizers for dataset construction and inference☆30Feb 21, 2025Updated last year
- Official source for Catalan Language Models and resources made within Aina project.☆26Jul 28, 2023Updated 2 years ago
- ☆27Nov 3, 2025Updated 6 months ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆14Mar 15, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Live transcription with OpenAi Whisper☆50Nov 11, 2022Updated 3 years ago
- Simple diarization model☆54Jun 13, 2025Updated 11 months ago
- ☆11Aug 24, 2022Updated 3 years ago
- ☆15Jan 27, 2023Updated 3 years ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆160Jul 15, 2024Updated last year
- Phonotate.App is a local, open-source Electron app built with React designed to simplify creating training data for StyleTTS 2 and voice …☆11Jan 17, 2025Updated last year
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆21May 20, 2025Updated last year